This is a modern-English version of The Foundations of Science: Science and Hypothesis, The Value of Science, Science and Method, originally written by Poincaré, Henri. It has been thoroughly updated, including changes to sentence structure, words, spelling, and grammar—to ensure clarity for contemporary readers, while preserving the original spirit and nuance. If you click on a paragraph, you will see the original text that we modified, and you can toggle between the two versions.

Scroll to the bottom of this page and you will find a free ePUB download link for this book.


SCIENCE AND EDUCATION

A SERIES OF VOLUMES FOR THE PROMOTION OF
SCIENTIFIC RESEARCH AND EDUCATIONAL PROGRESS

Edited by J. McKEEN CATTELL

SCIENCE AND EDUCATION

A SERIES OF VOLUMES FOR THE PROMOTION OF
SCIENTIFIC RESEARCH AND EDUCATIONAL PROGRESS

Edited by J. McKEEN CATTELL

 

 

VOLUME I—THE FOUNDATIONS OF SCIENCE

VOLUME I—THE BASICS OF SCIENCE

 

 

UNDER THE SAME EDITORSHIP

UNDER THE SAME EDITORSHIP


SCIENCE AND EDUCATION. A series of volumes for the promotion of scientific research and educational progress.

SCIENCE AND EDUCATION. A collection of volumes aimed at advancing scientific research and educational development.

    Volume I. The Foundations of Science. By H. Poincaré. Containing the authorised English translation by George Bruce Halsted of "Science and Hypothesis," "The Value of Science," and "Science and Method."

Volume I. The Foundations of Science. By H. Poincaré. Containing the official English translation by George Bruce Halsted of "Science and Hypothesis," "The Value of Science," and "Science and Method."

    Volume II. Medical Research and Education. By Richard Mills Pearce, William H. Welch, W. H. Howell, Franklin P. Mall, Lewellys F. Barker, Charles S. Minot, W. B. Cannon, W. T. Councilman Theobald Smith, G. N. Stewart, C. M. Jackson, E. P. Lyon, James B. Herrick, John M. Dodson, C. R. Bardeen, W. Ophuls, S. J. Meltzer, James Ewing, W. W. Keen, Henry H. Donaldson, Christian A. Herter, and Henry P. Bowditch.

Volume II. Medical Research and Education. By Richard Mills Pearce, William H. Welch, W. H. Howell, Franklin P. Mall, Lewellys F. Barker, Charles S. Minot, W. B. Cannon, W. T. Councilman, Theobald Smith, G. N. Stewart, C. M. Jackson, E. P. Lyon, James B. Herrick, John M. Dodson, C. R. Bardeen, W. Ophuls, S. J. Meltzer, James Ewing, W. W. Keen, Henry H. Donaldson, Christian A. Herter, and Henry P. Bowditch.

    Volume III. University Control. By J. McKeen Cattell and other authors.

Volume III. University Control. By J. McKeen Cattell and other authors.

AMERICAN MEN OF SCIENCE. A Biographical Directory.

AMERICAN MEN OF SCIENCE: A Biographical Directory.

SCIENCE. A weekly journal devoted to the advancement of science. The official organ of the American Association for the Advancement of Science.

SCIENCE. A weekly magazine focused on promoting scientific progress. The official publication of the American Association for the Advancement of Science.

THE POPULAR SCIENCE MONTHLY. A monthly magazine devoted to the diffusion of science.

THE POPULAR SCIENCE MONTHLY. A monthly magazine dedicated to spreading knowledge about science.

THE AMERICAN NATURALIST. A monthly journal devoted to the biological sciences, with special reference to the factors of evolution.

THE AMERICAN NATURALIST. A monthly magazine dedicated to the biological sciences, focusing on the factors of evolution.


THE SCIENCE PRESS
NEW YORK        GARRISON, N. Y.

THE SCIENCE PRESS
NEW YORK GARRISON, N. Y.

 

THE FOUNDATIONS
OF SCIENCE

SCIENCE AND HYPOTHESIS
THE VALUE OF SCIENCE
SCIENCE AND METHOD

SCIENCE AND HYPOTHESIS
THE VALUE OF SCIENCE
SCIENCE AND METHOD

 

BY
H. POINCARÉ

 

AUTHORIZED TRANSLATION BY
GEORGE BRUCE HALSTED

AUTHORIZED TRANSLATION BY
GEORGE BRUCE HALSTED

 

WITH A SPECIAL PREFACE BY POINCARÉ, AND AN INTRODUCTION
BY JOSIAH ROYCE, HARVARD UNIVERSITY

WITH A SPECIAL PREFACE BY POINCARÉ, AND AN INTRODUCTION
BY JOSIAH ROYCE, HARVARD UNIVERSITY

 

THE SCIENCE PRESS
NEW YORK AND GARRISON, N. Y.
1913

THE SCIENCE PRESS
NEW YORK AND GARRISON, N. Y.
1913

 

 

Copyright, 1913
BY The Science Press

Copyright, 1913
BY The Science Press

 

 

PRESS OF
THE NEW ERA PRINTING COMPANY
LANCASTER, PA.

PRESS OF
THE NEW ERA PRINTING COMPANY
LANCASTER, PA.


CONTENTS

PAGE
Henri Poincaréix
Author's Preface to the Translation3
SCIENCE AND HYPOTHESIS
Introduction by Royce9
Introduction27
Part I. Number and Magnitude
Chapter 1.—On the Nature of Mathematical Reasoning31
Syllogistic Deduction31
Verification and Proof32
Elements of Arithmetic33
Reasoning by Recurrence37
Induction40
Mathematical Construction41
Chapter 2.—Mathematical Magnitude and Experience43
Definition of Incommensurables44
The Physical Continuum46
Creation of the Mathematical Continuum46
Measurable Magnitude49
Various Remarks (Curves without Tangents)50
The Physical Continuum of Several Dimensions52
The Mathematical Continuum of Several Dimensions53
Part II. Space
Chapter 3.—The Non-Euclidean Geometries55
The Bolyai-Lobachevski Geometry56
Riemann's Geometry57
The Surfaces of Constant Curvature58
Interpretation of Non-Euclidean Geometries59
The Implicit Axioms60
The Fourth Geometry62
Lie's Theorem62
Riemann's Geometries63
On the Nature of Axioms63
Chapter 4.—Space and Geometry66
Geometric Space and Perceptual Space66
Visual Space67
Tactile Space and Motor Space68
Characteristics of Perceptual Space69
Change of State and Change of Position70
Conditions of Compensation72
[Pg vi]Solid Bodies and Geometry72
Law of Homogeneity74
The Non-Euclidean World75
The World of Four Dimensions78
Conclusions79
Chapter 5.—Experience and Geometry81
Geometry and Astronomy81
The Law of Relativity83
Bearing of Experiments86
Supplement (What is a Point?)89
Ancestral Experience91
Part 3. Force
Chapter 6.—The Classic Mechanics92
The Principle of Inertia93
The Law of Acceleration97
Anthropomorphic Mechanics103
The School of the Thread104
Chapter 7.—Relative Motion and Absolute Motion107
The Principle of Relative Motion107
Newton's Argument108
Chapter 8.—Energy and Thermodynamics115
Energetics115
Thermodynamics119
General Conclusions on Part III123
Part 4. Nature
Chapter 9.—Hypotheses in Physics127
The Rôle of Experiment and Generalization127
The Unity of Nature130
The Rôle of Hypothesis133
Origin of Mathematical Physics136
Chapter X.—The Theories of Modern Physics140
Meaning of Physical Theories140
Physics and Mechanism144
Present State of the Science148
Chapter 11.—The Calculus of Probabilities155
Classification of the Problems of Probability158
Probability in Mathematics161
Probability in the Physical Sciences164
Rouge et noir167
The Probability of Causes169
The Theory of Errors170
Conclusions172
Chapter 12.—Optics and Electricity174
Fresnel's Theory174
Maxwell's Theory175
The Mechanical Explanation of Physical Phenomena177
[Pg vii]Chapter 13.—Electrodynamics184
Ampère's Theory184
Closed Currents185
Action of a Closed Current on a Portion of Current186
Continuous Rotations187
Mutual Action of Two Open Currents189
Induction190
Theory of Helmholtz191
Difficulties Raised by these Theories193
Maxwell's Theory193
Rowland's Experiment194
The Theory of Lorentz196
THE VALUE OF SCIENCE
Translator's Introduction201
Does the Scientist Create Science?201
The Mind Dispelling Optical Illusions202
Euclid not Necessary202
Without Hypotheses, no Science203
What Outcome?203
Introduction205
Part I. The Mathematical Sciences
Chapter 1.—Intuition and Logic in Mathematics210
Chapter 2.—The Measure of Time223
Chapter 3.—The Notion of Space235
Qualitative Geometry238
The Physical Continuum of Several Dimensions240
The Notion of Point244
The Notion of Displacement247
Visual Space252
Chapter 4.—Space and its Three Dimensions256
The Group of Displacements256
Identity of Two Points259
Tactile Space264
Identity of the Different Spaces268
Space and Empiricism271
Rôle of the Semicircular Canals276
Part II. The Physical Sciences
Chapter 5.—Analysis and Physics279
Chapter 6.—Astronomy289
Chapter 7.—The History of Mathematical Physics297
The Physics of Central Forces297
The Physics of the Principles299
Chapter 8.—The Present Crisis in Physics303
The New Crisis303
Carnot's Principle303
[Pg viii]The Principle of Relativity305
Newton's Principle308
Lavoisier's Principle310
Mayer's Principle312
Chapter 9.—The Future of Mathematical Physics314
The Principles and Experiment314
The Rôle of the Analyst314
Aberration and Astronomy315
Electrons and Spectra316
Conventions preceding Experiment317
Future Mathematical Physics319
Part 3. The Objective Value of Science
Chapter X.—Is Science Artificial?321
The Philosophy of LeRoy321
Science, Rule of Action323
The Crude Fact and the Scientific Fact325
Nominalism and the Universal Invariant333
Chapter 11.—Science and Reality340
Contingence and Determinism340
Objectivity of Science347
The Rotation of the Earth353
Science for Its Own Sake354
SCIENCE AND METHOD
Introduction359
Book One. Science and the Scientist
Chapter 1.—The Choice of Facts362
Chapter 2.—The Future of Mathematics369
Chapter 3.—Mathematical Creation383
Chapter 4.—Chance395
Book 2. Mathematical Reasoning
Chapter 1.—The Relativity of Space413
Chapter 2.—Mathematical Definitions and Teaching430
Chapter 3.—Mathematics and Logic448
Chapter 4.—The New Logics460
Chapter 5.—The Latest Efforts of the Logisticians472
Book 3. The New Mechanics
Chapter 1.—Mechanics and Radium486
Chapter 2.—Mechanics and Optics496
Chapter 3.—The New Mechanics and Astronomy512
Book 4. Astronomic Science
Chapter 1.—The Milky Way and the Theory of Gases523
Chapter 2.—French Geodesy535
General Conclusions544
Index547

HENRI POINCARÉ

Sir George Darwin, worthy son of an immortal father, said, referring to what Poincaré was to him and to his work: "He must be regarded as the presiding genius—or, shall I say, my patron saint?"

Sir George Darwin, esteemed son of a legendary father, remarked, referring to what Poincaré meant to him and his work: "He should be seen as the guiding genius—or, shall I say, my patron saint?"

Henri Poincaré was born April 29, 1854, at Nancy, where his father was a physician highly respected. His schooling was broken into by the war of 1870-71, to get news of which he learned to read the German newspapers. He outclassed the other boys of his age in all subjects and in 1873 passed highest into the École Polytechnique, where, like John Bolyai at Maros Vásárhely, he followed the courses in mathematics without taking a note and without the syllabus. He proceeded in 1875 to the School of Mines, and was Nommé, March 26, 1879. But he won his doctorate in the University of Paris, August 1, 1879, and was appointed to teach in the Faculté des Sciences de Caen, December 1, 1879, whence he was quickly called to the University of Paris, teaching there from October 21, 1881, until his death, July 17, 1912. So it is an error to say he started as an engineer. At the early age of thirty-two he became a member of l'Académie des Sciences, and, March 5, 1908, was chosen Membre de l'Académie Française. July 1, 1909, the number of his writings was 436.

Henri Poincaré was born on April 29, 1854, in Nancy, where his father was a well-respected physician. His education was interrupted by the war of 1870-71, during which he learned to read German newspapers to get news. He outperformed his peers in all subjects and in 1873, ranked highest at the École Polytechnique, where, like John Bolyai at Maros Vásárhely, he attended mathematics classes without taking notes or following the syllabus. In 1875, he moved on to the School of Mines and was Nommé on March 26, 1879. However, he earned his doctorate at the University of Paris on August 1, 1879, and was appointed to teach at the Faculté des Sciences de Caen on December 1, 1879, from where he was quickly called to the University of Paris, teaching there from October 21, 1881, until his death on July 17, 1912. Therefore, it's incorrect to say he started as an engineer. At the young age of thirty-two, he became a member of l'Académie des Sciences, and on March 5, 1908, he was selected as a member of l'Académie Française. By July 1, 1909, he had published 436 writings.

His earliest publication was in 1878, and was not important. Afterward came an essay submitted in competition for the Grand Prix offered in 1880, but it did not win. Suddenly there came a change, a striking fire, a bursting forth, in February, 1881, and Poincaré tells us the very minute it happened. Mounting an omnibus, "at the moment when I put my foot upon the step, the idea came to me, without anything in my previous thoughts seeming to foreshadow it, that the transformations I had used to define the Fuchsian functions were identical with those of non-Euclidean geometry." Thereby was opened a perspective new and immense. Moreover, the magic wand of his whole [Pg x]life-work had been grasped, the Aladdin's lamp had been rubbed, non-Euclidean geometry, whose necromancy was to open up a new theory of our universe, whose brilliant exposition was commenced in his book Science and Hypothesis, which has been translated into six languages and has already had a circulation of over 20,000. The non-Euclidean notion is that of the possibility of alternative laws of nature, which in the Introduction to the Électricité et Optique, 1901, is thus put: "If therefore a phenomenon admits of a complete mechanical explanation, it will admit of an infinity of Others which will account equally well for all the peculiarities disclosed by experiment."

His earliest publication was in 1878, and it wasn't significant. Afterwards, he submitted an essay for the Grand Prix competition in 1880, but it didn't win. Then, in February 1881, everything changed dramatically. Poincaré describes the exact moment it happened. While getting onto a bus, "the moment I stepped onto the step, the idea struck me, without any hint in my previous thoughts, that the transformations I had used to define the Fuchsian functions were the same as those in non-Euclidean geometry." This opened up a vast new perspective. Moreover, he had grasped the magic wand of his entire life’s work; it was like rubbing Aladdin's lamp. Non-Euclidean geometry, which had the potential to create a new theory of our universe, began to be brilliantly presented in his book Science and Hypothesis, which has been translated into six languages and has already sold over 20,000 copies. The non-Euclidean idea suggests the possibility of alternative laws of nature, which is summarized in the Introduction to Électricité et Optique, 1901: "If a phenomenon can be fully explained mechanically, then there will be countless other explanations that can equally account for all the peculiarities revealed by experiments."

The scheme of laws of nature so largely due to Newton is merely one of an infinite number of conceivable rational schemes for helping us master and make experience; it is commode, convenient; but perhaps another may be vastly more advantageous. The old conception of true has been revised. The first expression of the new idea occurs on the title page of John Bolyai's marvelous Science Absolute of Space, in the phrase "haud unquam a priori decidenda."

The framework of natural laws largely established by Newton is just one of countless possible rational frameworks for helping us understand and navigate our experiences; it is convenient, but another might be much more beneficial. The traditional concept of what is “true” has been updated. The initial expression of this new idea is found on the title page of John Bolyai's incredible *Science Absolute of Space*, in the phrase "haud unquam a priori decidenda."

With bearing on the history of the earth and moon system and the origin of double stars, in formulating the geometric criterion of stability, Poincaré proved the existence of a previously unknown pear-shaped figure, with the possibility that the progressive deformation of this figure with increasing angular velocity might result in the breaking up of the rotating body into two detached masses. Of his treatise Les Méthodes nouvelles de la Méchanique céleste, Sir George Darwin says: "It is probable that for half a century to come it will be the mine from which humbler investigators will excavate their materials." Brilliant was his appreciation of Poincaré in presenting the gold medal of the Royal Astronomical Society. The three others most akin in genius are linked with him by the Sylvester medal of the Royal Society, the Lobachevski medal of the Physico-Mathematical Society of Kazan, and the Bolyai prize of the Hungarian Academy of Sciences. His work must be reckoned with the greatest mathematical achievements of mankind.

With relevance to the history of the Earth and moon system and the origin of double stars, Poincaré demonstrated the existence of a previously unknown pear-shaped figure while establishing the geometric criterion of stability. He suggested that the progressive deformation of this figure with increasing angular velocity could lead to the breakup of the rotating body into two separate masses. In his treatise Les Méthodes nouvelles de la Méchanique céleste, Sir George Darwin remarked: "It is likely that for the next fifty years, it will be the resource from which less prominent researchers will extract their materials." His recognition of Poincaré was exceptional when he presented him with the gold medal of the Royal Astronomical Society. The three others most closely associated in brilliance are connected to him by the Sylvester medal of the Royal Society, the Lobachevski medal of the Physico-Mathematical Society of Kazan, and the Bolyai prize of the Hungarian Academy of Sciences. His work is among the greatest mathematical achievements in human history.

The kernel of Poincaré's power lies in an oracle Sylvester often quoted to me as from Hesiod: The whole is less than its part.

The core of Poincaré's strength is based on an oracle that Sylvester frequently quoted to me, which comes from Hesiod: The whole is less than its parts.

He penetrates at once the divine simplicity of the perfectly general case, and thence descends, as from Olympus, to the special concrete earthly particulars.

He immediately grasps the divine simplicity of the perfectly general situation and then descends, as if from Olympus, to the specific, concrete earthly details.

A combination of seemingly extremely simple analytic and geometric concepts gave necessary general conclusions of immense scope from which sprang a disconcerting wilderness of possible deductions. And so he leaves a noble, fruitful heritage.

A mix of what seem like very basic analytical and geometric ideas led to essential general conclusions of great significance, which resulted in a confusing range of potential deductions. And so, he leaves behind a valuable, meaningful legacy.

Says Love: "His right is recognized now, and it is not likely that future generations will revise the judgment, to rank among the greatest mathematicians of all time."

Says Love: "His right is acknowledged now, and it's unlikely that future generations will change the assessment, to place him among the greatest mathematicians of all time."

George Bruce Halsted.

George Bruce Halsted.


 

SCIENCE AND HYPOTHESIS

 


AUTHOR'S PREFACE TO THE
TRANSLATION

I am exceedingly grateful to Dr. Halsted, who has been so good as to present my book to American readers in a translation, clear and faithful.

I am very grateful to Dr. Halsted, who has kindly presented my book to American readers in a clear and faithful translation.

Every one knows that this savant has already taken the trouble to translate many European treatises and thus has powerfully contributed to make the new continent understand the thought of the old.

Everyone knows that this expert has already taken the time to translate many European writings and has significantly helped the new continent understand the ideas of the old.

Some people love to repeat that Anglo-Saxons have not the same way of thinking as the Latins or as the Germans; that they have quite another way of understanding mathematics or of understanding physics; that this way seems to them superior to all others; that they feel no need of changing it, nor even of knowing the ways of other peoples.

Some people love to say that Anglo-Saxons think differently than Latins or Germans; that they have a completely different approach to understanding mathematics and physics; that they consider this approach to be better than all the others; and that they don't feel the need to change it or even to learn about how other cultures think.

In that they would beyond question be wrong, but I do not believe that is true, or, at least, that is true no longer. For some time the English and Americans have been devoting themselves much more than formerly to the better understanding of what is thought and said on the continent of Europe.

In that they would definitely be wrong, but I don't think that's the case, or at least, it isn't true anymore. For a while now, the English and Americans have been focusing much more than before on better understanding what is being thought and said in Europe.

To be sure, each people will preserve its characteristic genius, and it would be a pity if it were otherwise, supposing such a thing possible. If the Anglo-Saxons wished to become Latins, they would never be more than bad Latins; just as the French, in seeking to imitate them, could turn out only pretty poor Anglo-Saxons.

To be honest, each culture will maintain its unique essence, and it would be a shame if it were any different, assuming that was even possible. If the Anglo-Saxons wanted to become Latins, they would only end up as mediocre Latins; similarly, the French, in trying to copy them, could only become somewhat inadequate Anglo-Saxons.

And then the English and Americans have made scientific conquests they alone could have made; they will make still more of which others would be incapable. It would therefore be deplorable if there were no longer Anglo-Saxons.

And then the English and Americans have achieved scientific breakthroughs that only they could have accomplished; they will achieve even more that others would be unable to. It would therefore be unfortunate if there were no more Anglo-Saxons.

But continentals have on their part done things an Englishman could not have done, so that there is no need either for wishing all the world Anglo-Saxon.

But people on the continent have done things that an Englishman could never do, so there’s no need to wish the whole world were Anglo-Saxon.

Each has his characteristic aptitudes, and these aptitudes[Pg 4] should be diverse, else would the scientific concert resemble a quartet where every one wanted to play the violin.

Each person has their unique skills, and these skills[Pg 4] should be varied, or else the scientific collaboration would look like a quartet where everyone wanted to play the violin.

And yet it is not bad for the violin to know what the violon-cello is playing, and vice versa.

And yet it's not bad for the violin to know what the cello is playing, and vice versa.

This it is that the English and Americans are comprehending more and more; and from this point of view the translations undertaken by Dr. Halsted are most opportune and timely.

This is something that the English and Americans are understanding more and more; from this perspective, the translations done by Dr. Halsted are very timely and relevant.

Consider first what concerns the mathematical sciences. It is frequently said the English cultivate them only in view of their applications and even that they despise those who have other aims; that speculations too abstract repel them as savoring of metaphysic.

Consider first what relates to the mathematical sciences. It is often said that the English pursue them only for their practical applications and that they look down on those with different goals; that overly abstract theories turn them off as they seem to lean towards metaphysics.

The English, even in mathematics, are to proceed always from the particular to the general, so that they would never have an idea of entering mathematics, as do many Germans, by the gate of the theory of aggregates. They are always to hold, so to speak, one foot in the world of the senses, and never burn the bridges keeping them in communication with reality. They thus are to be incapable of comprehending or at least of appreciating certain theories more interesting than utilitarian, such as the non-Euclidean geometries. According to that, the first two parts of this book, on number and space, should seem to them void of all substance and would only baffle them.

The English, even in math, always proceed from the specific to the general, so they would never think of entering mathematics, like many Germans do, through the theory of aggregates. They tend to keep, so to speak, one foot in the world of the senses and never completely cut ties with reality. As a result, they struggle to understand or at least appreciate certain theories that are more fascinating than practical, such as non-Euclidean geometries. Because of this, the first two parts of this book, about numbers and space, might seem to them empty and would only confuse them.

But that is not true. And first of all, are they such uncompromising realists as has been said? Are they absolutely refractory, I do not say to metaphysic, but at least to everything metaphysical?

But that’s not true. And first of all, are they really such uncompromising realists as has been claimed? Are they completely resistant, I'm not saying to metaphysics, but at least to anything metaphysical?

Recall the name of Berkeley, born in Ireland doubtless, but immediately adopted by the English, who marked a natural and necessary stage in the development of English philosophy.

Recall the name of Berkeley, born in Ireland for sure, but quickly embraced by the English, who represented a natural and essential phase in the evolution of English philosophy.

Is this not enough to show they are capable of making ascensions otherwise than in a captive balloon?

Isn't this enough to prove they can make ascensions in ways other than just in a hot air balloon?

And to return to America, is not the Monist published at Chicago, that review which even to us seems bold and yet which finds readers?

And to go back to America, isn't the Monist published in Chicago, that magazine which even we think is daring but still has readers?

And in mathematics? Do you think American geometers are concerned only about applications? Far from it. The part of the science they cultivate most devotedly is the theory of[Pg 5] groups of substitutions, and under its most abstract form, the farthest removed from the practical.

And in mathematics? Do you think American geometers care only about applications? Not at all. The area of the field they focus on the most is the theory of[Pg 5] groups of substitutions, especially in its most abstract form, which is the furthest away from practical use.

Moreover, Dr. Halsted gives regularly each year a review of all productions relative to the non-Euclidean geometry, and he has about him a public deeply interested in his work. He has initiated this public into the ideas of Hilbert, and he has even written an elementary treatise on 'Rational Geometry,' based on the principles of the renowned German savant.

Moreover, Dr. Halsted regularly presents a review every year of all works related to non-Euclidean geometry, and he has a public that is deeply interested in his work. He has introduced this audience to the ideas of Hilbert, and he has even written a beginner's guide on 'Rational Geometry,' based on the principles of the famous German scholar.

To introduce this principle into teaching is surely this time to burn all bridges of reliance upon sensory intuition, and this is, I confess, a boldness which seems to me almost rashness.

To bring this principle into education really means cutting off any dependence on sensory intuition, and I must admit, this feels like a bold move that borders on recklessness.

The American public is therefore much better prepared than has been thought for investigating the origin of the notion of space.

The American public is therefore much better prepared than previously thought to explore the origin of the concept of space.

Moreover, to analyze this concept is not to sacrifice reality to I know not what phantom. The geometric language is after all only a language. Space is only a word that we have believed a thing. What is the origin of this word and of other words also? What things do they hide? To ask this is permissible; to forbid it would be, on the contrary, to be a dupe of words; it would be to adore a metaphysical idol, like savage peoples who prostrate themselves before a statue of wood without daring to take a look at what is within.

Moreover, analyzing this concept doesn’t mean sacrificing reality to an unknown illusion. Geometric language is, after all, just a form of communication. Space is merely a term we've come to accept as a thing. What’s the origin of this term and others too? What truths do they conceal? It’s permissible to ask this; forbidding it would be, on the contrary, to fall for mere words; it would be to worship a metaphysical idol, much like primitive people who bow before a wooden statue without daring to examine what lies inside.

In the study of nature, the contrast between the Anglo-Saxon spirit and the Latin spirit is still greater.

In studying nature, the difference between the Anglo-Saxon spirit and the Latin spirit is even more pronounced.

The Latins seek in general to put their thought in mathematical form; the English prefer to express it by a material representation.

The Latins generally try to express their ideas in mathematical form; the English prefer to convey them through a physical representation.

Both doubtless rely only on experience for knowing the world; when they happen to go beyond this, they consider their foreknowledge as only provisional, and they hasten to ask its definitive confirmation from nature herself.

Both certainly rely solely on experience to understand the world; when they occasionally go beyond this, they view their prior knowledge as only temporary, and they quickly seek its ultimate confirmation from nature itself.

But experience is not all, and the savant is not passive; he does not wait for the truth to come and find him, or for a chance meeting to bring him face to face with it. He must go to meet it, and it is for his thinking to reveal to him the way leading thither. For that there is need of an instrument; well, just there begins the difference—the instrument the Latins ordinarily choose is not that preferred by the Anglo-Saxons.[Pg 6]

But experience isn't everything, and the expert isn't passive; he doesn't just wait for the truth to come to him or for a random encounter to bring it to light. He has to go out and seek it, and it's his thinking that will show him the path to get there. For that, he needs a tool; and this is where the difference lies—the tool that Latins typically choose isn't the same as the one preferred by Anglo-Saxons.[Pg 6]

For a Latin, truth can be expressed only by equations; it must obey laws simple, logical, symmetric and fitted to satisfy minds in love with mathematical elegance.

For a Latin speaker, truth can only be conveyed through equations; it has to follow laws that are simple, logical, symmetric, and designed to appeal to minds that appreciate mathematical beauty.

The Anglo-Saxon to depict a phenomenon will first be engrossed in making a model, and he will make it with common materials, such as our crude, unaided senses show us them. He also makes a hypothesis, he assumes implicitly that nature, in her finest elements, is the same as in the complicated aggregates which alone are within the reach of our senses. He concludes from the body to the atom.

The Anglo-Saxon who wants to illustrate a phenomenon will first be focused on creating a model, using everyday materials that our basic, unaided senses present to us. He also develops a hypothesis, assuming that nature, in its simplest forms, is the same as in the complex combinations that we can only perceive with our senses. He draws conclusions from the visible body to the smallest atom.

Both therefore make hypotheses, and this indeed is necessary, since no scientist has ever been able to get on without them. The essential thing is never to make them unconsciously.

Both therefore make hypotheses, and this is necessary, since no scientist has ever been able to do without them. The important thing is to never make them unconsciously.

From this point of view again, it would be well for these two sorts of physicists to know something of each other; in studying the work of minds so unlike their own, they will immediately recognize that in this work there has been an accumulation of hypotheses.

From this perspective, it would be beneficial for these two types of physicists to learn about each other; by examining the work of minds so different from their own, they will quickly realize that there has been a build-up of hypotheses in this work.

Doubtless this will not suffice to make them comprehend that they on their part have made just as many; each sees the mote without seeing the beam; but by their criticisms they will warn their rivals, and it may be supposed these will not fail to render them the same service.

Doubtless this will not be enough to make them understand that they, too, have made just as many mistakes; each sees the tiny flaw without noticing the larger issue; however, through their criticisms, they will alert their competitors, and it's likely that those competitors will reciprocate the favor.

The English procedure often seems to us crude, the analogies they think they discover to us seem at times superficial; they are not sufficiently interlocked, not precise enough; they sometimes permit incoherences, contradictions in terms, which shock a geometric spirit and which the employment of the mathematical method would immediately have put in evidence. But most often it is, on the other hand, very fortunate that they have not perceived these contradictions; else would they have rejected their model and could not have deduced from it the brilliant results they have often made to come out of it.

The English process can often seem rough to us, and the comparisons they think they find can sometimes feel shallow; they aren't interconnected enough and lack precision. At times, they allow for inconsistencies and contradictions that can be jarring to a logical mindset, which a mathematical approach would have highlighted immediately. However, it’s often a good thing that they haven’t noticed these contradictions; otherwise, they would have discarded their model and wouldn’t have been able to draw out the impressive results they frequently achieve.

And then these very contradictions, when they end by perceiving them, have the advantage of showing them the hypothetical character of their conceptions, whereas the mathematical method, by its apparent rigor and inflexible course, often inspires in us a confidence nothing warrants, and prevents our looking about us.[Pg 7]

And then these very contradictions, when they finally recognize them, have the benefit of revealing the tentative nature of their ideas. In contrast, the mathematical method, with its seeming precision and strict approach, often gives us an unwarranted sense of confidence and stops us from examining our surroundings.[Pg 7]

From another point of view, however, the two conceptions are very unlike, and if all must be said, they are very unlike because of a common fault.

From another perspective, though, the two ideas are quite different, and to be honest, they differ significantly due to a shared flaw.

The English wish to make the world out of what we see. I mean what we see with the unaided eye, not the microscope, nor that still more subtile microscope, the human head guided by scientific induction.

The English want to create the world based on what we can see. I mean what we see with our bare eyes, not with a microscope, nor that even more delicate microscope, the human mind guided by scientific reasoning.

The Latin wants to make it out of formulas, but these formulas are still the quintessenced expression of what we see. In a word, both would make the unknown out of the known, and their excuse is that there is no way of doing otherwise.

The Latin wants to create something from formulas, but these formulas are still the essential expression of what we see. In other words, both would turn the known into the unknown, claiming that there's no other way to do it.

And yet is this legitimate, if the unknown be the simple and the known the complex?

And yet, is this valid, if the unknown is simple and the known is complex?

Shall we not get of the simple a false idea, if we think it like the complex, or worse yet if we strive to make it out of elements which are themselves compounds?

Shall we not get a misleading idea of the simple if we consider it like the complex, or even worse, if we try to create it from elements that are themselves compounds?

Is not each great advance accomplished precisely the day some one has discovered under the complex aggregate shown by our senses something far more simple, not even resembling it—as when Newton replaced Kepler's three laws by the single law of gravitation, which was something simpler, equivalent, yet unlike?

Isn't every major breakthrough achieved the day someone discovers something much simpler beneath the complex layers our senses perceive—like when Newton replaced Kepler's three laws with the single law of gravitation, which was simpler, equivalent, yet completely different?

One is justified in asking if we are not on the eve of just such a revolution or one even more important. Matter seems on the point of losing its mass, its solidest attribute, and resolving itself into electrons. Mechanics must then give place to a broader conception which will explain it, but which it will not explain.

One can rightly question whether we are on the brink of such a revolution, or perhaps an even more significant one. It seems that matter is about to lose its mass, its most solid characteristic, and reduce itself to electrons. Mechanics will then need to give way to a broader understanding that will account for it, but which it will not itself explain.

So it was in vain the attempt was made in England to construct the ether by material models, or in France to apply to it the laws of dynamic.

So it was pointless to try to create ether in England using physical models, or in France to apply dynamic laws to it.

The ether it is, the unknown, which explains matter, the known; matter is incapable of explaining the ether.

The ether is the unknown that explains matter, the known; matter cannot explain the ether.

Poincaré.

Poincaré.


INTRODUCTION

BY PROFESSOR JOSIAH ROYCE

Harvard

The treatise of a master needs no commendation through the words of a mere learner. But, since my friend and former fellow student, the translator of this volume, has joined with another of my colleagues, Professor Cattell, in asking me to undertake the task of calling the attention of my fellow students to the importance and to the scope of M. Poincaré's volume, I accept the office, not as one competent to pass judgment upon the book, but simply as a learner, desirous to increase the number of those amongst us who are already interested in the type of researches to which M. Poincaré has so notably contributed.

The work of a master doesn't need praise from a mere student. However, since my friend and former classmate, the translator of this book, has asked me, along with my colleague Professor Cattell, to highlight the importance and scope of M. Poincaré's work, I gladly take on this role, not as someone qualified to critique the book, but simply as a student eager to encourage more of us to engage with the kind of research that M. Poincaré has significantly advanced.

I

The branches of inquiry collectively known as the Philosophy of Science have undergone great changes since the appearance of Herbert Spencer's First Principles, that volume which a large part of the general public in this country used to regard as the representative compend of all modern wisdom relating to the foundations of scientific knowledge. The summary which M. Poincaré gives, at the outset of his own introduction to the present work, where he states the view which the 'superficial observer' takes of scientific truth, suggests, not indeed Spencer's own most characteristic theories, but something of the spirit in which many disciples of Spencer interpreting their master's formulas used to conceive the position which science occupies in dealing with experience. It was well known to them, indeed, that experience is a constant guide, and an inexhaustible source both of novel scientific results and of unsolved problems; but the fundamental Spencerian principles of science, such as 'the persistence of force,' the 'rhythm of motion' and the rest, were treated by Spencer himself as demonstrably objective, although[Pg 10] indeed 'relative' truths, capable of being tested once for all by the 'inconceivability of the opposite,' and certain to hold true for the whole 'knowable' universe. Thus, whether one dwelt upon the results of such a mathematical procedure as that to which M. Poincaré refers in his opening paragraphs, or whether, like Spencer himself, one applied the 'first principles' to regions of less exact science, this confidence that a certain orthodoxy regarding the principles of science was established forever was characteristic of the followers of the movement in question. Experience, lighted up by reason, seemed to them to have predetermined for all future time certain great theoretical results regarding the real constitution of the 'knowable' cosmos. Whoever doubted this doubted 'the verdict of science.'

The branches of inquiry collectively known as the Philosophy of Science have undergone significant changes since Herbert Spencer's First Principles, which many in the general public used to see as the definitive guide to all modern wisdom about the foundations of scientific knowledge. The summary that M. Poincaré provides at the beginning of his introduction to this work reflects not so much Spencer's most characteristic theories but rather the spirit in which many of Spencer's followers interpreted his teachings regarding the role of science in understanding experience. They understood that experience is a constant guide and an endless source of new scientific insights and unresolved issues; however, the fundamental Spencerian principles of science, such as 'the persistence of force,' 'the rhythm of motion,' and others, were treated by Spencer as demonstrably objective, although indeed 'relative' truths that could be validated once and for all by the 'inconceivability of the opposite,' and were believed to hold true for the entire 'knowable' universe. Thus, whether one focused on the outcomes of a mathematical procedure, such as that which M. Poincaré mentions in his opening paragraphs, or whether one, like Spencer himself, applied the 'first principles' to areas of less precise science, there was a strong belief among the followers of this movement that a certain orthodoxy regarding the principles of science was established for good. They believed that experience, illuminated by reason, had determined certain major theoretical outcomes about the true nature of the 'knowable' cosmos for all time. Anyone who questioned this was doubting 'the verdict of science.'

Some of us well remember how, when Stallo's 'Principles and Theories of Modern Physics' first appeared, this sense of scientific orthodoxy was shocked amongst many of our American readers and teachers of science. I myself can recall to mind some highly authoritative reviews of that work in which the author was more or less sharply taken to task for his ignorant presumption in speaking with the freedom that he there used regarding such sacred possessions of humanity as the fundamental concepts of physics. That very book, however, has quite lately been translated into German as a valuable contribution to some of the most recent efforts to reconstitute a modern 'philosophy of nature.' And whatever may be otherwise thought of Stallo's critical methods, or of his results, there can be no doubt that, at the present moment, if his book were to appear for the first time, nobody would attempt to discredit the work merely on account of its disposition to be agnostic regarding the objective reality of the concepts of the kinetic theory of gases, or on account of its call for a logical rearrangement of the fundamental concepts of the theory of energy. We are no longer able so easily to know heretics at first sight.

Some of us remember how, when Stallo's 'Principles and Theories of Modern Physics' was first published, it shocked many American readers and science teachers who held a sense of scientific orthodoxy. I can recall several authoritative reviews of that work where the author was criticized for his ignorant arrogance in discussing such cherished aspects of humanity like the fundamental concepts of physics. However, that very book has recently been translated into German as an important contribution to the latest efforts to create a modern 'philosophy of nature.' Regardless of what might be thought of Stallo's critical methods or his findings, there is no doubt that if his book were to be published today, no one would try to discredit it simply because it is agnostic about the objective reality of the concepts of the kinetic theory of gases or because it calls for a logical reorganization of the fundamental concepts of the theory of energy. We can no longer easily identify heretics at first glance.

For we now appear to stand in this position: The control of natural phenomena, which through the sciences men have attained, grows daily vaster and more detailed, and in its details more assured. Phenomena men know and predict better than ever. But regarding the most general theories, and the[Pg 11] most fundamental, of science, there is no longer any notable scientific orthodoxy. Thus, as knowledge grows firmer and wider, conceptual construction becomes less rigid. The field of the theoretical philosophy of nature—yes, the field of the logic of science—this whole region is to-day an open one. Whoever will work there must indeed accept the verdict of experience regarding what happens in the natural world. So far he is indeed bound. But he may undertake without hindrance from mere tradition the task of trying afresh to reduce what happens to conceptual unity. The circle-squarers and the inventors of devices for perpetual motion are indeed still as unwelcome in scientific company as they were in the days when scientific orthodoxy was more rigidly defined; but that is not because the foundations of geometry are now viewed as completely settled, beyond controversy, nor yet because the 'persistence of force' has been finally so defined as to make the 'opposite inconceivable' and the doctrine of energy beyond the reach of novel formulations. No, the circle-squarers and the inventors of devices for perpetual motion are to-day discredited, not because of any unorthodoxy of their general philosophy of nature, but because their views regarding special facts and processes stand in conflict with certain equally special results of science which themselves admit of very various general theoretical interpretations. Certain properties of the irrational number π are known, in sufficient multitude to justify the mathematician in declining to listen to the arguments of the circle-squarer; but, despite great advances, and despite the assured results of Dedekind, of Cantor, of Weierstrass and of various others, the general theory of the logic of the numbers, rational and irrational, still presents several important features of great obscurity; and the philosophy of the concepts of geometry yet remains, in several very notable respects, unconquered territory, despite the work of Hilbert and of Pieri, and of our author himself. The ordinary inventors of the perpetual motion machines still stand in conflict with accepted generalizations; but nobody knows as yet what the final form of the theory of energy will be, nor can any one say precisely what place the phenomena of the radioactive bodies will occupy in that theory. The alchemists would not[Pg 12] be welcome workers in modern laboratories; yet some sorts of transformation and of evolution of the elements are to-day matters which theory can find it convenient, upon occasion, to treat as more or less exactly definable possibilities; while some newly observed phenomena tend to indicate, not indeed that the ancient hopes of the alchemists were well founded, but that the ultimate constitution of matter is something more fluent, less invariant, than the theoretical orthodoxy of a recent period supposed. Again, regarding the foundations of biology, a theoretical orthodoxy grows less possible, less definable, less conceivable (even as a hope) the more knowledge advances. Once 'mechanism' and 'vitalism' were mutually contradictory theories regarding the ultimate constitution of living bodies. Now they are obviously becoming more and more 'points of view,' diverse but not necessarily conflicting. So far as you find it convenient to limit your study of vital processes to those phenomena which distinguish living matter from all other natural objects, you may assume, in the modern 'pragmatic' sense, the attitude of a 'neo-vitalist.' So far, however, as you are able to lay stress, with good results, upon the many ways in which the life processes can be assimilated to those studied in physics and in chemistry, you work as if you were a partisan of 'mechanics.' In any case, your special science prospers by reason of the empirical discoveries that you make. And your theories, whatever they are, must not run counter to any positive empirical results. But otherwise, scientific orthodoxy no longer predetermines what alone it is respectable for you to think about the nature of living substance.

We now seem to be in this situation: The understanding of natural phenomena, achieved through the sciences, is continually expanding in complexity and detail, and this complexity is becoming more certain. People can now know and predict phenomena better than ever before. However, when it comes to the most fundamental theories in science, there is no longer any significant scientific orthodoxy. As knowledge becomes broader and more solid, conceptual frameworks are becoming less rigid. The area of theoretical philosophy of nature—yes, the area of the logic of science—today remains open. Anyone who engages in this field must accept the experiences that inform us about what happens in the natural world. This requirement is non-negotiable. However, they can freely pursue the task of trying to unify what occurs under a single conceptual framework, without being restricted by tradition. The creators of impossible geometrical constructions and perpetual motion devices are just as unwelcome in scientific circles now as they were when scientific orthodoxy was more strictly defined; yet, this rejection is not because the principles of geometry are considered settled and beyond debate, nor because the concept of energy is so well-defined that any alternative views are inconceivable. Rather, the reason these creators are discredited today is not due to any unorthodox nature philosophy but because their specific claims about particular facts and processes conflict with certain established scientific results, which themselves can be interpreted in various theoretical ways. Some properties of the irrational number π are established enough to justify mathematicians in ignoring the arguments of those trying to square the circle; but despite significant advancements and established results from thinkers like Dedekind, Cantor, Weierstrass, and others, the general theory surrounding the logic of numbers—rational and irrational—still contains several important aspects that are unclear; likewise, the philosophy concerning the concepts of geometry remains, in several significant ways, uncharted territory, despite the contributions of Hilbert, Pieri, and our author. The ordinary inventors of perpetual motion machines still contradict established generalizations; however, no one yet knows what the final form of the theory of energy will be, nor can anyone specify how the phenomena of radioactive materials will fit into that theory. Alchemists would not be welcome in modern labs; yet some types of transformations and evolutions of elements are now treated by theories as more or less definable possibilities, while some newly observed phenomena suggest—not that the ancient dreams of alchemists were valid—but that the fundamental nature of matter is more fluid and less fixed than recent theoretical orthodoxy believed. Similarly, regarding the foundations of biology, forming a theoretical orthodoxy becomes less feasible, definable, or even conceivable (as a hope) as knowledge progresses. Once, 'mechanism' and 'vitalism' were opposing theories about the ultimate nature of living beings. Now, they are increasingly seen as 'points of view' that are varied but not necessarily conflicting. As long as it’s useful for your study of life processes to focus on what distinguishes living matter from other materials, you can adopt a 'neo-vitalist' perspective in a modern, pragmatic sense. Conversely, when you emphasize how life processes can be aligned with those observed in physics and chemistry, you're acting as if you support 'mechanics.' In any case, your specific field thrives due to the empirical discoveries you make, and your theories, whatever they may be, cannot contradict any positive empirical results. However, outside of that, scientific orthodoxy no longer dictates what it is acceptable for you to think about the nature of living matter.

This gain in the freedom of theory, coming, as it does, side by side with a constant increase of a positive knowledge of nature, lends itself to various interpretations, and raises various obvious questions.

This increase in the freedom of theory, happening alongside a continual growth in our understanding of nature, can be interpreted in different ways and brings up several clear questions.

II

One of the most natural of these interpretations, one of the most obvious of these questions, may be readily stated. Is not the lesson of all these recent discussions simply this, that general theories are simply vain, that a philosophy of nature is an idle[Pg 13] dream, and that the results of science are coextensive with the range of actual empirical observation and of successful prediction? If this is indeed the lesson, then the decline of theoretical orthodoxy in science is—like the eclipse of dogma in religion—merely a further lesson in pure positivism, another proof that man does best when he limits himself to thinking about what can be found in human experience, and in trying to plan what can be done to make human life more controllable and more reasonable. What we are free to do as we please—is it any longer a serious business? What we are free to think as we please—is it of any further interest to one who is in search of truth? If certain general theories are mere conceptual constructions, which to-day are, and to-morrow are cast into the oven, why dignify them by the name of philosophy? Has science any place for such theories? Why be a 'neo-vitalist,' or an 'evolutionist,' or an 'atomist,' or an 'Energetiker'? Why not say, plainly: "Such and such phenomena, thus and thus described, have been observed; such and such experiences are to be expected, since the hypotheses by the terms of which we are required to expect them have been verified too often to let us regard the agreement with experience as due merely to chance; so much then with reasonable assurance we know; all else is silence—or else is some matter to be tested by another experiment?" Why not limit our philosophy of science strictly to such a counsel of resignation? Why not substitute, for the old scientific orthodoxy, simply a confession of ignorance, and a resolution to devote ourselves to the business of enlarging the bounds of actual empirical knowledge?

One of the most straightforward interpretations of these discussions, one of the most obvious questions, can be easily stated. Isn't the lesson from all these recent debates simply this: that general theories are ultimately futile, that a philosophy of nature is a pointless dream, and that the results of science are limited to actual empirical observation and successful predictions? If that’s really the case, then the decline of theoretical orthodoxy in science is—like the decline of dogma in religion—just another example of pure positivism, further proof that humanity does best when it focuses on what can be found in human experience and works on making life more manageable and reasonable. Is what we can do as we wish still a serious matter? Is what we can think as we wish still relevant to those seeking the truth? If certain general theories are just concepts that today exist and tomorrow get discarded, why call them philosophy? Does science even have room for such theories? Why be a 'neo-vitalist,' or an 'evolutionist,' or an 'atomist,' or an 'Energetiker'? Why not just say: "These phenomena have been observed and described this way; these experiences can be expected because the hypotheses supporting them have been verified too often for us to consider the correlation with experience as mere coincidence; this much we can know with reasonable certainty; everything else is silence—or something to be tested in another experiment?" Why not restrict our philosophy of science to such a viewpoint? Why not replace the old scientific orthodoxy with a simple acknowledgment of ignorance and a commitment to expanding the limits of actual empirical knowledge?

Such comments upon the situation just characterized are frequently made. Unfortunately, they seem not to content the very age whose revolt from the orthodoxy of traditional theory, whose uncertainty about all theoretical formulations, and whose vast wealth of empirical discoveries and of rapidly advancing special researches, would seem most to justify these very comments. Never has there been better reason than there is to-day to be content, if rational man could be content, with a pure positivism. The splendid triumphs of special research in the most various fields, the constant increase in our practical control over[Pg 14] nature—these, our positive and growing possessions, stand in glaring contrast to the failure of the scientific orthodoxy of a former period to fix the outlines of an ultimate creed about the nature of the knowable universe. Why not 'take the cash and let the credit go'? Why pursue the elusive theoretical 'unification' any further, when what we daily get from our sciences is an increasing wealth of detailed information and of practical guidance?

Such comments about the situation described are often made. Unfortunately, they don’t seem to satisfy the very era that has turned away from the traditional theories, is unsure about all theoretical ideas, and possesses a wealth of empirical discoveries and rapidly advancing specialized research, which would seem to validate these comments. There has never been a better reason than today to embrace pure positivism, if rational people could actually be satisfied with it. The remarkable successes of specialized research across various fields and our constantly increasing control over[Pg 14]nature—these positive and growing achievements starkly contrast with the previous scientific orthodoxy's failure to establish a definitive belief about the nature of the knowable universe. Why not 'take the cash and let the credit go'? Why continue chasing the elusive theoretical 'unification' when what we gain daily from our sciences is a growing abundance of detailed information and practical guidance?

As a fact, however, the known answer of our own age to these very obvious comments is a constant multiplication of new efforts towards large and unifying theories. If theoretical orthodoxy is no longer clearly definable, theoretical construction was never more rife. The history of the doctrine of evolution, even in its most recent phases, when the theoretical uncertainties regarding the 'factors of evolution' are most insisted upon, is full of illustrations of this remarkable union of scepticism in critical work with courage regarding the use of the scientific imagination. The history of those controversies regarding theoretical physics, some of whose principal phases M. Poincaré, in his book, sketches with the hand of the master, is another illustration of the consciousness of the time. Men have their freedom of thought in these regions; and they feel the need of making constant and constructive use of this freedom. And the men who most feel this need are by no means in the majority of cases professional metaphysicians—or students who, like myself, have to view all these controversies amongst the scientific theoreticians from without as learners. These large theoretical constructions are due, on the contrary, in a great many cases to special workers, who have been driven to the freedom of philosophy by the oppression of experience, and who have learned in the conflict with special problems the lesson that they now teach in the form of general ideas regarding the philosophical aspects of science.

As a fact, though, the response from our own time to these very obvious observations is a continuous increase in new efforts toward comprehensive and unifying theories. If theoretical orthodoxy can no longer be clearly defined, theoretical construction has never been more prevalent. The history of the theory of evolution, even in its most recent stages—when the uncertainties about the 'factors of evolution' are most emphasized—is filled with examples of this remarkable blend of skepticism in critical work and boldness in applying scientific imagination. The history of the debates in theoretical physics, some of which M. Poincaré outlines expertly in his book, is another example of the awareness of the era. People have their freedom of thought in these areas, and they feel the need to continually and constructively use this freedom. Interestingly, those who feel this need are often not professional metaphysicians—or students like myself who have to observe these debates among scientific theorists from the sidelines as learners. These expansive theoretical constructions often come from specialized workers who have turned to philosophy due to the constraints of experience and who have learned from grappling with specific problems the lessons they now express as general ideas about the philosophical aspects of science.

Why, then, does science actually need general theories, despite the fact that these theories inevitably alter and pass away? What is the service of a philosophy of science, when it is certain that the philosophy of science which is best suited to the needs of one generation must be superseded by the advancing insight of the next generation? Why must that which endlessly grows,[Pg 15] namely, man's knowledge of the phenomenal order of nature, be constantly united in men's minds with that which is certain to decay, namely, the theoretical formulation of special knowledge in more or less completely unified systems of doctrine?

Why does science really need general theories, even though these theories inevitably change and fade away? What’s the purpose of a philosophy of science, knowing that the philosophy that fits one generation's needs will surely be replaced by the new understandings of the next? Why does humanity's ever-expanding knowledge of the natural world have to be constantly linked in our minds with what is bound to fall apart, specifically, the theoretical frameworks of specialized knowledge within largely unified systems of thought?

I understand our author's volume to be in the main an answer to this question. To be sure, the compact and manifold teachings which this text contains relate to a great many different special issues. A student interested in the problems of the philosophy of mathematics, or in the theory of probabilities, or in the nature and office of mathematical physics, or in still other problems belonging to the wide field here discussed, may find what he wants here and there in the text, even in case the general issues which give the volume its unity mean little to him, or even if he differs from the author's views regarding the principal issues of the book. But in the main, this volume must be regarded as what its title indicates—a critique of the nature and place of hypothesis in the work of science and a study of the logical relations of theory and fact. The result of the book is a substantial justification of the scientific utility of theoretical construction—an abandonment of dogma, but a vindication of the rights of the constructive reason.

I see the author's work primarily as a response to this question. Certainly, the compact and diverse teachings in this text address a wide range of specific issues. A student interested in the philosophy of mathematics, probability theory, the nature and role of mathematical physics, or other problems within this broad field may find relevant insights scattered throughout the text, even if the overarching themes that unify the volume don’t resonate with him or if he disagrees with the author’s views on the main issues of the book. However, overall, this volume should be viewed as its title suggests—a critique of the nature and role of hypothesis in scientific work and an exploration of the logical connections between theory and fact. The book ultimately offers a strong justification for the scientific value of theoretical construction—moving away from dogma while affirming the importance of constructive reasoning.

III

The most notable of the results of our author's investigation of the logic of scientific theories relates, as I understand his work, to a topic which the present state of logical investigation, just summarized, makes especially important, but which has thus far been very inadequately treated in the text-books of inductive logic. The useful hypotheses of science are of two kinds:

The most significant result of our author's research into the logic of scientific theories, as I grasp his work, pertains to a subject that the current state of logical investigation, just summarized, renders particularly crucial, yet has so far been poorly addressed in the textbooks of inductive logic. The useful hypotheses in science fall into two categories:

1. The hypotheses which are valuable precisely because they are either verifiable or else refutable through a definite appeal to the tests furnished by experience; and

1. The hypotheses that are valuable precisely because they can either be verified or disproven through a clear reference to the tests provided by experience; and

2. The hypotheses which, despite the fact that experience suggests them, are valuable despite, or even because, of the fact that experience can neither confirm nor refute them. The contrast between these two kinds of hypotheses is a prominent topic of our author's discussion.

2. The hypotheses that, even though experience supports them, are valuable despite, or even because, the fact that experience can neither confirm nor refute them. The difference between these two types of hypotheses is a key topic in our author's discussion.

Hypotheses of the general type which I have here placed first[Pg 16] in order are the ones which the text-books of inductive logic and those summaries of scientific method which are customary in the course of the elementary treatises upon physical science are already accustomed to recognize and to characterize. The value of such hypotheses is indeed undoubted. But hypotheses of the type which I have here named in the second place are far less frequently recognized in a perfectly explicit way as useful aids in the work of special science. One usually either fails to admit their presence in scientific work, or else remains silent as to the reasons of their usefulness. Our author's treatment of the work of science is therefore especially marked by the fact that he explicitly makes prominent both the existence and the scientific importance of hypotheses of this second type. They occupy in his discussion a place somewhat analogous to each of the two distinct positions occupied by the 'categories' and the 'forms of sensibility,' on the one hand, and by the 'regulative principles of the reason,' on the other hand, in the Kantian theory of our knowledge of nature. That is, these hypotheses which can neither be confirmed nor refuted by experience appear, in M. Poincaré's account, partly (like the conception of 'continuous quantity') as devices of the understanding whereby we give conceptual unity and an invisible connectedness to certain types of phenomenal facts which come to us in a discrete form and in a confused variety; and partly (like the larger organizing concepts of science) as principles regarding the structure of the world in its wholeness; i. e., as principles in the light of which we try to interpret our experience, so as to give to it a totality and an inclusive unity such as Euclidean space, or such as the world of the theory of energy is conceived to possess. Thus viewed, M. Poincaré's logical theory of this second class of hypotheses undertakes to accomplish, with modern means and in the light of to-day's issues, a part of what Kant endeavored to accomplish in his theory of scientific knowledge with the limited means which were at his disposal. Those aspects of science which are determined by the use of the hypotheses of this second kind appear in our author's account as constituting an essential human way of viewing nature, an interpretation rather than a portrayal or a prediction of the objective facts of nature, an[Pg 17] adjustment of our conceptions of things to the internal needs of our intelligence, rather than a grasping of things as they are in themselves.

Hypotheses of the general kind that I’ve placed first[Pg 16] are the ones that textbooks on inductive logic and summaries of scientific methods usually acknowledge and describe in basic courses on physical science. Their value is definitely unquestionable. However, the hypotheses that I’ve named second are much less frequently recognized as useful tools in specialized scientific work. Usually, people either overlook their presence in scientific work or stay silent about why they’re valuable. Our author’s approach to scientific work stands out because he clearly highlights both the existence and scientific significance of these second-type hypotheses. In his discussion, they take on a role somewhat similar to the two distinct positions held by 'categories' and 'forms of sensibility,' on one side, and 'regulative principles of reason,' on the other hand, in Kant’s theory of our understanding of nature. These hypotheses, which can’t be confirmed or disproven by experience, are presented in M. Poincaré’s account partly (like the idea of 'continuous quantity') as tools of understanding that help us bring together and connect certain types of phenomena that come to us in a fragmented and confusing manner; and partly (like the broader organizing concepts in science) as principles regarding the overall structure of the world; i.e., as principles through which we attempt to interpret our experiences, aiming to give them a totality and comprehensive unity similar to how we conceive Euclidean space or the world described by energy theory. Viewed this way, M. Poincaré's logical theory of this second category of hypotheses aims to achieve, with modern tools and today’s challenges, some of what Kant sought to do in his scientific knowledge theory with the limited resources available to him. The aspects of science shaped by these second-type hypotheses appear in our author’s explanation as a fundamental human perspective on nature, serving as an interpretation rather than a mere portrayal or prediction of objective facts, and an adaptation of our understanding of things according to our intellectual needs, rather than a direct acknowledgment of things as they exist independently.

To be sure, M. Poincaré's view, in this portion of his work, obviously differs, meanwhile, from that of Kant, as well as this agrees, in a measure, with the spirit of the Kantian epistemology. I do not mean therefore to class our author as a Kantian. For Kant, the interpretations imposed by the 'forms of sensibility,' and by the 'categories of the understanding,' upon our doctrine of nature are rigidly predetermined by the unalterable 'form' of our intellectual powers. We 'must' thus view facts, whatever the data of sense must be. This, of course, is not M. Poincaré's view. A similarly rigid predetermination also limits the Kantian 'ideas of the reason' to a certain set of principles whose guidance of the course of our theoretical investigations is indeed only 'regulative,' but is 'a priori,' and so unchangeable. For M. Poincaré, on the contrary, all this adjustment of our interpretations of experience to the needs of our intellect is something far less rigid and unalterable, and is constantly subject to the suggestions of experience. We must indeed interpret in our own way; but our way is itself only relatively determinate; it is essentially more or less plastic; other interpretations of experience are conceivable. Those that we use are merely the ones found to be most convenient. But this convenience is not absolute necessity. Unverifiable and irrefutable hypotheses in science are indeed, in general, indispensable aids to the organization and to the guidance of our interpretation of experience. But it is experience itself which points out to us what lines of interpretation will prove most convenient. Instead of Kant's rigid list of a priori 'forms,' we consequently have in M. Poincaré's account a set of conventions, neither wholly subjective and arbitrary, nor yet imposed upon us unambiguously by the external compulsion of experience. The organization of science, so far as this organization is due to hypotheses of the kind here in question, thus resembles that of a constitutional government—neither absolutely necessary, nor yet determined apart from the will of the subjects, nor yet accidental—a free, yet not a capricious establishment of good order, in conformity with empirical needs.[Pg 18]

To be sure, M. Poincaré's perspective in this part of his work clearly differs from Kant's, even though it somewhat aligns with the spirit of Kantian epistemology. I don't intend to label our author as a Kantian. For Kant, the interpretations imposed by the 'forms of sensibility' and the 'categories of understanding' on our understanding of nature are strictly predetermined by the unchangeable 'form' of our intellectual powers. We 'must' see facts in a certain way, regardless of the sensory data. This is certainly not M. Poincaré's perspective. A similar strict predetermination also confines the Kantian 'ideas of reason' to a limited set of principles that guide our theoretical investigations in a way that is only 'regulative,' but 'a priori,' and therefore unchangeable. In contrast, for M. Poincaré, the adjustment of our interpretations of experience to fit the needs of our intellect is much more flexible and constantly influenced by experience. We do need to interpret in our own way; however, our approach is only relatively specific; it is essentially more or less adaptable; other interpretations of experience are possible. The ones we use are simply the most convenient. But this convenience is not an absolute necessity. In science, unverifiable and irrefutable hypotheses are generally essential tools for organizing and guiding our interpretation of experience. However, it is experience itself that shows us which lines of interpretation will be most beneficial. Instead of Kant's rigid list of a priori 'forms,' M. Poincaré provides a set of conventions that are neither completely subjective and arbitrary nor clearly dictated by the external pressure of experience. The organization of science, insofar as it relies on such hypotheses, resembles that of a constitutional government—neither absolutely necessary, nor completely determined independent of the will of the subjects, nor purely coincidental—a free, yet not capricious establishment of good order, in line with empirical needs.[Pg 18]

Characteristic remains, however, for our author, as, in his decidedly contrasting way, for Kant, the thought that without principles which at every stage transcend precise confirmation through such experience as is then accessible the organization of experience is impossible. Whether one views these principles as conventions or as a priori 'forms,' they may therefore be described as hypotheses, but as hypotheses that, while lying at the basis of our actual physical sciences, at once refer to experience and help us in dealing with experience, and are yet neither confirmed nor refuted by the experiences which we possess or which we can hope to attain.

Characteristic remains, however, for our author, just as it does in a distinctly different way for Kant, the idea that without principles that at every stage go beyond exact confirmation through the experiences currently available, organizing experience is impossible. Whether you see these principles as conventions or as a priori 'forms,' they can be described as hypotheses, but these are hypotheses that, while underpinning our actual physical sciences, both refer to experience and assist us in navigating experience, and yet are neither confirmed nor disproved by the experiences we have or those we might achieve.

Three special instances or classes of instances, according to our author's account, may be used as illustrations of this general type of hypotheses. They are: (1) The hypothesis of the existence of continuous extensive quanta in nature; (2) The principles of geometry; (3) The principles of mechanics and of the general theory of energy. In case of each of these special types of hypotheses we are at first disposed, apart from reflection, to say that we find the world to be thus or thus, so that, for instance, we can confirm the thesis according to which nature contains continuous magnitudes; or can prove or disprove the physical truth of the postulates of Euclidean geometry; or can confirm by definite experience the objective validity of the principles of mechanics. A closer examination reveals, according to our author, the incorrectness of all such opinions. Hypotheses of these various special types are needed; and their usefulness can be empirically shown. They are in touch with experience; and that they are not merely arbitrary conventions is also verifiable. They are not a priori necessities; and we can easily conceive intelligent beings whose experience could be best interpreted without using these hypotheses. Yet these hypotheses are not subject to direct confirmation or refutation by experience. They stand then in sharp contrast to the scientific hypotheses of the other, and more frequently recognized, type, i. e., to the hypotheses which can be tested by a definite appeal to experience. To these other hypotheses our author attaches, of course, great importance. His treatment of them is full of a living appreciation of the significance of empirical investigation. But the central[Pg 19] problem of the logic of science thus becomes the problem of the relation between the two fundamentally distinct types of hypotheses, i. e., between those which can not be verified or refuted through experience, and those which can be empirically tested.

Three specific examples or categories of examples, according to our author's explanation, can be used to illustrate this general type of hypotheses. They are: (1) The hypothesis of the existence of continuous extensive quanta in nature; (2) The principles of geometry; (3) The principles of mechanics and the general theory of energy. Initially, we might feel inclined, without much reflection, to say that we find the world to be in a certain way, so that, for example, we can support the idea that nature has continuous magnitudes; or we can prove or disprove the physical validity of the postulates of Euclidean geometry; or we can validate through specific experiences the objective authority of the principles of mechanics. However, a deeper investigation indicates, according to our author, that such beliefs are mistaken. These different types of hypotheses are necessary, and their practical value can be demonstrated empirically. They relate to experience, and it can also be shown that they are not merely arbitrary conventions. They are not a priori necessities; we can easily imagine intelligent beings whose experiences could be best understood without using these hypotheses. Yet these hypotheses are not subject to direct confirmation or disproof through experience. They sharply contrast with scientific hypotheses of the other, more commonly recognized type, i.e., those hypotheses that can be tested through a definite appeal to experience. Our author places great importance on these other hypotheses, and his discussion of them shows a deep appreciation for the significance of empirical investigation. Thus, the central[Pg 19] issue in the logic of science becomes the problem of the relationship between the two fundamentally different types of hypotheses, i.e., those that cannot be verified or disproven through experience, and those that can be empirically tested.

IV

The detailed treatment which M. Poincaré gives to the problem thus defined must be learned from his text. It is no part of my purpose to expound, to defend or to traverse any of his special conclusions regarding this matter. Yet I can not avoid observing that, while M. Poincaré strictly confines his illustrations and his expressions of opinion to those regions of science wherein, as special investigator, he is himself most at home, the issues which he thus raises regarding the logic of science are of even more critical importance and of more impressive interest when one applies M. Poincaré's methods to the study of the concepts and presuppositions of the organic and of the historical and social sciences, than when one confines one's attention, as our author here does, to the physical sciences. It belongs to the province of an introduction like the present to point out, however briefly and inadequately, that the significance of our author's ideas extends far beyond the scope to which he chooses to confine their discussion.

The in-depth analysis that M. Poincaré provides on the problem as defined needs to be taken from his text. I don’t intend to explain, defend, or critique any of his specific conclusions on this matter. However, I can’t help but notice that while M. Poincaré limits his examples and opinions to the areas of science where he is most knowledgeable, the issues he raises about the logic of science become even more crucial and fascinating when you apply his methods to the concepts and assumptions of the organic, historical, and social sciences, rather than focusing solely, as he does, on the physical sciences. It's the role of an introduction like this one to briefly and modestly highlight that the importance of the author’s ideas goes well beyond the areas he chooses to discuss.

The historical sciences, and in fact all those sciences such as geology, and such as the evolutionary sciences in general, undertake theoretical constructions which relate to past time. Hypotheses relating to the more or less remote past stand, however, in a position which is very interesting from the point of view of the logic of science. Directly speaking, no such hypothesis is capable of confirmation or of refutation, because we can not return into the past to verify by our own experience what then happened. Yet indirectly, such hypotheses may lead to predictions of coming experience. These latter will be subject to control. Thus, Schliemann's confidence that the legend of Troy had a definite historical foundation led to predictions regarding what certain excavations would reveal. In a sense somewhat different from that which filled Schliemann's enthusiastic mind, these predictions proved verifiable. The result has been a considerable[Pg 20] change in the attitude of historians toward the legend of Troy. Geological investigation leads to predictions regarding the order of the strata or the course of mineral veins in a district, regarding the fossils which may be discovered in given formations, and so on. These hypotheses are subject to the control of experience. The various theories of evolutionary doctrine include many hypotheses capable of confirmation and of refutation by empirical tests. Yet, despite all such empirical control, it still remains true that whenever a science is mainly concerned with the remote past, whether this science be archeology, or geology, or anthropology, or Old Testament history, the principal theoretical constructions always include features which no appeal to present or to accessible future experience can ever definitely test. Hence the suspicion with which students of experimental science often regard the theoretical constructions of their confrères of the sciences that deal with the past. The origin of the races of men, of man himself, of life, of species, of the planet; the hypotheses of anthropologists, of archeologists, of students of 'higher criticism'—all these are matters which the men of the laboratory often regard with a general incredulity as belonging not at all to the domain of true science. Yet no one can doubt the importance and the inevitableness of endeavoring to apply scientific method to these regions also. Science needs theories regarding the past history of the world. And no one who looks closer into the methods of these sciences of past time can doubt that verifiable and unverifiable hypotheses are in all these regions inevitably interwoven; so that, while experience is always the guide, the attitude of the investigator towards experience is determined by interests which have to be partially due to what I should call that 'internal meaning,' that human interest in rational theoretical construction which inspires the scientific inquiry; and the theoretical constructions which prevail in such sciences are neither unbiased reports of the actual constitution of an external reality, nor yet arbitrary constructions of fancy. These constructions in fact resemble in a measure those which M. Poincaré in this book has analyzed in the case of geometry. They are constructions molded, but not predetermined in their details, by experience. We report facts; we let the facts speak; but we, as[Pg 21] we investigate, in the popular phrase, 'talk back' to the facts. We interpret as well as report. Man is not merely made for science, but science is made for man. It expresses his deepest intellectual needs, as well as his careful observations. It is an effort to bring internal meanings into harmony with external verifications. It attempts therefore to control, as well as to submit, to conceive with rational unity, as well as to accept data. Its arts are those directed towards self-possession as well as towards an imitation of the outer reality which we find. It seeks therefore a disciplined freedom of thought. The discipline is as essential as the freedom; but the latter has also its place. The theories of science are human, as well as objective, internally rational, as well as (when that is possible) subject to external tests.

The historical sciences, and really all sciences like geology and evolutionary sciences, build theoretical models about the past. However, hypotheses about the more distant past have a particularly intriguing position in terms of scientific logic. Directly, no hypothesis can be confirmed or refuted because we can't go back in time to verify through our own experience what happened. Yet indirectly, these hypotheses can lead to predictions about future experiences, which can be tested. For example, Schliemann's belief that the legend of Troy was based on historical fact led to predictions about what certain digs would uncover. In a different way from how Schliemann envisioned it, these predictions turned out to be verifiable. This has significantly changed historians' attitudes toward the legend of Troy. Geological studies lead to predictions about the sequence of rock layers, the location of mineral veins, and the types of fossils found in specific formations, all of which can be tested. The various theories of evolutionary science include many hypotheses that can be confirmed or refuted through empirical testing. However, even with all this empirical control, it remains true that when a science primarily focuses on the distant past—whether it's archaeology, geology, anthropology, or Old Testament history—the main theoretical models often contain elements that can't truly be tested against present or easily observable future experiences. This leads to skepticism among experimental scientists regarding the theoretical frameworks used by their peers who study the past. The origins of human races, humanity itself, life, species, and the planet; the hypotheses put forward by anthropologists, archaeologists, and ‘higher critics’ are often met with general disbelief from lab scientists who feel these topics don't belong to genuine science. Nevertheless, nobody can deny the importance and necessity of applying scientific methods to these areas as well. Science requires theories about the past history of the world. Anyone who examines these methods closely can see that verifiable and unverifiable hypotheses are inevitably intertwined in these subjects. Thus, while experience always serves as a guide, the investigator's attitude toward experience is influenced by interests connected to what I would call that 'internal meaning'—the human drive for rational theoretical frameworks that fuels scientific inquiry. The theoretical models that prevail in these fields are not simple, unbiased reflections of actual external reality nor are they purely fanciful constructions. They somewhat resemble the models M. Poincaré analyzed regarding geometry. They are shaped by experience but not predetermined in detail. We report facts; we let the facts communicate; yet we, as [Pg 21] we investigate, engage with the facts. We interpret as well as report. Humans are not just made for science; science is made for humans. It reflects our deepest intellectual needs as well as our careful observations. It strives to align internal meanings with external validations. Therefore, it seeks to control but also to submit, to conceive with rational unity while accepting data. Its techniques focus on self-awareness as well as mimicking the outer reality we encounter. Thus, it seeks a disciplined freedom of thought. Both discipline and freedom are essential; each has its own important role. Scientific theories are both human and objective, internally logical and, when possible, subject to external validation.

In a field very different from that of the historical sciences, namely, in a science of observation and of experiment, which is at the same time an organic science, I have been led in the course of some study of the history of certain researches to notice the existence of a theoretical conception which has proved extremely fruitful in guiding research, but which apparently resembles in a measure the type of hypotheses of which M. Poincaré speaks when he characterizes the principles of mechanics and of the theory of energy. I venture to call attention here to this conception, which seems to me to illustrate M. Poincaré's view of the functions of hypothesis in scientific work.

In a field that's quite different from the historical sciences—specifically, in the science of observation and experimentation, which is also an organic science—I’ve come across a theoretical idea during my study of the history of certain research that has proven to be incredibly helpful in guiding studies. This idea seems to somewhat resemble the type of hypotheses that M. Poincaré references when he describes the principles of mechanics and energy theory. I want to highlight this concept, which I believe illustrates M. Poincaré's perspective on the role of hypotheses in scientific work.

The modern science of pathology is usually regarded as dating from the earlier researches of Virchow, whose 'Cellular Pathology' was the outcome of a very careful and elaborate induction. Virchow, himself, felt a strong aversion to mere speculation. He endeavored to keep close to observation, and to relieve medical science from the control of fantastic theories, such as those of the Naturphilosophen had been. Yet Virchow's researches were, as early as 1847, or still earlier, already under the guidance of a theoretical presupposition which he himself states as follows: "We have learned to recognize," he says, "that diseases are not autonomous organisms, that they are no entities that have entered into the body, that they are no parasites which take root in the body, but that they merely show us the course of[Pg 22] the vital processes under altered conditions" ('dasz sie nur Ablauf der Lebenserscheinungen unter veränderten Bedingungen darstellen').

The modern science of pathology is usually seen as starting with the earlier research of Virchow, whose 'Cellular Pathology' resulted from very careful and thorough observations. Virchow himself had a strong dislike for mere speculation. He aimed to stay grounded in observation and to free medical science from the influence of fanciful theories, like those proposed by the Naturphilosophen. However, as early as 1847, or even earlier, Virchow's research was already guided by a theoretical assumption, which he expressed as follows: "We have learned to recognize," he states, "that diseases are not independent organisms, that they are not entities that enter the body, that they are not parasites that establish themselves within the body, but that they merely show us the course of [Pg 22] the vital processes under altered conditions" ('dasz sie nur Ablauf der Lebenserscheinungen unter veränderten Bedingungen darstellen').

The enormous importance of this theoretical presupposition for all the early successes of modern pathological investigation is generally recognized by the experts. I do not doubt this opinion. It appears to be a commonplace of the history of this science. But in Virchow's later years this very presupposition seemed to some of his contemporaries to be called in question by the successes of recent bacteriology. The question arose whether the theoretical foundations of Virchow's pathology had not been set aside. And in fact the theory of the parasitical origin of a vast number of diseased conditions has indeed come upon an empirical basis to be generally recognized. Yet to the end of his own career Virchow stoutly maintained that in all its essential significance his own fundamental principle remained quite untouched by the newer discoveries. And, as a fact, this view could indeed be maintained. For if diseases proved to be the consequences of the presence of parasites, the diseases themselves, so far as they belonged to the diseased organism, were still not the parasites, but were, as before, the reaction of the organism to the veränderte Bedingungen which the presence of the parasites entailed. So Virchow could well insist. And if the famous principle in question is only stated with sufficient generality, it amounts simply to saying that if a disease involves a change in an organism, and if this change is subject to law at all, then the nature of the organism and the reaction of the organism to whatever it is which causes the disease must be understood in case the disease is to be understood.

The huge importance of this theoretical assumption for all the early achievements of modern pathological research is generally acknowledged by experts. I have no doubt about this view. It seems to be a standard part of the history of this science. However, in the later years of Virchow's life, some of his contemporaries questioned whether this very assumption was being challenged by the successes of recent bacteriology. The issue arose as to whether the theoretical foundations of Virchow's pathology had been dismissed. In fact, the theory that many diseases have a parasitic origin has indeed gained empirical support that is widely accepted. Yet, throughout his career, Virchow firmly argued that his fundamental principle remained unaffected by these newer discoveries. This perspective can certainly be defended. For while diseases may indeed be caused by the presence of parasites, the diseases themselves, as far as they relate to the affected organism, are still not the parasites but rather the organism's response to the altered conditions that the parasites create. Thus, Virchow could confidently assert this. If the well-known principle in question is stated with enough generality, it essentially says that if a disease leads to a change in an organism, and if that change follows any laws, then understanding the nature of the organism and its reaction to whatever causes the disease is essential for comprehending the disease itself.

For this very reason, however, Virchow's theoretical principle in its most general form could be neither confirmed nor refuted by experience. It would remain empirically irrefutable, so far as I can see, even if we should learn that the devil was the true cause of all diseases. For the devil himself would then simply predetermine the veränderte Bedingungen to which the diseased organism would be reacting. Let bullets or bacteria, poisons or compressed air, or the devil be the Bedingungen to which a diseased organism reacts, the postulate that Virchow[Pg 23] states in the passage just quoted will remain irrefutable, if only this postulate be interpreted to meet the case. For the principle in question merely says that whatever entity it may be, bullet, or poison, or devil, that affects the organism, the disease is not that entity, but is the resulting alteration in the process of the organism.

For this reason, however, Virchow's theoretical principle in its most general form could neither be confirmed nor disproven by experience. It would remain empirically unchallengeable, as far as I can see, even if we found out that the devil was the true cause of all diseases. The devil himself would then simply set the veränderte Bedingungen that the diseased organism would be responding to. Whether it's bullets, bacteria, poisons, compressed air, or the devil acting as the Bedingungen that a diseased organism reacts to, Virchow's statement in the quoted passage will remain unrefutable, as long as we interpret this statement appropriately. The principle in question merely states that no matter what entity it is—be it a bullet, poison, or the devil—that affects the organism, the disease is not that entity but the subsequent change in the organism's processes.

I insist, then, that this principle of Virchow's is no trial supposition, no scientific hypothesis in the narrower sense—capable of being submitted to precise empirical tests. It is, on the contrary, a very precious leading idea, a theoretical interpretation of phenomena, in the light of which observations are to be made—'a regulative principle' of research. It is equivalent to a resolution to search for those detailed connections which link the processes of disease to the normal process of the organism. Such a search undertakes to find the true unity, whatever that may prove to be, wherein the pathological and the normal processes are linked. Now without some such leading idea, the cellular pathology itself could never have been reached; because the empirical facts in question would never have been observed. Hence this principle of Virchow's was indispensable to the growth of his science. Yet it was not a verifiable and not a refutable hypothesis. One value of unverifiable and irrefutable hypotheses of this type lies, then, in the sort of empirical inquiries which they initiate, inspire, organize and guide. In these inquiries hypotheses in the narrower sense, that is, trial propositions which are to be submitted to definite empirical control, are indeed everywhere present. And the use of the other sort of principles lies wholly in their application to experience. Yet without what I have just proposed to call the 'leading ideas' of a science, that is, its principles of an unverifiable and irrefutable character, suggested, but not to be finally tested, by experience, the hypotheses in the narrower sense would lack that guidance which, as M. Poincaré has shown, the larger ideas of science give to empirical investigation.

I assert that Virchow's principle is not just a trial assumption or a scientific hypothesis in the strict sense that can be subjected to exact empirical tests. Rather, it is a valuable leading idea, a theoretical way to interpret phenomena, which guides how observations should be made—'a regulative principle' of research. It represents a commitment to searching for the specific connections that link disease processes to the normal functioning of the organism. This search aims to uncover the true unity, whatever that may be, that connects pathological and normal processes. Without such a leading idea, cellular pathology could not have been developed because the necessary empirical facts would not have been observed. Thus, this principle was essential for the advancement of his field. However, it was neither a verifiable nor a refutable hypothesis. The significance of unverifiable and irrefutable hypotheses like this one lies in the empirical inquiries they prompt, inspire, organize, and direct. In these inquiries, hypotheses in the narrower sense—trial propositions meant to undergo specific empirical testing—are always present. The utility of the other type of principles is entirely in how they are applied to experience. Yet, without what I would call the 'leading ideas' of a science—its principles that are unverifiable and irrefutable, suggested but not meant to be definitively tested by experience—hypotheses in the narrower sense would lack the direction that, as M. Poincaré has demonstrated, larger scientific ideas provide to empirical investigation.

V

I have dwelt, no doubt, at too great length upon one aspect only of our author's varied and well-balanced discussion of the[Pg 24] problems and concepts of scientific theory. Of the hypotheses in the narrower sense and of the value of direct empirical control, he has also spoken with the authority and the originality which belong to his position. And in dealing with the foundations of mathematics he has raised one or two questions of great philosophical import into which I have no time, even if I had the right, to enter here. In particular, in speaking of the essence of mathematical reasoning, and of the difficult problem of what makes possible novel results in the field of pure mathematics, M. Poincaré defends a thesis regarding the office of 'demonstration by recurrence'—a thesis which is indeed disputable, which has been disputed and which I myself should be disposed, so far as I at present understand the matter, to modify in some respects, even in accepting the spirit of our author's assertion. Yet there can be no doubt of the importance of this thesis, and of the fact that it defines a characteristic that is indeed fundamental in a wide range of mathematical research. The philosophical problems that lie at the basis of recurrent proofs and processes are, as I have elsewhere argued, of the most fundamental importance.

I have undoubtedly spent too much time focusing on just one aspect of our author's diverse and balanced discussion of the[Pg 24] issues and ideas in scientific theory. He has also addressed the hypotheses in a more specific sense and the importance of direct empirical control with the authority and originality that come from his position. When discussing the foundations of mathematics, he raises one or two questions of great philosophical significance that I unfortunately don't have time to explore, even if I had the right to do so. Specifically, in talking about the essence of mathematical reasoning and the challenging problem of what enables new results in pure mathematics, M. Poincaré supports a thesis about the role of 'demonstration by recurrence'—a thesis that is indeed open to debate, has been contested, and which I would personally feel inclined to modify in some ways, even while embracing the essence of our author's claim. Nevertheless, there is no doubt about the importance of this thesis and how it outlines a fundamental characteristic in a wide range of mathematical research. The philosophical issues underlying recurrent proofs and processes are, as I've argued elsewhere, of utmost importance.

These, then, are a few hints relating to the significance of our author's discussion, and a few reasons for hoping that our own students will profit by the reading of the book as those of other nations have already done.

These are a few hints about the importance of our author's discussion and a few reasons to hope that our own students will benefit from reading the book, just like students from other countries have already done.

Of the person and of the life-work of our author a few words are here, in conclusion, still in place, addressed, not to the students of his own science, to whom his position is well known, but to the general reader who may seek guidance in these pages.

Of the person and the career of our author, a few closing words are appropriate here, directed not at the students of his field, who are already familiar with his status, but at the general reader who might look for guidance in these pages.

Jules Henri Poincaré was born at Nancy, in 1854, the son of a professor in the Faculty of Medicine at Nancy. He studied at the École Polytechnique and at the École des Mines, and later received his doctorate in mathematics in 1879. In 1883 he began courses of instruction in mathematics at the École Polytechnique; in 1886 received a professorship of mathematical physics in the Faculty of Sciences at Paris; then became member of the Academy of Sciences at Paris, in 1887, and devoted his life to instruction and investigation in the regions of pure mathematics, of mathematical physics and of celestial mechanics. His list of published treatises relating to[Pg 25] various branches of his chosen sciences is long; and his original memoirs have included several momentous investigations, which have gone far to transform more than one branch of research. His presence at the International Congress of Arts and Science in St. Louis was one of the most noticeable features of that remarkable gathering of distinguished foreign guests. In Poincaré the reader meets, then, not one who is primarily a speculative student of general problems for their own sake, but an original investigator of the highest rank in several distinct, although interrelated, branches of modern research. The theory of functions—a highly recondite region of pure mathematics—owes to him advances of the first importance, for instance, the definition of a new type of functions. The 'problem of the three bodies,' a famous and fundamental problem of celestial mechanics, has received from his studies a treatment whose significance has been recognized by the highest authorities. His international reputation has been confirmed by the conferring of more than one important prize for his researches. His membership in the most eminent learned societies of various nations is widely extended; his volumes bearing upon various branches of mathematics and of mathematical physics are used by special students in all parts of the learned world; in brief, he is, as geometer, as analyst and as a theoretical physicist, a leader of his age.

Jules Henri Poincaré was born in Nancy in 1854, the son of a professor at the Faculty of Medicine in Nancy. He studied at the École Polytechnique and the École des Mines, and later earned his doctorate in mathematics in 1879. In 1883, he started teaching mathematics at the École Polytechnique; in 1886, he became a professor of mathematical physics at the Faculty of Sciences in Paris; and in 1887, he was elected to the Academy of Sciences in Paris. He dedicated his life to teaching and research in pure mathematics, mathematical physics, and celestial mechanics. His list of published papers covering various branches of his chosen fields is extensive, and his original articles include several important investigations that have significantly impacted multiple areas of research. His presence at the International Congress of Arts and Science in St. Louis was one of the standout highlights of that notable gathering of distinguished international guests. In Poincaré, the reader finds not just a speculative thinker interested in general problems for their own sake, but an original investigator of the highest caliber across several distinct, yet interconnected, branches of modern research. The theory of functions—a complex area of pure mathematics—benefits from his crucial advances, such as the definition of a new type of functions. The 'problem of the three bodies,' a well-known and fundamental issue in celestial mechanics, has been significantly advanced by his studies, and this importance has been acknowledged by leading authorities. His international reputation has been supported by the awarding of several important prizes for his research. He is a member of many prestigious scholarly societies from various countries; his books on different areas of mathematics and mathematical physics are utilized by advanced students all over the academic world. In short, he is, as a geometer, analyst, and theoretical physicist, a leader of his time.

Meanwhile, as contributor to the philosophical discussion of the bases and methods of science, M. Poincaré has long been active. When, in 1893, the admirable Revue de Métaphysique et de Morale began to appear, M. Poincaré was soon found amongst the most satisfactory of the contributors to the work of that journal, whose office it has especially been to bring philosophy and the various special sciences (both natural and moral) into a closer mutual understanding. The discussions brought together in the present volume are in large part the outcome of M. Poincaré's contributions to the Revue de Métaphysique et de Morale. The reader of M. Poincaré's book is in presence, then, of a great special investigator who is also a philosopher.

Meanwhile, M. Poincaré has been actively contributing to the philosophical discussion on the foundations and methods of science for a long time. When the impressive Revue de Métaphysique et de Morale started publishing in 1893, M. Poincaré quickly became one of the most valued contributors to the journal, which has aimed to foster a deeper understanding between philosophy and various specialized sciences, both natural and social. The discussions compiled in this volume largely stem from M. Poincaré's contributions to the Revue de Métaphysique et de Morale. Thus, the reader of M. Poincaré's book encounters a significant researcher who is also a philosopher.


SCIENCE AND HYPOTHESIS

INTRODUCTION

For a superficial observer, scientific truth is beyond the possibility of doubt; the logic of science is infallible, and if the scientists are sometimes mistaken, this is only from their mistaking its rules.

For someone who only scratches the surface, scientific truth seems indisputable; the logic of science is flawless, and if scientists occasionally get it wrong, it's simply because they've misunderstood its principles.

"The mathematical verities flow from a small number of self-evident propositions by a chain of impeccable reasonings; they impose themselves not only on us, but on nature itself. They fetter, so to speak, the Creator and only permit him to choose between some relatively few solutions. A few experiments then will suffice to let us know what choice he has made. From each experiment a crowd of consequences will follow by a series of mathematical deductions, and thus each experiment will make known to us a corner of the universe."

"The truths of mathematics come from a small set of obvious principles through a series of flawless logic; they are imposed not just on us but on nature itself. They bind, so to speak, the Creator and only allow Him to choose from a limited number of solutions. A few experiments will be enough to show us what choice has been made. Each experiment will lead to a multitude of consequences through a series of mathematical deductions, and thus each experiment will reveal a part of the universe to us."

Behold what is for many people in the world, for scholars getting their first notions of physics, the origin of scientific certitude. This is what they suppose to be the rôle of experimentation and mathematics. This same conception, a hundred years ago, was held by many savants who dreamed of constructing the world with as little as possible taken from experiment.

Behold what is, for many people in the world, the starting point of scientific certainty for students first learning about physics. This is what they think experimentation and math should do. The same idea, a hundred years ago, was shared by many scientists who envisioned building a model of the world with minimal reliance on experiments.

On a little more reflection it was perceived how great a place hypothesis occupies; that the mathematician can not do without it, still less the experimenter. And then it was doubted if all these constructions were really solid, and believed that a breath would overthrow them. To be skeptical in this fashion is still to be superficial. To doubt everything and to believe everything are two equally convenient solutions; each saves us from thinking.

Upon further reflection, it became clear how important a role hypothesis plays; the mathematician can't do without it, and neither can the experimenter. Then doubts arose about whether all these constructions were truly solid, leading to the belief that even a slightest breath could bring them down. Being skeptical in this way is still a superficial stance. To doubt everything and to believe anything are two equally easy solutions; both keep us from having to think.

Instead of pronouncing a summary condemnation, we ought therefore to examine with care the rôle of hypothesis; we shall then recognize, not only that it is necessary, but that usually it is[Pg 28] legitimate. We shall also see that there are several sorts of hypotheses; that some are verifiable, and once confirmed by experiment become fruitful truths; that others, powerless to lead us astray, may be useful to us in fixing our ideas; that others, finally, are hypotheses only in appearance and are reducible to disguised definitions or conventions.

Instead of making a quick judgment, we should carefully examine the role of hypothesis; we will then realize not only that it is necessary, but that it is usually legitimate. We will also see that there are different types of hypotheses: some are verifiable, and once confirmed by experimentation, they turn into valuable truths; others, while not misleading us, can help clarify our thoughts; and finally, some are only hypotheses in name and can be reduced to disguised definitions or conventions.

These last are met with above all in mathematics and the related sciences. Thence precisely it is that these sciences get their rigor; these conventions are the work of the free activity of our mind, which, in this domain, recognizes no obstacle. Here our mind can affirm, since it decrees; but let us understand that while these decrees are imposed upon our science, which, without them, would be impossible, they are not imposed upon nature. Are they then arbitrary? No, else were they sterile. Experiment leaves us our freedom of choice, but it guides us by aiding us to discern the easiest way. Our decrees are therefore like those of a prince, absolute but wise, who consults his council of state.

These are mostly found in mathematics and related sciences. This is where these sciences derive their rigor; these conventions come from the free activity of our minds, which face no obstacles in this realm. Here, our minds can affirm because they decide; however, we should understand that while these decisions are essential for our science, which would be impossible without them, they are not imposed on nature. Are they then arbitrary? No, because if they were, they would be unproductive. Experiment allows us the freedom to choose, but it also helps us find the simplest path. Our decisions are therefore like those of a wise but absolute ruler who consults his council of state.

Some people have been struck by this character of free convention recognizable in certain fundamental principles of the sciences. They have wished to generalize beyond measure, and, at the same time, they have forgotten that liberty is not license. Thus they have reached what is called nominalism, and have asked themselves if the savant is not the dupe of his own definitions and if the world he thinks he discovers is not simply created by his own caprice.[1] Under these conditions science would be certain, but deprived of significance.

Some people have been amazed by the aspect of free convention that can be seen in certain key principles of science. They have tried to generalize excessively and, at the same time, they have overlooked that freedom isn't the same as giving in to chaos. As a result, they've ended up with what is called nominalism, questioning whether the scholar is just fooling himself with his own definitions and if the world he thinks he is uncovering is merely a product of his own whims.[1] In this situation, science would be certain, but lacking in meaning.

If this were so, science would be powerless. Now every day we see it work under our very eyes. That could not be if it taught us nothing of reality. Still, the things themselves are not what it can reach, as the naïve dogmatists think, but only the relations between things. Outside of these relations there is no knowable reality.

If that were the case, science would be ineffective. But every day we see it in action right before our eyes. That wouldn't be possible if it taught us nothing about reality. However, the actual things aren't what it can grasp, as the simple-minded dogmatists believe, but only the connections between things. Outside of these connections, there's no reality we can know.

Such is the conclusion to which we shall come, but for that we must review the series of sciences from arithmetic and geometry to mechanics and experimental physics.[Pg 29]

This is the conclusion we will reach, but to get there, we need to look at the progression of sciences from arithmetic and geometry to mechanics and experimental physics.[Pg 29]

What is the nature of mathematical reasoning? Is is really deductive, as is commonly supposed? A deeper analysis shows us that it is not, that it partakes in a certain measure of the nature of inductive reasoning, and just because of this is it so fruitful. None the less does it retain its character of rigor absolute; this is the first thing that had to be shown.

What is the nature of mathematical reasoning? Is it really deductive, as people commonly think? A deeper look reveals that it isn’t; it actually involves a certain degree of inductive reasoning, and that's what makes it so effective. Nevertheless, it still maintains its character of absolute rigor; this is the first point that needed to be established.

Knowing better now one of the instruments which mathematics puts into the hands of the investigator, we had to analyze another fundamental notion, that of mathematical magnitude. Do we find it in nature, or do we ourselves introduce it there? And, in this latter case, do we not risk marring everything? Comparing the rough data of our senses with that extremely complex and subtile concept which mathematicians call magnitude, we are forced to recognize a difference; this frame into which we wish to force everything is of our own construction; but we have not made it at random. We have made it, so to speak, by measure and therefore we can make the facts fit into it without changing what is essential in them.

Knowing more now about one of the tools that mathematics gives to the researcher, we needed to examine another key idea: mathematical magnitude. Is it something we find in nature, or do we create it ourselves? And if it is the latter, aren’t we at risk of ruining everything? When we compare the raw data from our senses with the incredibly complex and subtle idea that mathematicians call magnitude, we have to acknowledge there’s a difference; the framework we try to fit everything into is something we’ve constructed ourselves, but it’s not random. We’ve built it, so to speak, through measurement, and because of that, we can fit the facts into it without altering what’s fundamentally important about them.

Another frame which we impose on the world is space. Whence come the first principles of geometry? Are they imposed on us by logic? Lobachevski has proved not, by creating non-Euclidean geometry. Is space revealed to us by our senses? Still no, for the space our senses could show us differs absolutely from that of the geometer. Is experience the source of geometry? A deeper discussion will show us it is not. We therefore conclude that the first principles of geometry are only conventions; but these conventions are not arbitrary and if transported into another world (that I call the non-Euclidean world and seek to imagine), then we should have been led to adopt others.

Another way we understand the world is through space. Where do the basic principles of geometry come from? Are they dictated by logic? Lobachevski demonstrated that they are not, by creating non-Euclidean geometry. Is space shown to us by our senses? Still no, because the space our senses perceive is completely different from that of the geometer. Is experience the foundation of geometry? A deeper analysis will reveal that it is not. Therefore, we conclude that the basic principles of geometry are merely conventions; however, these conventions are not arbitrary, and if we were to be placed in another world (which I call the non-Euclidean world and try to visualize), then we would have been led to adopt different conventions.

In mechanics we should be led to analogous conclusions, and should see that the principles of this science, though more directly based on experiment, still partake of the conventional character of the geometric postulates. Thus far nominalism triumphs; but now we arrive at the physical sciences, properly so called. Here the scene changes; we meet another sort of hypotheses and we see their fertility. Without doubt, at first blush, the theories seem to us fragile, and the history of science proves to us how ephemeral they are; yet they do not entirely perish,[Pg 30] and of each of them something remains. It is this something we must seek to disentangle, since there and there alone is the veritable reality.

In mechanics, we should come to similar conclusions and recognize that while the principles of this science are more directly based on experiments, they still share the conventional nature of geometric postulates. So far, nominalism is winning; but now we arrive at the physical sciences in the true sense. Here, things change; we encounter a different type of hypotheses and see their richness. At first glance, these theories may seem fragile, and the history of science shows us how temporary they can be; yet they don't completely disappear,[Pg 30] and something remains of each. It is this remnant that we must strive to unravel, for there lies the true reality.

The method of the physical sciences rests on the induction which makes us expect the repetition of a phenomenon when the circumstances under which it first happened are reproduced. If all these circumstances could be reproduced at once, this principle could be applied without fear; but that will never happen; some of these circumstances will always be lacking. Are we absolutely sure they are unimportant? Evidently not. That may be probable, it can not be rigorously certain. Hence the important rôle the notion of probability plays in the physical sciences. The calculus of probabilities is therefore not merely a recreation or a guide to players of baccarat, and we must seek to go deeper with its foundations. Under this head I have been able to give only very incomplete results, so strongly does this vague instinct which lets us discern probability defy analysis.

The method of the physical sciences relies on induction, which leads us to expect that a phenomenon will occur again when the conditions that produced it are replicated. If all of these conditions could be recreated at once, we could apply this principle without hesitation; however, that will never happen; some of these conditions will always be missing. Are we completely sure they don't matter? Clearly not. While it may seem likely, it can't be said with absolute certainty. This is why the concept of probability is so crucial in the physical sciences. The study of probabilities is not just a pastime or a tool for gamblers, and we need to explore its foundations more thoroughly. In this regard, I've only been able to present partial results, as this vague instinct that allows us to recognize probability is resistant to analysis.

After a study of the conditions under which the physicist works, I have thought proper to show him at work. For that I have taken instances from the history of optics and of electricity. We shall see whence have sprung the ideas of Fresnel, of Maxwell, and what unconscious hypotheses were made by Ampère and the other founders of electrodynamics.

After looking into the conditions in which physicists work, I decided to show them in action. For that, I’ve drawn examples from the history of optics and electricity. We’ll explore where the ideas of Fresnel and Maxwell originated, as well as the unintentional assumptions made by Ampère and the other pioneers of electrodynamics.


PART I

NUMBER AND MAGNITUDE


CHAPTER I

On the Nature of Mathematical Reasoning

I

The very possibility of the science of mathematics seems an insoluble contradiction. If this science is deductive only in appearance, whence does it derive that perfect rigor no one dreams of doubting? If, on the contrary, all the propositions it enunciates can be deduced one from another by the rules of formal logic, why is not mathematics reduced to an immense tautology? The syllogism can teach us nothing essentially new, and, if everything is to spring from the principle of identity, everything should be capable of being reduced to it. Shall we then admit that the enunciations of all those theorems which fill so many volumes are nothing but devious ways of saying A is A?

The very idea of mathematics as a science seems like an impossible contradiction. If this science is only deductive on the surface, where does it get that perfect precision no one doubts? On the other hand, if all the statements it makes can be deduced from one another using formal logic, why isn't mathematics just an enormous tautology? The syllogism doesn't teach us anything fundamentally new, and if everything comes from the principle of identity, everything should be reducible to that. Should we then accept that the statements of all those theorems that fill so many volumes are just roundabout ways of saying A is A?

Without doubt, we can go back to the axioms, which are at the source of all these reasonings. If we decide that these can not be reduced to the principle of contradiction, if still less we see in them experimental facts which could not partake of mathematical necessity, we have yet the resource of classing them among synthetic a priori judgments. This is not to solve the difficulty, but only to baptize it; and even if the nature of synthetic judgments were for us no mystery, the contradiction would not have disappeared, it would only have moved back; syllogistic reasoning remains incapable of adding anything to the data given it: these data reduce themselves to a few axioms, and we should find nothing else in the conclusions.

Without a doubt, we can return to the basic principles that are the foundation of all this reasoning. If we decide that these cannot be simplified to the principle of contradiction, and even more so if we recognize them as experimental facts that cannot be part of mathematical necessity, we still have the option of categorizing them as synthetic a priori judgments. This does not resolve the issue, but merely renames it; and even if the nature of synthetic judgments were not a mystery to us, the contradiction would not vanish; it would just be pushed back. Syllogistic reasoning remains unable to add anything to the information given to it: this information boils down to a few basic principles, and we would find nothing more in the conclusions.

No theorem could be new if no new axiom intervened in its demonstration; reasoning could give us only the immediately[Pg 32] evident verities borrowed from direct intuition; it would be only an intermediary parasite, and therefore should we not have good reason to ask whether the whole syllogistic apparatus did not serve solely to disguise our borrowing?

No theorem can be considered new if there isn't a new axiom involved in its proof; reasoning can only provide us with the obvious truths we get directly from intuition; it would just be a kind of intermediary parasite. So, shouldn't we question whether the entire syllogistic framework is just a way to hide our reliance on those borrowed ideas?

The contradiction will strike us the more if we open any book on mathematics; on every page the author will announce his intention of generalizing some proposition already known. Does the mathematical method proceed from the particular to the general, and, if so, how then can it be called deductive?

The contradiction will stand out even more if we open any math book; on every page, the author will express their intent to generalize a known proposition. Does the mathematical method move from the specific to the general, and if it does, how can it be described as deductive?

If finally the science of number were purely analytic, or could be analytically derived from a small number of synthetic judgments, it seems that a mind sufficiently powerful could at a glance perceive all its truths; nay more, we might even hope that some day one would invent to express them a language sufficiently simple to have them appear self-evident to an ordinary intelligence.

If the science of numbers were completely analytical or could be derived analytically from a few synthetic judgments, it seems that a strong enough mind could grasp all its truths at once; moreover, we might even hope that one day someone will create a language simple enough to make them seem obvious to an average person.

If we refuse to admit these consequences, it must be conceded that mathematical reasoning has of itself a sort of creative virtue and consequently differs from the syllogism.

If we refuse to acknowledge these consequences, we have to accept that mathematical reasoning has its own kind of creative value and is therefore different from syllogism.

The difference must even be profound. We shall not, for example, find the key to the mystery in the frequent use of that rule according to which one and the same uniform operation applied to two equal numbers will give identical results.

The difference has to be significant. For instance, we won't discover the key to the mystery in the common application of that principle where the same consistent operation applied to two equal numbers yields the same results.

All these modes of reasoning, whether or not they be reducible to the syllogism properly so called, retain the analytic character, and just because of that are powerless.

All these ways of thinking, whether or not they can be reduced to a proper syllogism, keep an analytical nature, and for that reason, they are ineffective.

II

The discussion is old; Leibnitz tried to prove 2 and 2 make 4; let us look a moment at his demonstration.

The debate isn't new; Leibnitz attempted to show that 2 plus 2 equals 4; let's take a moment to look at his proof.

I will suppose the number 1 defined and also the operation x + 1 which consists in adding unity to a given number x.

I will assume that the number 1 is defined, along with the operation x + 1, which involves adding one to a given number x.

These definitions, whatever they be, do not enter into the course of the reasoning.

These definitions, whatever they are, do not factor into the reasoning process.

I define then the numbers 2, 3 and 4 by the equalities

I then define the numbers 2, 3, and 4 by the equalities

(1) 1 + 1 = 2;       (2) 2 + 1 = 3;       (3) 3 + 1 = 4.

(1) 1 + 1 = 2;       (2) 2 + 1 = 3;       (3) 3 + 1 = 4.

In the same way, I define the operation x + 2 by the relation:[Pg 33]

In the same way, I define the operation x + 2 by the relation:[Pg 33]

(4) x + 2 = (x + 1) + 1.

(4) x + 2 = (x + 1) + 1.

That presupposed, we have

That assumption made, we have

2 + 1 + 1 = 3 + 1(Definition 2),
3 + 1 = 4(Definition 3),
2 + 2 = (2 + 1) + 1    (Definition 4),

whence

where from

2 + 2 = 4 Q.E.D.

2 + 2 = 4 Q.E.D.

It can not be denied that this reasoning is purely analytic. But ask any mathematician: 'That is not a demonstration properly so called,' he will say to you: 'that is a verification.' We have confined ourselves to comparing two purely conventional definitions and have ascertained their identity; we have learned nothing new. Verification differs from true demonstration precisely because it is purely analytic and because it is sterile. It is sterile because the conclusion is nothing but the premises translated into another language. On the contrary, true demonstration is fruitful because the conclusion here is in a sense more general than the premises.

It can’t be denied that this reasoning is purely analytical. But ask any mathematician: "That’s not a proper demonstration," they will tell you: "That’s a verification." We’ve limited ourselves to comparing two purely conventional definitions and confirmed their similarity; we haven’t learned anything new. Verification differs from true demonstration precisely because it is purely analytical and because it lacks substance. It lacks substance because the conclusion is simply the premises expressed in another way. In contrast, true demonstration is productive because the conclusion is, in some sense, broader than the premises.

The equality 2 + 2 = 4 is thus susceptible of a verification only because it is particular. Every particular enunciation in mathematics can always be verified in this same way. But if mathematics could be reduced to a series of such verifications, it would not be a science. So a chess-player, for example, does not create a science in winning a game. There is no science apart from the general.

The equation 2 + 2 = 4 can only be verified because it’s specific. Every specific statement in mathematics can be verified in the same way. However, if math could be broken down into just a series of these verifications, it wouldn’t be considered a science. For instance, a chess player doesn’t create a science by winning a game. There’s no science without the general.

It may even be said the very object of the exact sciences is to spare us these direct verifications.

It could be said that the main goal of the exact sciences is to save us from these direct verifications.

III

Let us, therefore, see the geometer at work and seek to catch his process.

Let’s watch the geometer at work and try to understand his process.

The task is not without difficulty; it does not suffice to open a work at random and analyze any demonstration in it.

The task isn't easy; it's not enough to just open a book randomly and analyze any argument in it.

We must first exclude geometry, where the question is complicated by arduous problems relative to the rôle of the postulates, to the nature and the origin of the notion of space. For analogous reasons we can not turn to the infinitesimal analysis.[Pg 34] We must seek mathematical thought where it has remained pure, that is, in arithmetic.

We must first set aside geometry, where the issue gets complicated by tough problems related to the role of the postulates and the nature and origin of the concept of space. For similar reasons, we also can't focus on infinitesimal analysis.[Pg 34] We need to look for mathematical thought where it has stayed pure, which is in arithmetic.

A choice still is necessary; in the higher parts of the theory of numbers, the primitive mathematical notions have already undergone an elaboration so profound that it becomes difficult to analyze them.

A choice is still necessary; in the higher areas of number theory, the basic mathematical concepts have already been developed so deeply that it becomes hard to break them down.

It is, therefore, at the beginning of arithmetic that we must expect to find the explanation we seek, but it happens that precisely in the demonstration of the most elementary theorems the authors of the classic treatises have shown the least precision and rigor. We must not impute this to them as a crime; they have yielded to a necessity; beginners are not prepared for real mathematical rigor; they would see in it only useless and irksome subtleties; it would be a waste of time to try prematurely to make them more exacting; they must pass over rapidly, but without skipping stations, the road traversed slowly by the founders of the science.

It’s important to start with arithmetic to find the explanation we’re looking for, but the authors of classic texts have shown the least precision and rigor when demonstrating the most basic theorems. We shouldn’t blame them for this; they faced a necessity. Beginners aren’t ready for true mathematical rigor; they would only see it as tedious and unnecessary details. It would be a waste of time to try to make them more demanding too early; they need to move through the stages quickly, but not skip any, as they follow the path laid out slowly by the pioneers of the field.

Why is so long a preparation necessary to become habituated to this perfect rigor, which, it would seem, should naturally impress itself upon all good minds? This is a logical and psychological problem well worthy of study.

Why is such a long preparation needed to get used to this perfect discipline, which should naturally have an impact on all good minds? This is a logical and psychological issue that deserves serious examination.

But we shall not take it up; it is foreign to our purpose; all I wish to insist on is that, not to fail of our purpose, we must recast the demonstrations of the most elementary theorems and give them, not the crude form in which they are left, so as not to harass beginners, but the form that will satisfy a skilled geometer.

But we won't delve into that; it's not relevant to our goal. What I want to emphasize is that, to achieve our aim, we need to reshape the proofs of the most basic theorems. We should present them not in the rough form they currently have, which could confuse beginners, but in a way that meets the expectations of an experienced geometer.

Definition of Addition.—I suppose already defined the operation x + 1, which consists in adding the number 1 to a given number x.

Definition of Addition.—I think I've already defined the operation x + 1, which is adding the number 1 to a specific number x.

This definition, whatever it be, does not enter into our subsequent reasoning.

This definition, whatever it is, doesn't factor into our later reasoning.

We now have to define the operation x + a, which consists in adding the number a to a given number x.

We now need to define the operation x + a, which involves adding the number a to a specific number x.

Supposing we have defined the operation

Supposing we have defined the operation

x + (a − 1),

x + (a - 1),

the operation x + a will be defined by the equality

the operation x + a will be defined by the equality

(1) x + a = [x + (a − 1)] + 1.

(1) x + a = [x + (a − 1)] + 1.

We shall know then what x + a is when we know what x + (a − 1) is, and as I have supposed that to start with we knew what x + 1 is, we can define successively and 'by recurrence' the operations x + 2, x + 3, etc.

We will know what x + a is once we know what x + (a − 1) is. Since I assumed that we initially knew what x + 1 is, we can successively define and 'recursively' find the operations x + 2, x + 3, and so on.

This definition deserves a moment's attention; it is of a particular nature which already distinguishes it from the purely logical definition; the equality (1) contains an infinity of distinct definitions, each having a meaning only when one knows the preceding.

This definition deserves a moment's attention; it has a unique nature that already sets it apart from a purely logical definition; the equality (1) includes an infinite number of distinct definitions, each of which makes sense only when you understand the previous one.

Properties of Addition.Associativity.—I say that

Properties of Addition.—Associativity.—I say that

a + (b + c) = (a + b) + c.

a + (b + c) = (a + b) + c.

In fact the theorem is true for c = 1; it is then written

In fact, the theorem is true for c = 1; it's then written

a + (b + 1) = (a + b) + 1,

a + (b + 1) = (a + b) + 1,

which, apart from the difference of notation, is nothing but the equality (1), by which I have just defined addition.

which, besides the difference in notation, is really just the equality (1), which I just defined as addition.

Supposing the theorem true for c = γ, I say it will be true for c = γ + 1.

Supposing the theorem is true for c = γ, I say it will also be true for c = γ + 1.

In fact, supposing

In fact, assuming

(a + b) + γ = a + (b + γ),

(a + b) + γ = a + (b + γ),

it follows that

therefore

[(a + b) + γ] + 1 = [a + (b + γ)] + 1

[(a + b) + γ] + 1 = [a + (b + γ)] + 1

or by definition (1)

or by definition (1)

(a + b) + (γ + 1) = a + (b + γ + 1) = a + [b + (γ + 1)],

(a + b) + (γ + 1) = a + (b + γ + 1) = a + [b + (γ + 1)],

which shows, by a series of purely analytic deductions, that the theorem is true for γ + 1.

which shows, through a series of purely analytical deductions, that the theorem is true for γ + 1.

Being true for c = 1, we thus see successively that so it is for c = 2, for c = 3, etc.

Being true for c = 1, we can see that it also holds for c = 2, for c = 3, and so on.

Commutativity.—1º I say that

Commutativity.—First, I say that

a + 1 = 1 + a.

a + 1 = 1 + a.

The theorem is evidently true for a = 1; we can verify by purely analytic reasoning that if it is true for a = γ it will be true for a = γ + 1; for then

The theorem is clearly true for a = 1; we can verify through straightforward analytic reasoning that if it holds for a = γ, it will also hold for a = γ + 1; for then

(γ + 1) + 1 = (1 + γ) + 1 = 1 + (γ + 1);

(γ + 1) + 1 = (1 + γ) + 1 = 1 + (γ + 1);

now it is true for a = 1, therefore it will be true for a = 2, for a = 3, etc., which is expressed by saying that the enunciated proposition is demonstrated by recurrence.[Pg 36]

now it is true for a = 1, therefore it will be true for a = 2, for a = 3, etc., which is expressed by saying that the stated proposition is proven by recurrence.[Pg 36]

2º I say that

I say that

a + b = b + a.

a + b = b + a.

The theorem has just been demonstrated for b = 1; it can be verified analytically that if it is true for b = β, it will be true for b = β + 1.

The theorem has just been proven for b = 1; it can be verified analytically that if it holds for b = β, it will also hold for b = β + 1.

The proposition is therefore established by recurrence.

The idea is thus established by repetition.

Definition of Multiplication.—We shall define multiplication by the equalities.

Definition of Multiplication.—We will define multiplication using equalities.

(1) a × 1 = a.

a × 1 = a.

(2) a × b = [a × (b − 1)] + a.

(2) a × b = [a × (b − 1)] + a.

Like equality (1), equality (2) contains an infinity of definitions; having defined a × 1, it enables us to define successively: a × 2, a × 3, etc.

Like equality (1), equality (2) has countless definitions; having defined a × 1, it allows us to define successively: a × 2, a × 3, etc.

Properties of Multiplication.Distributivity.—I say that

Multiplication Properties.Distributive Property.—I say that

(a + b) × c = (a × c) + (b × c).

(a + b) × c = (a × c) + (b × c).

We verify analytically that the equality is true for c = 1; then that if the theorem is true for c = γ, it will be true for c = γ + 1.

We verify analytically that the equality holds for c = 1; then we show that if the theorem is true for c = γ, it will also be true for c = γ + 1.

The proposition is, therefore, demonstrated by recurrence.

The idea is, therefore, proven by repetition.

Commutativity.—1º I say that

Commutativity.—1st I say that

a × 1 = 1 × a.

a × 1 = 1 × a.

The theorem is evident for a = 1.

The theorem is clear for a = 1.

We verify analytically that if it is true for a = α, it will be true for a = α + 1.

We analytically confirm that if it holds for a = α, it will also hold for a = α + 1.

2º I say that

I say that

a × b = b × a.

a × b = b × a.

The theorem has just been proven for b = 1. We could verify analytically that if it is true for b = β, it will be true for b = β + 1.

The theorem has just been proven for b = 1. We can confirm through analysis that if it's true for b = β, it will also be true for b = β + 1.

IV

Here I stop this monotonous series of reasonings. But this very monotony has the better brought out the procedure which is uniform and is met again at each step.

Here I pause this repetitive chain of arguments. But this very repetition has highlighted the consistent process that occurs at each stage.

This procedure is the demonstration by recurrence. We first establish a theorem for n = 1; then we show that if it is true of n − 1, it is true of n, and thence conclude that it is true for all the whole numbers.[Pg 37]

This procedure involves demonstrating by induction. We first establish a theorem for n = 1; then we show that if it's true for n − 1, it must also be true for n, allowing us to conclude that it is true for all whole numbers.[Pg 37]

We have just seen how it may be used to demonstrate the rules of addition and multiplication, that is to say, the rules of the algebraic calculus; this calculus is an instrument of transformation, which lends itself to many more differing combinations than does the simple syllogism; but it is still an instrument purely analytic, and incapable of teaching us anything new. If mathematics had no other instrument, it would therefore be forthwith arrested in its development; but it has recourse anew to the same procedure, that is, to reasoning by recurrence, and it is able to continue its forward march.

We have just seen how it can be used to show the rules of addition and multiplication, which means the rules of algebra; this algebra is a tool for transformation that allows for many more different combinations than the simple syllogism does. However, it is still a purely analytical tool and can't teach us anything new. If mathematics relied solely on this tool, it would quickly come to a halt in its development. But it again turns to the same method, which is reasoning by recurrence, and it can keep progressing.

If we look closely, at every step we meet again this mode of reasoning, either in the simple form we have just given it, or under a form more or less modified.

If we look closely, at every step we encounter this way of thinking again, either in the straightforward form we've just discussed or in a slightly altered version.

Here then we have the mathematical reasoning par excellence, and we must examine it more closely.

Here we have the mathematical reasoning par excellence, and we need to take a closer look at it.

V

The essential characteristic of reasoning by recurrence is that it contains, condensed, so to speak, in a single formula, an infinity of syllogisms.

The key feature of reasoning by recurrence is that it includes, in a way, a whole range of syllogisms packed into a single formula.

That this may the better be seen, I will state one after another these syllogisms which are, if you will allow me the expression, arranged in 'cascade.'

That this may be more clearly understood, I will present these syllogisms one after another, which are, if you don't mind my saying, organized in a 'cascade.'

These are of course hypothetical syllogisms.

These are, of course, hypothetical syllogisms.

The theorem is true of the number 1.

The theorem is true for the number 1.

Now, if it is true of 1, it is true of 2.

Now, if it's true for 1, it's true for 2.

Therefore it is true of 2.

Therefore, it is true for 2.

Now, if it is true of 2, it is true of 3.

Now, if it's true for 2, it's true for 3.

Therefore it is true of 3, and so on.

Therefore it is true for 3, and so on.

We see that the conclusion of each syllogism serves as minor to the following.

We see that the conclusion of each syllogism acts as a minor premise for the next one.

Furthermore the majors of all our syllogisms can be reduced to a single formula.

Furthermore, all the main points of our arguments can be simplified to a single formula.

If the theorem is true of n − 1, so it is of n.

If the theorem is true for n − 1, then it is also true for n.

We see, then, that in reasoning by recurrence we confine ourselves to stating the minor of the first syllogism, and the general formula which contains as particular cases all the majors.

We can see that when reasoning by recurrence, we limit ourselves to stating the minor premise of the first syllogism and the general formula that includes all the major premises as specific cases.

This never-ending series of syllogisms is thus reduced to a phrase of a few lines.[Pg 38]

This endless chain of logical arguments is simplified to just a few lines.[Pg 38]

It is now easy to comprehend why every particular consequence of a theorem can, as I have explained above, be verified by purely analytic procedures.

It’s now easy to understand why every specific outcome of a theorem can, as I explained earlier, be confirmed through purely analytical methods.

If instead of showing that our theorem is true of all numbers, we only wish to show it true of the number 6, for example, it will suffice for us to establish the first 5 syllogisms of our cascade; 9 would be necessary if we wished to prove the theorem for the number 10; more would be needed for a larger number; but, however great this number might be, we should always end by reaching it, and the analytic verification would be possible.

If we just want to prove our theorem is true for the number 6 instead of all numbers, we only need to establish the first 5 syllogisms in our series. To prove it for the number 10, we would need 9 syllogisms; and we'd require even more for larger numbers. Regardless of how large that number is, we would always be able to reach it, making analytic verification possible.

And yet, however far we thus might go, we could never rise to the general theorem, applicable to all numbers, which alone can be the object of science. To reach this, an infinity of syllogisms would be necessary; it would be necessary to overleap an abyss that the patience of the analyst, restricted to the resources of formal logic alone, never could fill up.

And yet, no matter how far we might go, we could never reach the overall principle that applies to all numbers, which is the true goal of science. To achieve this, countless syllogisms would be needed; we would need to cross a gap that the analyst's patience, limited to just the tools of formal logic, could never bridge.

I asked at the outset why one could not conceive of a mind sufficiently powerful to perceive at a glance the whole body of mathematical truths.

I asked at the beginning why it wasn't possible to imagine a mind strong enough to see all the mathematical truths at once.

The answer is now easy; a chess-player is able to combine four moves, five moves, in advance, but, however extraordinary he may be, he will never prepare more than a finite number of them; if he applies his faculties to arithmetic, he will not be able to perceive its general truths by a single direct intuition; to arrive at the smallest theorem he can not dispense with the aid of reasoning by recurrence, for this is an instrument which enables us to pass from the finite to the infinite.

The answer is now simple; a chess player can plan four or five moves ahead, but no matter how extraordinary they are, they will never prepare more than a limited number of moves. If they focus on math, they won't grasp its general truths through a single direct intuition; to understand even the simplest theorem, they cannot do without reasoning by recurrence, as this is a tool that helps us move from the finite to the infinite.

This instrument is always useful, for, allowing us to overleap at a bound as many stages as we wish, it spares us verifications, long, irksome and monotonous, which would quickly become impracticable. But it becomes indispensable as soon as we aim at the general theorem, to which analytic verification would bring us continually nearer without ever enabling us to reach it.

This tool is always useful because it lets us skip over as many stages as we want, saving us from tedious, lengthy, and repetitive checks that would soon become unfeasible. However, it becomes essential as soon as we focus on the general theorem, which analytic checks would get us closer to without ever allowing us to fully achieve it.

In this domain of arithmetic, we may think ourselves very far from the infinitesimal analysis, and yet, as we have just seen, the idea of the mathematical infinite already plays a preponderant rôle, and without it there would be no science, because there would be nothing general.

In this area of arithmetic, we might believe we're quite far from infinitesimal calculus, and yet, as we've just observed, the concept of the mathematical infinite plays a significant role here, and without it, there would be no science because there would be nothing universal.

VI

The judgment on which reasoning by recurrence rests can be put under other forms; we may say, for example, that in an infinite collection of different whole numbers there is always one which is less than all the others.

The idea behind reasoning by recurrence can be expressed in different ways; for instance, we can say that in an infinite set of different whole numbers, there is always one that is smaller than all the others.

We can easily pass from one enunciation to the other and thus get the illusion of having demonstrated the legitimacy of reasoning by recurrence. But we shall always be arrested, we shall always arrive at an undemonstrable axiom which will be in reality only the proposition to be proved translated into another language.

We can quickly move from one statement to another and create the illusion that we've proven the validity of reasoning through repetition. However, we'll always find ourselves stuck, ultimately reaching an axiom that can't be proven, which is really just the proposition we need to prove expressed in different words.

We can not therefore escape the conclusion that the rule of reasoning by recurrence is irreducible to the principle of contradiction.

We can't avoid the conclusion that the rule of reasoning by recurrence can't be simplified to the principle of contradiction.

Neither can this rule come to us from experience; experience could teach us that the rule is true for the first ten or hundred numbers; for example, it can not attain to the indefinite series of numbers, but only to a portion of this series, more or less long but always limited.

Neither can this rule come to us from experience; experience could teach us that the rule is true for the first ten or hundred numbers; for example, it cannot apply to the infinite series of numbers, but only to a portion of this series, which may be longer or shorter but is always limited.

Now if it were only a question of that, the principle of contradiction would suffice; it would always allow of our developing as many syllogisms as we wished; it is only when it is a question of including an infinity of them in a single formula, it is only before the infinite that this principle fails, and there too, experience becomes powerless. This rule, inaccessible to analytic demonstration and to experience, is the veritable type of the synthetic a priori judgment. On the other hand, we can not think of seeing in it a convention, as in some of the postulates of geometry.

Now, if it were just a matter of that, the principle of contradiction would be enough; it would always let us create as many syllogisms as we wanted. It’s only when we try to fit an infinite number of them into a single formula that this principle falls short, and here, experience also fails to help. This rule, which cannot be accessed through analytical proof or experience, is the true example of the synthetic a priori judgment. On the other hand, we can't consider it a mere convention, like some of the postulates in geometry.

Why then does this judgment force itself upon us with an irresistible evidence? It is because it is only the affirmation of the power of the mind which knows itself capable of conceiving the indefinite repetition of the same act when once this act is possible. The mind has a direct intuition of this power, and experience can only give occasion for using it and thereby becoming conscious of it.

Why does this judgment impose itself on us with such undeniable clarity? It's because it simply confirms the mind's ability to understand that it can endlessly repeat the same action once that action is possible. The mind has an immediate awareness of this ability, and experience only creates an opportunity to use it and, in doing so, become aware of it.

But, one will say, if raw experience can not legitimatize reasoning by recurrence, is it so of experiment aided by[Pg 40] induction? We see successively that a theorem is true of the number 1, of the number 2, of the number 3 and so on; the law is evident, we say, and it has the same warranty as every physical law based on observations, whose number is very great but limited.

But, one might ask, if direct experience can't validate reasoning through recurrence, can it be the case with experiments supported by[Pg 40] induction? We observe that a theorem holds true for the number 1, for the number 2, for the number 3, and so on; the law is clear, we say, and it has the same reliability as any physical law based on observations, which are numerous but finite.

Here is, it must be admitted, a striking analogy with the usual procedures of induction. But there is an essential difference. Induction applied to the physical sciences is always uncertain, because it rests on the belief in a general order of the universe, an order outside of us. Mathematical induction, that is, demonstration by recurrence, on the contrary, imposes itself necessarily because it is only the affirmation of a property of the mind itself.

Here is, it must be admitted, a striking analogy with the usual procedures of induction. But there is an essential difference. Induction applied to the physical sciences is always uncertain because it depends on the belief in a general order of the universe, an order that exists outside of us. Mathematical induction, or proof by recurrence, on the other hand, is mandatory because it simply confirms a property of the mind itself.

VII

Mathematicians, as I have said before, always endeavor to generalize the propositions they have obtained, and, to seek no other example, we have just proved the equality:

Mathematicians, as I mentioned before, always try to generalize the propositions they've discovered, and, just as an example, we've just proven the equality:

a + 1 = 1 + a

a + 1 = 1 + a

and afterwards used it to establish the equality

and later used it to establish equality

a + b = b + a

a + b = b + a

which is manifestly more general.

which is clearly more general.

Mathematics can, therefore, like the other sciences, proceed from the particular to the general.

Mathematics can, like the other sciences, move from the specific to the general.

This is a fact which would have appeared incomprehensible to us at the outset of this study, but which is no longer mysterious to us, since we have ascertained the analogies between demonstration by recurrence and ordinary induction.

This is a fact that would have seemed confusing to us at the start of this study, but it's no longer mysterious since we've figured out the similarities between demonstration by recurrence and regular induction.

Without doubt recurrent reasoning in mathematics and inductive reasoning in physics rest on different foundations, but their march is parallel, they advance in the same sense, that is to say, from the particular to the general.

Without a doubt, repeated reasoning in mathematics and inductive reasoning in physics are based on different foundations, but they move in parallel; they progress in the same direction, meaning from the specific to the general.

Let us examine the case a little more closely.

Let’s take a closer look at the case.

To demonstrate the equality

To show the equality

a + 2 = 2 + a

a + 2 = 2 + a

it suffices to twice apply the rule

it’s enough to apply the rule twice

(1) a + 1 = 1 + a

a + 1 = 1 + a

and write

and type

(2) a + 2 = a + 1 + 1 = 1 + a + 1 = 1 + 1 + a = 2 + a.

(2) a + 2 = a + 1 + 1 = 1 + a + 1 = 1 + 1 + a = 2 + a.

The equality (2) thus deduced in purely analytic way from the equality (1) is, however, not simply a particular ease of it; it is something quite different.

The equality (2) that we’ve derived analytically from the equality (1) isn’t just a specific case of the first; it’s fundamentally different.

We can not therefore even say that in the really analytic and deductive part of mathematical reasoning we proceed from the general to the particular in the ordinary sense of the word.

We can't really say that in the truly analytical and deductive part of mathematical reasoning, we move from the general to the specific in the usual sense of the term.

The two members of the equality (2) are simply combinations more complicated than the two members of the equality (1), and analysis only serves to separate the elements which enter into these combinations and to study their relations.

The two parts of the equality (2) are just more complex combinations than the two parts of the equality (1), and analysis just helps to break down the elements involved in these combinations and examine their relationships.

Mathematicians proceed therefore 'by construction,' they 'construct' combinations more and more complicated. Coming back then by the analysis of these combinations, of these aggregates, so to speak, to their primitive elements, they perceive the relations of these elements and from them deduce the relations of the aggregates themselves.

Mathematicians move forward through 'construction,' creating increasingly complex combinations. By analyzing these combinations, or aggregates, they return to their basic elements, notice the relationships between these elements, and from these derive the relationships of the aggregates themselves.

This is a purely analytical proceeding, but it is not, however, a proceeding from the general to the particular, because evidently the aggregates can not be regarded as more particular than their elements.

This is a purely analytical process, but it is not a progression from the general to the specific, because clearly the aggregates cannot be seen as more specific than their individual elements.

Great importance, and justly, has been attached to this procedure of 'construction,' and some have tried to see in it the necessary and sufficient condition for the progress of the exact sciences.

Great importance, rightly so, has been given to this process of 'construction,' and some have attempted to see it as the essential and adequate foundation for the advancement of the exact sciences.

Necessary, without doubt; but sufficient, no.

Necessary, no doubt; but not enough.

For a construction to be useful and not a vain toil for the mind, that it may serve as stepping-stone to one wishing to mount, it must first of all possess a sort of unity enabling us to see in it something besides the juxtaposition of its elements.

For a construction to be useful and not just a pointless effort for the mind, so it can act as a stepping stone for someone trying to rise, it must first have a kind of unity that allows us to see more than just the arrangement of its parts.

Or, more exactly, there must be some advantage in considering the construction rather than its elements themselves.

Or, more specifically, there has to be some benefit in looking at the construction instead of just its individual elements.

What can this advantage be?

What could this advantage be?

Why reason on a polygon, for instance, which is always decomposable into triangles, and not on the elementary triangles?

Why reason about a polygon, for example, which can always be broken down into triangles, but not about the basic triangles?

It is because there are properties appertaining to polygons of any number of sides and that may be immediately applied to any particular polygon.

It’s because there are properties associated with polygons of any number of sides that can be directly applied to any specific polygon.

Usually, on the contrary, it is only at the cost of the most[Pg 42] prolonged exertions that they could be found by studying directly the relations of the elementary triangles. The knowledge of the general theorem spares us these efforts.

Usually, on the other hand, it is only through the most[Pg 42] extended efforts that they can be discovered by directly studying the relationships of the basic triangles. Understanding the general theorem saves us from these struggles.

A construction, therefore, becomes interesting only when it can be ranged beside other analogous constructions, forming species of the same genus.

A construction only becomes interesting when it can be compared to other similar constructions, creating types within the same category.

If the quadrilateral is something besides the juxtaposition of two triangles, this is because it belongs to the genus polygon.

If the quadrilateral is more than just two triangles placed next to each other, it's because it falls under the category of polygons.

Moreover, one must be able to demonstrate the properties of the genus without being forced to establish them successively for each of the species.

Moreover, one must be able to show the properties of the genus without having to establish them one by one for each species.

To attain that, we must necessarily mount from the particular to the general, ascending one or more steps.

To achieve that, we must climb from the specific to the general, making one or more steps up.

The analytic procedure 'by construction' does not oblige us to descend, but it leaves us at the same level.

The analytic procedure 'by construction' doesn’t require us to go down; it keeps us at the same level.

We can ascend only by mathematical induction, which alone can teach us something new. Without the aid of this induction, different in certain respects from physical induction, but quite as fertile, construction would be powerless to create science.

We can only rise through mathematical induction, which is the only method that can teach us something new. Without this induction, which differs in some ways from physical induction but is equally productive, the process of building knowledge would be unable to establish science.

Observe finally that this induction is possible only if the same operation can be repeated indefinitely. That is why the theory of chess can never become a science, for the different moves of the same game do not resemble one another.

Observe finally that this induction is possible only if the same operation can be repeated indefinitely. That is why the theory of chess can never become a science, because the different moves of the same game are not alike.


CHAPTER II

Math Impact and Experience

To learn what mathematicians understand by a continuum, one should not inquire of geometry. The geometer always seeks to represent to himself more or less the figures he studies, but his representations are for him only instruments; in making geometry he uses space just as he does chalk; so too much weight should not be attached to non-essentials, often of no more importance than the whiteness of the chalk.

To understand what mathematicians mean by a continuum, one shouldn't ask about geometry. Geometers always try to visualize the figures they're studying, but those visuals are just tools for them; they use space like they use chalk. So, we shouldn't place too much emphasis on trivial things that are often no more significant than the whiteness of the chalk.

The pure analyst has not this rock to fear. He has disengaged the science of mathematics from all foreign elements, and can answer our question: 'What exactly is this continuum about which mathematicians reason?' Many analysts who reflect on their art have answered already; Monsieur Tannery, for example, in his Introduction à la théorie des fonctions d'une variable.

The pure analyst doesn’t have this problem to worry about. He has separated the science of mathematics from any outside influences and can answer our question: 'What exactly is this continuum that mathematicians discuss?' Many analysts who think about their work have already provided answers; for instance, Monsieur Tannery in his Introduction à la théorie des fonctions d'une variable.

Let us start from the scale of whole numbers; between two consecutive steps, intercalate one or more intermediary steps, then between these new steps still others, and so on indefinitely. Thus we shall have an unlimited number of terms; these will be the numbers called fractional, rational or commensurable. But this is not yet enough; between these terms, which, however, are already infinite in number, it is still necessary to intercalate others called irrational or incommensurable. A remark before going further. The continuum so conceived is only a collection of individuals ranged in a certain order, infinite in number, it is true, but exterior to one another. This is not the ordinary conception, wherein is supposed between the elements of the continuum a sort of intimate bond which makes of them a whole, where the point does not exist before the line, but the line before the point. Of the celebrated formula, 'the continuum is unity in multiplicity,' only the multiplicity remains, the unity has disappeared. The analysts are none the less right in defining their continuum as they do, for they always reason on just this as soon as they pique themselves on their rigor. But this is[Pg 44] enough to apprise us that the veritable mathematical continuum is a very different thing from that of the physicists and that of the metaphysicians.

Let’s begin with whole numbers; between two consecutive steps, insert one or more middle steps, then between these new steps, add even more, and keep going indefinitely. This way, we'll have an endless number of terms, which are referred to as fractional, rational, or commensurable numbers. However, that's still not enough; between these terms—which are already infinite—we also need to insert others called irrational or incommensurable. One note before we proceed: the continuum, as envisioned here, is simply a collection of individuals arranged in a specific order. It's infinite in number, yes, but they are external to one another. This isn't the usual understanding, which assumes that a kind of close connection exists between the elements of the continuum, making them a whole, where the point does not exist before the line, but the line exists before the point. Of the famous saying, 'the continuum is unity in multiplicity,' only the multiplicity remains; the unity has vanished. Analysts are still correct in defining their continuum this way because they always reason from this perspective once they pride themselves on their rigor. But this is[Pg 44] enough to remind us that the true mathematical continuum is quite different from that of physicists and metaphysicians.

It may also be said perhaps that the mathematicians who are content with this definition are dupes of words, that it is necessary to say precisely what each of these intermediary steps is, to explain how they are to be intercalated and to demonstrate that it is possible to do it. But that would be wrong; the only property of these steps which is used in their reasonings[2] is that of being before or after such and such steps; therefore also this alone should occur in the definition.

It might be argued that the mathematicians who are satisfied with this definition are being misled by language. It's essential to clarify what each of these intermediate steps means, explain how they fit together, and show that it's feasible to do so. However, that viewpoint is mistaken; the only characteristic of these steps that is relevant to their reasoning[2] is whether they come before or after certain steps. So, this aspect should be the only one included in the definition.

So how the intermediary terms should be intercalated need not concern us; on the other hand, no one will doubt the possibility of this operation, unless from forgetting that possible, in the language of geometers, simply means free from contradiction.

So, we don't need to worry about how the intermediary terms should be inserted; however, no one can doubt that this operation is possible, unless they forget that, in the language of geometers, "possible" just means free from contradiction.

Our definition, however, is not yet complete, and I return to it after this over-long digression.

Our definition, however, isn't finished yet, and I’ll come back to it after this lengthy tangent.

Definition of Incommensurables.—The mathematicians of the Berlin school, Kronecker in particular, have devoted themselves to constructing this continuous scale of fractional and irrational numbers without using any material other than the whole number. The mathematical continuum would be, in this view, a pure creation of the mind, where experience would have no part.

Definition of Incommensurables.—The mathematicians from the Berlin school, especially Kronecker, focused on creating a complete range of fractional and irrational numbers using only whole numbers. In this perspective, the mathematical continuum is seen as a purely mental construct, with no involvement from experience.

The notion of the rational number seeming to them to present no difficulty, they have chiefly striven to define the incommensurable number. But before producing here their definition, I must make a remark to forestall the astonishment it is sure to arouse in readers unfamiliar with the customs of geometers.

The idea of rational numbers not appearing to be an issue for them, they have primarily focused on defining the irrational number. But before presenting their definition here, I need to point out something to prepare readers who might be surprised by the practices of geometers.

Mathematicians study not objects, but relations between objects; the replacement of these objects by others is therefore indifferent to them, provided the relations do not change. The matter is for them unimportant, the form alone interests them.

Mathematicians focus not on objects but on the relationships between objects; replacing these objects with others doesn't matter to them as long as the relationships remain the same. The substance is unimportant to them; only the structure captures their interest.

Without recalling this, it would scarcely be comprehensible that Dedekind should designate by the name incommensurable number a mere symbol, that is to say, something very different[Pg 45] from the ordinary idea of a quantity, which should be measurable and almost tangible.

Without recalling this, it would hardly make sense that Dedekind would use the term incommensurable number to refer to a mere symbol, meaning something very different[Pg 45] from the usual idea of a quantity, which is expected to be measurable and almost tangible.

Let us see now what Dedekind's definition is:

Let’s take a look at Dedekind’s definition:

The commensurable numbers can in an infinity of ways be partitioned into two classes, such that any number of the first class is greater than any number of the second class.

The comparable numbers can be divided in countless ways into two groups, where any number from the first group is greater than any number from the second group.

It may happen that among the numbers of the first class there is one smaller than all the others; if, for example, we range in the first class all numbers greater than 2, and 2 itself, and in the second class all numbers less than 2, it is clear that 2 will be the least of all numbers of the first class. The number 2 may be chosen as symbol of this partition.

It’s possible that within the first group of numbers, there’s one that’s smaller than all the others; for instance, if we include all numbers greater than 2, as well as 2 itself, in the first group, and in the second group all numbers less than 2, it’s clear that 2 will be the smallest number in the first group. The number 2 can be used as a symbol for this division.

It may happen, on the contrary, that among the numbers of the second class is one greater than all the others; this is the case, for example, if the first class comprehends all numbers greater than 2, and the second all numbers less than 2, and 2 itself. Here again the number 2 may be chosen as symbol of this partition.

It might happen, however, that among the numbers in the second class, there is one that is greater than all the others. This occurs, for instance, when the first class includes all numbers greater than 2, while the second class includes all numbers less than 2 and 2 itself. In this case, the number 2 can again be used as a symbol for this division.

But it may equally well happen that neither is there in the first class a number less than all the others, nor in the second class a number greater than all the others. Suppose, for example, we put in the first class all commensurable numbers whose squares are greater than 2 and in the second all whose squares are less than 2. There is none whose square is precisely 2. Evidently there is not in the first class a number less than all the others, for, however near the square of a number may be to 2, we can always find a commensurable number whose square is still closer to 2.

But it could also happen that in the first group, there isn’t a number that’s less than all the others, and in the second group, there isn’t a number that’s greater than all the others. For example, let’s say we put all the commensurable numbers whose squares are greater than 2 in the first group, and all the ones whose squares are less than 2 in the second group. There isn’t a number whose square is exactly 2. Clearly, there isn't a number in the first group that’s less than all the others, because no matter how close a number's square may be to 2, we can always find a commensurable number whose square is even closer to 2.

In Dedekind's view, the incommensurable number

In Dedekind's view, the incommensurable number

√2 or (2)½

√2 or (2)½

is nothing but the symbol of this particular mode of partition of commensurable numbers; and to each mode of partition corresponds thus a number, commensurable or not, which serves as its symbol.

is nothing but the symbol of this specific way of dividing commensurable numbers; and to each way of dividing corresponds a number, whether commensurable or not, that acts as its symbol.

But to be content with this would be to forget too far the origin of these symbols; it remains to explain how we have been led to attribute to them a sort of concrete existence, and, besides,[Pg 46] does not the difficulty begin even for the fractional numbers themselves? Should we have the notion of these numbers if we had not previously known a matter that we conceive as infinitely divisible, that is to say, a continuum?

But being satisfied with this would mean forgetting the true origin of these symbols. We still need to explain how we have come to give them a kind of concrete existence. Moreover, [Pg 46] doesn’t the real difficulty start even with the fractional numbers themselves? Would we even have the concept of these numbers if we hadn’t first understood something that we see as infinitely divisible, in other words, a continuum?

The Physical Continuum.—We ask ourselves then if the notion of the mathematical continuum is not simply drawn from experience. If it were, the raw data of experience, which are our sensations, would be susceptible of measurement. We might be tempted to believe they really are so, since in these latter days the attempt has been made to measure them and a law has even been formulated, known as Fechner's law, according to which sensation is proportional to the logarithm of the stimulus.

The Physical Continuum.—We wonder if the idea of the mathematical continuum is really just based on our experiences. If that were true, the raw data from our experiences, which are our sensations, would be measurable. We might be inclined to think they actually are, especially since recent efforts have been made to measure them, and a principle has even been established, called Fechner's law, which states that sensation is proportional to the logarithm of the stimulus.

But if we examine more closely the experiments by which it has been sought to establish this law, we shall be led to a diametrically opposite conclusion. It has been observed, for example, that a weight A of 10 grams and a weight B of 11 grams produce identical sensations, that the weight B is just as indistinguishable from a weight C of 12 grams, but that the weight A is easily distinguished from the weight C. Thus the raw results of experience may be expressed by the following relations:

But if we take a closer look at the experiments that aimed to prove this law, we’ll see a completely different conclusion. For instance, a weight A of 10 grams and a weight B of 11 grams create the same sensations, and the weight B is just as hard to tell apart from a weight C of 12 grams, but weight A is easily distinguished from weight C. Therefore, the basic results of the experiments can be summarized by the following relationships:

A = B,       B = C,       A < C,

A = B,       B = C,       A < C,

which may be regarded as the formula of the physical continuum.

which can be seen as the formula for the physical continuum.

But here is an intolerable discord with the principle of contradiction, and the need of stopping this has compelled us to invent the mathematical continuum.

But this creates an unacceptable conflict with the principle of contradiction, and the need to resolve this has forced us to create the mathematical continuum.

We are, therefore, forced to conclude that this notion has been created entirely by the mind, but that experience has given the occasion.

We are, therefore, compelled to conclude that this idea has been completely formed by the mind, but that experience has provided the opportunity.

We can not believe that two quantities equal to a third are not equal to one another, and so we are led to suppose that A is different from B and B from C, but that the imperfection of our senses has not permitted of our distinguishing them.

We can't believe that two things that are equal to a third aren't equal to each other. So we end up thinking that A is different from B, and B is different from C, but that our senses are just not good enough to help us tell them apart.

Creation of the Mathematical Continuum.First Stage. So far it would suffice, in accounting for the facts, to intercalate between A and B a few terms, which would remain discrete. What happens now if we have recourse to some instrument to[Pg 47] supplement the feebleness of our senses, if, for example, we make use of a microscope? Terms such as A and B, before indistinguishable, appear now distinct; but between A and B, now become distinct, will be intercalated a new term, D, that we can distinguish neither from A nor from B. Despite the employment of the most highly perfected methods, the raw results of our experience will always present the characteristics of the physical continuum with the contradiction which is inherent in it.

Creation of the Math Continuum.First Stage. Until now, it was enough to add a few discrete terms between A and B to explain the facts. What happens if we use some tool to[Pg 47] enhance our limited senses, like using a microscope? Terms like A and B, which we couldn’t distinguish before, now appear separate; however, a new term, D, will be placed between A and B, and we still won’t be able to tell it apart from either A or B. Even with the most advanced techniques, the raw results of our observations will always reflect the characteristics of the physical continuum along with its inherent contradictions.

We shall escape it only by incessantly intercalating new terms between the terms already distinguished, and this operation must be continued indefinitely. We might conceive the stopping of this operation if we could imagine some instrument sufficiently powerful to decompose the physical continuum into discrete elements, as the telescope resolves the milky way into stars. But this we can not imagine; in fact, it is with the eye we observe the image magnified by the microscope, and consequently this image must always retain the characteristics of visual sensation and consequently those of the physical continuum.

We will only be able to escape it by constantly inserting new terms between the already identified terms, and this process has to go on forever. We might think about stopping this process if we could picture some tool strong enough to break the physical continuum into separate elements, like how a telescope breaks down the Milky Way into stars. But we can't imagine that; in reality, we observe the image enlarged by the microscope with our eyes, so this image will always have the qualities of visual sensation and thus those of the physical continuum.

Nothing distinguishes a length observed directly from the half of this length doubled by the microscope. The whole is homogeneous with the part; this is a new contradiction, or rather it would be if the number of terms were supposed finite; in fact, it is clear that the part containing fewer terms than the whole could not be similar to the whole.

Nothing sets apart a length observed directly from the half of that length magnified by the microscope. The whole is the same as the part; this presents a new contradiction, or it would if we assumed the number of terms were finite; in reality, it's evident that a part with fewer terms than the whole couldn't possibly resemble the whole.

The contradiction ceases when the number of terms is regarded as infinite; nothing hinders, for example, considering the aggregate of whole numbers as similar to the aggregate of even numbers, which, however, is only a part of it; and, in fact, to each whole number corresponds an even number, its double.

The contradiction goes away when we see the number of terms as infinite; there’s nothing stopping us from viewing the set of whole numbers as being similar to the set of even numbers, which is just a subset of the whole numbers. In fact, for every whole number, there’s an even number that corresponds to it, which is simply double that whole number.

But it is not only to escape this contradiction contained in the empirical data that the mind is led to create the concept of a continuum, formed of an indefinite number of terms.

But it’s not just to escape this contradiction in the empirical data that the mind feels compelled to create the concept of a continuum, made up of an infinite number of terms.

All happens as in the sequence of whole numbers. We have the faculty of conceiving that a unit can be added to a collection of units; thanks to experience, we have occasion to exercise this faculty and we become conscious of it; but from this moment we feel that our power has no limit and that we can count indefinitely, though we have never had to count more than a finite number of objects.[Pg 48]

Everything happens in the order of whole numbers. We have the ability to understand that we can add one unit to a collection of units; thanks to our experiences, we have the chance to use this ability and become aware of it. From that point on, we realize that our potential is limitless and that we can count forever, even though we have never needed to count more than a finite number of items.[Pg 48]

Just so, as soon as we have been led to intercalate means between two consecutive terms of a series, we feel that this operation can be continued beyond all limit, and that there is, so to speak, no intrinsic reason for stopping.

Just like that, once we start inserting values between two adjacent terms in a series, it becomes clear that this process can go on indefinitely, and there really isn’t any inherent reason to stop.

As an abbreviation, let me call a mathematical continuum of the first order every aggregate of terms formed according to the same law as the scale of commensurable numbers. If we afterwards intercalate new steps according to the law of formation of incommensurable numbers, we shall obtain what we will call a continuum of the second order.

As a shorthand, let's refer to a mathematical continuum of the first order as any collection of terms created using the same rules as the set of commensurable numbers. If we later add new elements following the rules for forming incommensurable numbers, we will end up with what we’ll call a continuum of the second order.

Second Stage.—We have made hitherto only the first stride; we have explained the origin of continua of the first order; but it is necessary to see why even they are not sufficient and why the incommensurable numbers had to be invented.

Second Stage.—We have only taken the first step so far; we have explained the origins of first order continua; however, we need to understand why they are still not enough and why we had to create incommensurable numbers.

If we try to imagine a line, it must have the characteristics of the physical continuum, that is to say, we shall not be able to represent it except with a certain breadth. Two lines then will appear to us under the form of two narrow bands, and, if we are content with this rough image, it is evident that if the two lines cross they will have a common part.

If we try to picture a line, it needs to have the qualities of a physical continuum, meaning we can't represent it without a bit of width. So, two lines will look like two narrow bands, and if we accept this basic image, it's clear that if the two lines intersect, they'll share a common section.

But the pure geometer makes a further effort; without entirely renouncing the aid of the senses, he tries to reach the concept of the line without breadth, of the point without extension. This he can only attain to by regarding the line as the limit toward which tends an ever narrowing band, and the point as the limit toward which tends an ever lessening area. And then, our two bands, however narrow they may be, will always have a common area, the smaller as they are the narrower, and whose limit will be what the pure geometer calls a point.

But the pure geometer puts in more effort; while not completely giving up on using the senses, he tries to grasp the idea of a line that has no width and a point that has no size. He can only achieve this by seeing the line as the limit that an increasingly narrow band approaches and the point as the limit that an increasingly smaller area approaches. Then, our two bands, no matter how narrow they are, will always share a common area, which, the smaller and narrower they are, will ultimately lead to what the pure geometer refers to as a point.

This is why it is said two lines which cross have a point in common, and this truth seems intuitive.

This is why it’s said that two lines that cross share a common point, and this idea feels obvious.

But it would imply contradiction if lines were conceived as continua of the first order, that is to say, if on the lines traced by the geometer should be found only points having for coordinates rational numbers. The contradiction would be manifest as soon as one affirmed, for example, the existence of straights and circles.

But it would imply a contradiction if lines were thought of as first-order continuums, meaning that the lines drawn by the geometer should only consist of points with rational number coordinates. The contradiction would become obvious as soon as one claimed, for example, the existence of straight lines and circles.

It is clear, in fact, that if the points whose coordinates are[Pg 49] commensurable were alone regarded as real, the circle inscribed in a square and the diagonal of this square would not intersect, since the coordinates of the point of intersection are incommensurable.

It’s clear, in fact, that if only the points with coordinates that are[Pg 49] commensurable were considered real, the circle inside a square and the square’s diagonal would not cross each other, since the coordinates of the intersection point are incommensurable.

That would not yet be sufficient, because we should get in this way only certain incommensurable numbers and not all those numbers.

That still wouldn't be enough, because we would only obtain certain incommensurable numbers and not all of them.

But conceive of a straight line divided into two rays. Each of these rays will appear to our imagination as a band of a certain breadth; these bands moreover will encroach one on the other, since there must be no interval between them. The common part will appear to us as a point which will always remain when we try to imagine our bands narrower and narrower, so that we admit as an intuitive truth that if a straight is cut into two rays their common frontier is a point; we recognize here the conception of Dedekind, in which an incommensurable number was regarded as the common frontier of two classes of rational numbers.

But imagine a straight line divided into two rays. Each of these rays will seem to us like a band of a certain width; these bands will also overlap since there can be no gap between them. The shared part will appear to us as a point that remains as we try to visualize our bands becoming narrower and narrower. We thus accept as an obvious truth that when a straight line is split into two rays, their shared boundary is a point. Here, we recognize Dedekind's concept, in which an irrational number is seen as the common boundary of two sets of rational numbers.

Such is the origin of the continuum of the second order, which is the mathematical continuum properly so called.

Such is the origin of the second-order continuum, which is the mathematical continuum in the true sense.

Résumé.—In recapitulation, the mind has the faculty of creating symbols, and it is thus that it has constructed the mathematical continuum, which is only a particular system of symbols. Its power is limited only by the necessity of avoiding all contradiction; but the mind only makes use of this faculty if experience furnishes it a stimulus thereto.

Summary.—To sum up, the mind has the ability to create symbols, and it is through this ability that it has developed the mathematical continuum, which is just a specific system of symbols. Its capability is only restricted by the need to prevent any contradictions; however, the mind only utilizes this ability if experience provides a trigger for it.

In the case considered, this stimulus was the notion of the physical continuum, drawn from the rough data of the senses. But this notion leads to a series of contradictions from which it is necessary successively to free ourselves. So we are forced to imagine a more and more complicated system of symbols. That at which we stop is not only exempt from internal contradiction (it was so already at all the stages we have traversed), but neither is it in contradiction with various propositions called intuitive, which are derived from empirical notions more or less elaborated.

In this case, the driving idea was the concept of the physical continuum, based on the basic data from our senses. However, this concept brings about several contradictions that we need to gradually overcome. As a result, we end up having to create an increasingly complex system of symbols. What we ultimately settle on is not only free from internal contradictions (it has been at every stage we’ve gone through so far), but it also does not contradict various ideas that are considered intuitive, which come from somewhat developed empirical concepts.

Measurable Magnitude.—The magnitudes we have studied hitherto are not measurable; we can indeed say whether a given[Pg 50] one of these magnitudes is greater than another, but not whether it is twice or thrice as great.

Measurable Impact.—The magnitudes we've looked at so far are not measurable; we can definitely say if one of these magnitudes is greater than another, but we can't say if it is twice or three times as great.

So far, I have only considered the order in which our terms are ranged. But for most applications that does not suffice. We must learn to compare the interval which separates any two terms. Only on this condition does the continuum become a measurable magnitude and the operations of arithmetic applicable.

So far, I’ve only looked at the order in which our terms are organized. But for most purposes, that’s not enough. We need to learn how to compare the distance between any two terms. Only then does the continuum become a measurable quantity and the operations of arithmetic applicable.

This can only be done by the aid of a new and special convention. We will agree that in such and such a case the interval comprised between the terms A and B is equal to the interval which separates C and D. For example, at the beginning of our work we have set out from the scale of the whole numbers and we have supposed intercalated between two consecutive steps n intermediary steps; well, these new steps will be by convention regarded as equidistant.

This can only be achieved with a new and specific convention. We will agree that in certain cases, the interval between terms A and B is the same as the interval separating C and D. For instance, at the start of our work, we began with the scale of whole numbers and assumed there were n intermediary steps inserted between two consecutive steps; well, these new steps will be considered equidistant by convention.

This is a way of defining the addition of two magnitudes, because if the interval AB is by definition equal to the interval CD, the interval AD will be by definition the sum of the intervals AB and AC.

This is a way to define the addition of two quantities because if the interval AB is defined to be equal to the interval CD, the interval AD will, by definition, be the sum of the intervals AB and AC.

This definition is arbitrary in a very large measure. It is not completely so, however. It is subjected to certain conditions and, for example, to the rules of commutativity and associativity of addition. But provided the definition chosen satisfies these rules, the choice is indifferent, and it is useless to particularize it.

This definition is largely arbitrary. However, it's not entirely so. It is subject to specific conditions and, for instance, to the rules of commutativity and associativity in addition. But as long as the chosen definition meets these rules, the choice doesn't matter, and there’s no point in being specific about it.

Various Remarks.—We can now discuss several important questions:

Comments.—We can now talk about several important questions:

1º Is the creative power of the mind exhausted by the creation of the mathematical continuum?

1º Is the creative power of the mind worn out by the creation of the mathematical continuum?

No: the works of Du Bois-Reymond demonstrate it in a striking way.

No: the works of Du Bois-Reymond clearly show this in a striking way.

We know that mathematicians distinguish between infinitesimals of different orders and that those of the second order are infinitesimal, not only in an absolute way, but also in relation to those of the first order. It is not difficult to imagine infinitesimals of fractional or even of irrational order, and thus we find again that scale of the mathematical continuum which has been dealt with in the preceding pages.[Pg 51]

We know that mathematicians differentiate between infinitesimals of various orders, with second-order infinitesimals being tiny not just in an absolute sense but also in comparison to first-order infinitesimals. It's easy to picture infinitesimals that are fractional or even irrational, which brings us back to the idea of the mathematical continuum discussed in the earlier pages.[Pg 51]

Further, there are infinitesimals which are infinitely small in relation to those of the first order, and, on the contrary, infinitely great in relation to those of order 1 + ε, and that however small ε may be. Here, then, are new terms intercalated in our series, and if I may be permitted to revert to the phraseology lately employed which is very convenient though not consecrated by usage, I shall say that thus has been created a sort of continuum of the third order.

Furthermore, there are infinitesimals that are infinitely small compared to those of the first order, and on the other hand, infinitely large compared to those of order 1 + ε, no matter how small ε is. So, we have new terms added to our series, and if I may refer back to the terminology recently used which is very useful even if it's not traditional, I would say that this has led to the creation of a kind of continuum of the third order.

It would be easy to go further, but that would be idle; one would only be imagining symbols without possible application, and no one will think of doing that. The continuum of the third order, to which the consideration of the different orders of infinitesimals leads, is itself not useful enough to have won citizenship, and geometers regard it only as a mere curiosity. The mind uses its creative faculty only when experience requires it.

It would be simple to take this further, but that would be pointless; one would just be thinking about symbols with no real use, and no one would want to do that. The continuum of the third order, which comes from looking at the different orders of infinitesimals, is not practical enough to be accepted, and geometers see it merely as a curiosity. The mind uses its creativity only when needed by experience.

2º Once in possession of the concept of the mathematical continuum, is one safe from contradictions analogous to those which gave birth to it?

2º Once someone understands the concept of the mathematical continuum, are they free from contradictions similar to those that led to its creation?

No, and I will give an example.

No, and I’ll provide an example.

One must be very wise not to regard it as evident that every curve has a tangent; and in fact if we picture this curve and a straight as two narrow bands we can always so dispose them that they have a part in common without crossing. If we imagine then the breadth of these two bands to diminish indefinitely, this common part will always subsist and, at the limit, so to speak, the two lines will have a point in common without crossing, that is to say, they will be tangent.

One should be very careful not to assume it's obvious that every curve has a tangent. If we visualize this curve and a straight line as two narrow strips, we can arrange them in such a way that they share a section without intersecting. If we then imagine the width of these two strips getting smaller and smaller, this shared section will always exist, and in the limit, so to speak, the two lines will touch at a point without crossing each other, meaning they will be tangent.

The geometer who reasons in this way, consciously or not, is only doing what we have done above to prove two lines which cut have a point in common, and his intuition might seem just as legitimate.

The geometer who reasons this way, whether intentionally or not, is just doing what we did earlier to show that two intersecting lines have a point in common, and his intuition might seem just as valid.

It would deceive him however. We can demonstrate that there are curves which have no tangent, if such a curve is defined as an analytic continuum of the second order.

It would mislead him, though. We can show that there are curves that don’t have a tangent if we define such a curve as an analytic continuum of the second order.

Without doubt some artifice analogous to those we have discussed above would have sufficed to remove the contradiction; but, as this is met with only in very exceptional cases, it has received no further attention.[Pg 52]

Without a doubt, a trick similar to the ones we've talked about would have been enough to eliminate the contradiction; however, since this only happens in very rare cases, it hasn't been given much more consideration.[Pg 52]

Instead of seeking to reconcile intuition with analysis, we have been content to sacrifice one of the two, and as analysis must remain impeccable, we have decided against intuition.

Instead of trying to bring together intuition and analysis, we have been okay with giving up one of them, and since analysis needs to be flawless, we have chosen to abandon intuition.

The Physical Continuum of Several Dimensions.—We have discussed above the physical continuum as derived from the immediate data of our senses, or, if you wish, from the rough results of Fechner's experiments; I have shown that these results are summed up in the contradictory formulas

The Physical Continuum of Multiple Dimensions.—We have talked about the physical continuum based on the direct information from our senses, or, if you prefer, from the basic outcomes of Fechner's experiments; I have demonstrated that these results are represented in the conflicting formulas.

A = B,       B = C,       A < C.

A = B,       B = C,       A < C.

Let us now see how this notion has been generalized and how from it has come the concept of many-dimensional continua.

Let’s now explore how this idea has been expanded and how it has led to the concept of multi-dimensional continua.

Consider any two aggregates of sensations. Either we can discriminate them one from another, or we can not, just as in Fechner's experiments a weight of 10 grams can be distinguished from a weight of 12 grams, but not from a weight of 11 grams. This is all that is required to construct the continuum of several dimensions.

Consider any two collections of sensations. We can either tell them apart or we can't, just like in Fechner's experiments where a weight of 10 grams can be distinguished from 12 grams, but not from 11 grams. That’s all that's needed to create a continuum of multiple dimensions.

Let us call one of these aggregates of sensations an element. That will be something analogous to the point of the mathematicians; it will not be altogether the same thing however. We can not say our element is without extension, since we can not distinguish it from neighboring elements and it is thus surrounded by a sort of haze. If the astronomical comparison may be allowed, our 'elements' would be like nebulae, whereas the mathematical points would be like stars.

Let’s refer to one of these clusters of sensations as an element. This will be similar to a point in mathematics, but not exactly the same. We can't say our element doesn't have extension, since we can't tell it apart from nearby elements, and it’s surrounded by a kind of haze. If we can use an astronomical comparison, our 'elements' would resemble nebulae, while the mathematical points would be like stars.

That being granted, a system of elements will form a continuum if we can pass from any one of them to any other, by a series of consecutive elements such that each is indistinguishable from the preceding. This linear series is to the line of the mathematician what an isolated element was to the point.

That being said, a system of elements will create a continuum if we can transition from one element to another through a sequence of consecutive elements, where each one is indistinguishable from the one before it. This linear series is to the line of the mathematician what an isolated element was to the point.

Before going farther, I must explain what is meant by a cut. Consider a continuum C and remove from it certain of its elements which for an instant we shall regard as no longer belonging to this continuum. The aggregate of the elements so removed will be called a cut. It may happen that, thanks to this cut, C may be subdivided into several distinct continua, the aggregate of the remaining elements ceasing to form a unique continuum.[Pg 53]

Before going further, I need to explain what a cut means. Imagine a continuum C and remove certain elements from it, which for a moment we’ll consider as no longer part of this continuum. The collection of the elements that are removed will be called a cut. It’s possible that because of this cut, C may be subdivided into several distinct continua, and the remaining elements will no longer form a unique continuum.[Pg 53]

There will then be on C two elements, A and B, that must be regarded as belonging to two distinct continua, and this will be recognized because it will be impossible to find a linear series of consecutive elements of C, each of these elements indistinguishable from the preceding, the first being A and the last B, without one of the elements of this series being indistinguishable from one of the elements of the cut.

There will then be two elements, A and B, on C, which must be seen as belonging to two different continuums. This will be obvious because it will be impossible to find a straight series of consecutive elements of C, where each of these elements is indistinguishable from the one before it, with the first being A and the last B, without one of the elements in this series being indistinguishable from one of the elements of the cut.

On the contrary, it may happen that the cut made is insufficient to subdivide the continuum C. To classify the physical continua, we will examine precisely what are the cuts which must be made to subdivide them.

On the contrary, it might happen that the cut made is not enough to break up the continuum C. To classify the physical continua, we will look closely at what cuts need to be made to divide them.

If a physical continuum C can be subdivided by a cut reducing to a finite number of elements all distinguishable from one another (and consequently forming neither a continuum, nor several continua), we shall say C is a one-dimensional continuum.

If a physical continuum C can be divided by a cut to reduce it to a finite number of elements that are all distinct from each other (and therefore do not form either a continuum or several continua), we will say C is a one-dimensional continuum.

If, on the contrary, C can be subdivided only by cuts which are themselves continua, we shall say C has several dimensions. If cuts which are continua of one dimension suffice, we shall say C has two dimensions; if cuts of two dimensions suffice, we shall say C has three dimensions, and so on.

If, on the other hand, C can only be divided by cuts that are themselves continuous, we will say C has multiple dimensions. If cuts that are continuous in one dimension are enough, we will say C has two dimensions; if cuts in two dimensions are sufficient, we will say C has three dimensions, and so forth.

Thus is defined the notion of the physical continuum of several dimensions, thanks to this very simple fact that two aggregates of sensations are distinguishable or indistinguishable.

Thus is defined the concept of the physical continuum with multiple dimensions, thanks to the straightforward fact that two sets of sensations can be either distinguishable or indistinguishable.

The Mathematical Continuum of Several Dimensions.—Thence the notion of the mathematical continuum of n dimensions has sprung quite naturally by a process very like that we discussed at the beginning of this chapter. A point of such a continuum, you know, appears to us as defined by a system of n distinct magnitudes called its coordinates.

The Mathematical Continuum of Multiple Dimensions.—From here, the idea of the mathematical continuum of n dimensions has emerged quite naturally through a process similar to what we discussed at the start of this chapter. A point in this continuum, as you know, is defined by a system of n distinct values called its coordinates.

These magnitudes need not always be measurable; there is, for instance, a branch of geometry independent of the measurement of these magnitudes, in which it is only a question of knowing, for example, whether on a curve ABC, the point B is between the points A and C, and not of knowing whether the arc AB is equal to the arc BC or twice as great. This is what is called Analysis Situs.

These sizes don't always have to be measurable; there is, for example, a part of geometry that doesn't depend on measuring these sizes, where it’s only about determining whether, on a curve ABC, the point B is located between points A and C, without needing to know if the arc AB is equal to the arc BC or twice its length. This is known as Analysis Situs.

This is a whole body of doctrine which has attracted the[Pg 54] attention of the greatest geometers and where we see flow one from another a series of remarkable theorems. What distinguishes these theorems from those of ordinary geometry is that they are purely qualitative and that they would remain true if the figures were copied by a draughtsman so awkward as to grossly distort the proportions and replace straights by strokes more or less curved.

This is a complete set of principles that has caught the[Pg 54] attention of the top mathematicians, showcasing a series of impressive theorems that connect to one another. What sets these theorems apart from standard geometry is that they are entirely qualitative and would still hold true even if the shapes were drawn by someone who poorly distorted the proportions and replaced straight lines with lines that are somewhat curved.

Through the wish to introduce measure next into the continuum just defined this continuum becomes space, and geometry is born. But the discussion of this is reserved for Part Second.

Through the desire to add measurement next into the continuum already established, this continuum transforms into space, and geometry comes into existence. However, this topic will be addressed in Part Two.


PART II

SPACE


CHAPTER III

The Non-Euclidean Geometries

Every conclusion supposes premises; these premises themselves either are self-evident and need no demonstration, or can be established only by relying upon other propositions, and since we can not go back thus to infinity, every deductive science, and in particular geometry, must rest on a certain number of undemonstrable axioms. All treatises on geometry begin, therefore, by the enunciation of these axioms. But among these there is a distinction to be made: Some, for example, 'Things which are equal to the same thing are equal to one another,' are not propositions of geometry, but propositions of analysis. I regard them as analytic judgments a priori, and shall not concern myself with them.

Every conclusion is based on premises; these premises are either obvious and don’t need proof, or they can only be established by depending on other statements. Since we can't keep going back forever, every deductive science, especially geometry, must rely on a certain number of axioms that can't be proven. Therefore, all geometry texts start with the statement of these axioms. However, there is a distinction to be made: some, like 'Things that are equal to the same thing are equal to one another,' are not geometry propositions, but rather propositions of analysis. I consider them analytic judgments a priori, and I won’t focus on them.

But I must lay stress upon other axioms which are peculiar to geometry. Most treatises enunciate three of these explicitly:

But I need to emphasize other principles that are specific to geometry. Most texts state three of these clearly:

1º Through two points can pass only one straight;

1º Only one straight line can pass through two points;

2º The straight line is the shortest path from one point to another;

2º The straight line is the shortest distance between two points;

3º Through a given point there is not more than one parallel to a given straight.

3º Through a given point, there is only one line that is parallel to a given straight line.

Although generally a proof of the second of these axioms is omitted, it would be possible to deduce it from the other two and from those, much more numerous, which are implicitly admitted without enunciating them, as I shall explain further on.

Although a proof of the second of these axioms is usually left out, it could be derived from the other two and from the many more that are implicitly accepted without stating them, as I will explain later.

It was long sought in vain to demonstrate likewise the third axiom, known as Euclid's Postulate. What vast effort has been wasted in this chimeric hope is truly unimaginable. Finally, in[Pg 56] the first quarter of the nineteenth century, and almost at the same time, a Hungarian and a Russian, Bolyai and Lobachevski, established irrefutably that this demonstration is impossible; they have almost rid us of inventors of geometries 'sans postulatum'; since then the Académie des Sciences receives only about one or two new demonstrations a year.

It was long pursued in vain to prove the third axiom, known as Euclid's Postulate. The amount of effort wasted on this illusory hope is truly unimaginable. Finally, in [Pg 56] the early part of the nineteenth century, around the same time, a Hungarian and a Russian, Bolyai and Lobachevski, conclusively showed that this proof is impossible; they have nearly freed us from inventors of geometries without a postulate. Since then, the Académie des Sciences receives only about one or two new proofs a year.

The question was not exhausted; it soon made a great stride by the publication of Riemann's celebrated memoir entitled: Ueber die Hypothesen welche der Geometrie zu Grunde liegen. This paper has inspired most of the recent works of which I shall speak further on, and among which it is proper to cite those of Beltrami and of Helmholtz.

The question was far from settled; it quickly gained momentum with the release of Riemann's famous essay titled: Ueber die Hypothesen welche der Geometrie zu Grunde liegen. This paper has influenced most of the recent works I will discuss later, notably those by Beltrami and Helmholtz.

The Bolyai-Lobachevski Geometry.—If it were possible to deduce Euclid's postulate from the other axioms, it is evident that in denying the postulate and admitting the other axioms, we should be led to contradictory consequences; it would therefore be impossible to base on such premises a coherent geometry.

Bolyai-Lobachevskian Geometry.—If it were possible to prove Euclid's postulate using the other axioms, it’s clear that by rejecting the postulate while accepting the other axioms, we would end up with contradictory results; therefore, it would be impossible to build a consistent geometry on such foundations.

Now this is precisely what Lobachevski did.

Now this is exactly what Lobachevski did.

He assumes at the start that: Through a given point can be drawn two parallels to a given straight.

He starts by assuming that: Through a given point, two parallel lines can be drawn to a given line.

And he retains besides all Euclid's other axioms. From these hypotheses he deduces a series of theorems among which it is impossible to find any contradiction, and he constructs a geometry whose faultless logic is inferior in nothing to that of the Euclidean geometry.

And he also keeps all of Euclid's other axioms. From these assumptions, he derives a series of theorems where it's impossible to find any contradictions, and he builds a geometry whose flawless logic is just as strong as that of Euclidean geometry.

The theorems are, of course, very different from those to which we are accustomed, and they can not fail to be at first a little disconcerting.

The theorems are, of course, very different from what we're used to, and they can definitely be a bit unsettling at first.

Thus the sum of the angles of a triangle is always less than two right angles, and the difference between this sum and two right angles is proportional to the surface of the triangle.

Thus the total of the angles in a triangle is always less than two right angles, and the difference between this total and two right angles is proportional to the area of the triangle.

It is impossible to construct a figure similar to a given figure but of different dimensions.

It’s impossible to create a shape that looks like a given shape but has different dimensions.

If we divide a circumference into n equal parts, and draw tangents at the points of division, these n tangents will form a polygon if the radius of the circle is small enough; but if this radius is sufficiently great they will not meet.

If we split a circle into n equal sections and draw tangents at those points, these n tangents will create a polygon if the circle's radius is small enough; however, if the radius is large enough, they won't intersect.

It is useless to multiply these examples; Lobachevski's[Pg 57] propositions have no relation to those of Euclid, but they are not less logically bound one to another.

It’s pointless to keep listing these examples; Lobachevski's[Pg 57] propositions aren’t related to Euclid’s, but they are still logically connected to each other.

Riemann's Geometry.—Imagine a world uniquely peopled by beings of no thickness (height); and suppose these 'infinitely flat' animals are all in the same plane and can not get out. Admit besides that this world is sufficiently far from others to be free from their influence. While we are making hypotheses, it costs us no more to endow these beings with reason and believe them capable of creating a geometry. In that case, they will certainly attribute to space only two dimensions.

Riemannian Geometry.—Imagine a world filled with beings that have no thickness (height); and suppose these 'infinitely flat' creatures are all in the same plane and can't escape it. Also, let's assume that this world is far enough from others to be unaffected by them. While we’re making assumptions, it doesn’t cost us anything to give these beings reason and believe they could create a geometry. In that scenario, they would definitely assign only two dimensions to space.

But suppose now that these imaginary animals, while remaining without thickness, have the form of a spherical, and not of a plane, figure, and are all on the same sphere without power to get off. What geometry will they construct? First it is clear they will attribute to space only two dimensions; what will play for them the rôle of the straight line will be the shortest path from one point to another on the sphere, that is to say, an arc of a great circle; in a word, their geometry will be the spherical geometry.

But let's imagine now that these fictional animals, while still being two-dimensional, take on a spherical shape instead of a flat one, and they all exist on the same sphere without the ability to move off it. What kind of geometry will they create? First, it's obvious they will only recognize two dimensions in space; to them, what acts as the straight line will be the shortest route from one point to another on the sphere, meaning an arc of a great circle. In short, their geometry will be spherical geometry.

What they will call space will be this sphere on which they must stay, and on which happen all the phenomena they can know. Their space will therefore be unbounded since on a sphere one can always go forward without ever being stopped, and yet it will be finite; one can never find the end of it, but one can make a tour of it.

What they will refer to as space will be this sphere where they must remain, and where all the phenomena they can understand occur. Their space will therefore be unbounded since on a sphere you can always move forward without ever being halted, and yet it will be finite; you can never discover its end, but you can travel around it.

Well, Riemann's geometry is spherical geometry extended to three dimensions. To construct it, the German mathematician had to throw overboard, not only Euclid's postulate, but also the first axiom: Only one straight can pass through two points.

Well, Riemann's geometry is spherical geometry expanded into three dimensions. To create it, the German mathematician had to discard not just Euclid's postulate, but also the first axiom: Only one straight line can go through two points.

On a sphere, through two given points we can draw in general only one great circle (which, as we have just seen, would play the rôle of the straight for our imaginary beings); but there is an exception: if the two given points are diametrically opposite, an infinity of great circles can be drawn through them.

On a sphere, we can generally draw only one great circle through two given points (which, as we've just seen, would serve as the straight line for our imaginary beings). However, there's an exception: if the two given points are directly opposite each other, an infinite number of great circles can be drawn through them.

In the same way, in Riemann's geometry (at least in one of its forms), through two points will pass in general only a single straight; but there are exceptional cases where through two points an infinity of straights can pass.[Pg 58]

In Riemann's geometry (at least in one of its forms), usually only one straight line can connect any two points; however, there are special cases where an infinite number of lines can go through those two points.[Pg 58]

There is a sort of opposition between Riemann's geometry and that of Lobachevski.

There’s a conflict between Riemann's geometry and Lobachevski's.

Thus the sum of the angles of a triangle is:

Thus, the total of the angles in a triangle is:

Equal to two right angles in Euclid's geometry;

Equal to two right angles in Euclid's geometry;

Less than two right angles in that of Lobachevski;

Less than two right angles in Lobachevski's work;

Greater than two right angles in that of Riemann.

Greater than two right angles in the Riemann case.

The number of straights through a given point that can be drawn coplanar to a given straight, but nowhere meeting it, is equal:

The number of lines that can be drawn through a specific point, parallel to a given line but never intersecting it, is equal:

To one in Euclid's geometry;

To someone in Euclid's geometry;

To zero in that of Riemann;

To focus on that of Riemann;

To infinity in that of Lobachevski.

To infinity in the work of Lobachevski.

Add that Riemann's space is finite, although unbounded, in the sense given above to these two words.

Add that Riemann's space is finite, though unbounded, in the sense described above to these two words.

The Surfaces of Constant Curvature.—One objection still remained possible. The theorems of Lobachevski and of Riemann present no contradiction; but however numerous the consequences these two geometers have drawn from their hypotheses, they must have stopped before exhausting them, since their number would be infinite; who can say then that if they had pushed their deductions farther they would not have eventually reached some contradiction?

Constant Curvature Surfaces.—One objection could still be raised. The theorems of Lobachevski and Riemann don't contradict each other; however, no matter how many conclusions those two geometers have drawn from their ideas, they must have stopped before exploring all possibilities, since the number of outcomes would be infinite. Who can say that if they had pushed their reasoning further, they wouldn't have eventually encountered some contradiction?

This difficulty does not exist for Riemann's geometry, provided it is limited to two dimensions; in fact, as we have seen, two-dimensional Riemannian geometry does not differ from spherical geometry, which is only a branch of ordinary geometry, and consequently is beyond all discussion.

This issue doesn’t come up with Riemann's geometry, as long as it's limited to two dimensions; in fact, as we've seen, two-dimensional Riemannian geometry is the same as spherical geometry, which is just a part of regular geometry, and therefore it's settled and beyond debate.

Beltrami, in correlating likewise Lobachevski's two-dimensional geometry with a branch of ordinary geometry, has equally refuted the objection so far as it is concerned.

Beltrami, by linking Lobachevski's two-dimensional geometry with a part of regular geometry, has also addressed the objection related to it.

Here is how he accomplished it. Consider any figure on a surface. Imagine this figure traced on a flexible and inextensible canvas applied over this surface in such a way that when the canvas is displaced and deformed, the various lines of this figure can change their form without changing their length. In general, this flexible and inextensible figure can not be displaced without leaving the surface; but there are certain particular surfaces[Pg 59] for which such a movement would be possible; these are the surfaces of constant curvature.

Here’s how he did it. Think of any shape on a surface. Picture this shape outlined on a flexible and non-stretchable canvas placed over this surface so that when the canvas is moved and bent, the different lines of this shape can change their form without changing their length. Generally, this flexible and non-stretchable shape cannot be moved without detaching from the surface; however, there are specific surfaces[Pg 59] where such movement is possible; these are the surfaces with constant curvature.

If we resume the comparison made above and imagine beings without thickness living on one of these surfaces, they will regard as possible the motion of a figure all of whose lines remain constant in length. On the contrary, such a movement would appear absurd to animals without thickness living on a surface of variable curvature.

If we go back to the comparison made earlier and picture beings with no thickness living on one of these surfaces, they would see the motion of a shape where all the lines stay the same length as possible. However, for thin creatures living on a surface with changing curves, such movement would seem ridiculous.

These surfaces of constant curvature are of two sorts: Some are of positive curvature, and can be deformed so as to be applied over a sphere. The geometry of these surfaces reduces itself therefore to the spherical geometry, which is that of Riemann.

These constant curvature surfaces come in two types: Some have positive curvature and can be shaped to fit over a sphere. The geometry of these surfaces thus aligns with spherical geometry, which is the geometry described by Riemann.

The others are of negative curvature. Beltrami has shown that the geometry of these surfaces is none other than that of Lobachevski. The two-dimensional geometries of Riemann and Lobachevski are thus correlated to the Euclidean geometry.

The others have negative curvature. Beltrami demonstrated that the geometry of these surfaces is exactly that of Lobachevski. The two-dimensional geometries of Riemann and Lobachevski are therefore linked to Euclidean geometry.

Interpretation of Non-Euclidean Geometries.—So vanishes the objection so far as two-dimensional geometries are concerned.

Understanding Non-Euclidean Geometries.—Thus, the objection disappears when it comes to two-dimensional geometries.

It would be easy to extend Beltrami's reasoning to three-dimensional geometries. The minds that space of four dimensions does not repel will see no difficulty in it, but they are few. I prefer therefore to proceed otherwise.

It would be simple to expand Beltrami's reasoning to three-dimensional geometries. Those who aren't put off by the idea of four-dimensional space won't find it challenging, but they are rare. So, I’d rather take a different approach.

Consider a certain plane, which I shall call the fundamental plane, and construct a sort of dictionary, by making correspond each to each a double series of terms written in two columns, just as correspond in the ordinary dictionaries the words of two languages whose significance is the same:

Consider a certain plane, which I’ll refer to as the fundamental plane, and create a kind of dictionary by matching each term in a double series, organized in two columns, just like how regular dictionaries align the words of two languages that have the same meaning:

Space: Portion of space situated above the fundamental plane.

Space: Area of space located above the base level.

Plane: Sphere cutting the fundamental plane orthogonally.

Plane: Sphere intersecting the fundamental plane at a right angle.

Straight: Circle cutting the fundamental plane orthogonally.

Straight: Cutting the circle on the main plane at a right angle.

Sphere: Sphere.

Sphere.

Circle: Circle.

Circle: Circle.

Angle: Angle.

Angle.

Distance between two points: Logarithm of the cross ratio of these two points and the intersections of the fundamental plane with a circle passing through these two points and cutting it orthogonally. Etc., Etc.[Pg 60]

Distance between two points: The logarithm of the cross ratio of these two points and the intersections of the fundamental plane with a circle that goes through these two points and intersects it at a right angle. Etc., Etc.[Pg 60]

Now take Lobachevski's theorems and translate them with the aid of this dictionary as we translate a German text with the aid of a German-English dictionary. We shall thus obtain theorems of the ordinary geometry. For example, that theorem of Lobachevski: 'the sum of the angles of a triangle is less than two right angles' is translated thus: "If a curvilinear triangle has for sides circle-arcs which prolonged would cut orthogonally the fundamental plane, the sum of the angles of this curvilinear triangle will be less than two right angles." Thus, however far the consequences of Lobachevski's hypotheses are pushed, they will never lead to a contradiction. In fact, if two of Lobachevski's theorems were contradictory, it would be the same with the translations of these two theorems, made by the aid of our dictionary, but these translations are theorems of ordinary geometry and no one doubts that the ordinary geometry is free from contradiction. Whence comes this certainty and is it justified? That is a question I can not treat here because it would require to be enlarged upon, but which is very interesting and I think not insoluble.

Now take Lobachevski's theorems and translate them using this dictionary, just like we translate a German text with a German-English dictionary. We will thus obtain theorems of ordinary geometry. For example, Lobachevski's theorem states that 'the sum of the angles of a triangle is less than two right angles,' which translates to: "If a curvilinear triangle has sides that are arcs of a circle which, if extended, would intersect the fundamental plane at right angles, the sum of the angles of this curvilinear triangle will be less than two right angles." Therefore, no matter how far we push the implications of Lobachevski's hypotheses, they will never lead to a contradiction. In fact, if two of Lobachevski's theorems were contradictory, the translations of these two theorems using our dictionary would also be contradictory, but these translations are theorems of ordinary geometry and no one doubts that ordinary geometry is free from contradiction. Where does this certainty come from, and is it justified? That’s a question I can't address here because it would require more elaboration, but it's very interesting and I believe it’s not unsolvable.

Nothing remains then of the objection above formulated. This is not all. Lobachevski's geometry, susceptible of a concrete interpretation, ceases to be a vain logical exercise and is capable of applications; I have not the time to speak here of these applications, nor of the aid that Klein and I have gotten from them for the integration of linear differential equations.

Nothing remains of the previous objection. But that's not all. Lobachevski's geometry, which can be concretely interpreted, stops being just a pointless logical exercise and becomes applicable; I don't have time to discuss these applications here, nor the help that Klein and I have received from them for integrating linear differential equations.

This interpretation moreover is not unique, and several dictionaries analogous to the preceding could be constructed, which would enable us by a simple 'translation' to transform Lobachevski's theorems into theorems of ordinary geometry.

This interpretation isn't unique, and several similar dictionaries could be created that would allow us to easily 'translate' Lobachevski's theorems into regular geometry theorems.

The Implicit Axioms.—Are the axioms explicitly enunciated in our treatises the sole foundations of geometry? We may be assured of the contrary by noticing that after they are successively abandoned there are still left over some propositions common to the theories of Euclid, Lobachevski and Riemann. These propositions must rest on premises the geometers admit without enunciation. It is interesting to try to disentangle them from the classic demonstrations.

The Hidden Assumptions.—Are the axioms stated in our writings the only foundations of geometry? We can be confident they aren’t, as we see that even when they are gradually set aside, some principles remain that are shared among the theories of Euclid, Lobachevski, and Riemann. These principles must be based on assumptions that geometers accept without stating them. It’s intriguing to try to separate them from the classic proofs.

Stuart Mill has claimed that every definition contains an[Pg 61] axiom, because in defining one affirms implicitly the existence of the object defined. This is going much too far; it is rare that in mathematics a definition is given without its being followed by the demonstration of the existence of the object defined, and when this is dispensed with it is generally because the reader can easily supply it. It must not be forgotten that the word existence has not the same sense when it refers to a mathematical entity and when it is a question of a material object. A mathematical entity exists, provided its definition implies no contradiction, either in itself, or with the propositions already admitted.

Stuart Mill argued that every definition includes an[Pg 61] axiom because defining something inherently confirms the existence of that object. This perspective is overreaching; in mathematics, it's uncommon for a definition to be presented without also providing a proof of the existence of the defined object, and when that's not included, it's usually because the reader can easily infer it. It's important to remember that the term existence has different meanings when applied to a mathematical entity versus a physical object. A mathematical entity exists as long as its definition doesn't contradict itself or any propositions that have already been accepted.

But if Stuart Mill's observation can not be applied to all definitions, it is none the less just for some of them. The plane is sometimes defined as follows:

But even if Stuart Mill's observation doesn't apply to all definitions, it definitely holds true for some of them. The plane is sometimes defined like this:

The plane is a surface such that the straight which joins any two of its points is wholly on this surface.

The plane is a flat surface where a straight line connecting any two points on it lies entirely on that surface.

This definition manifestly hides a new axiom; it is true we might change it, and that would be preferable, but then we should have to enunciate the axiom explicitly.

This definition clearly conceals a new principle; it's true we could change it, and that would be better, but then we would need to state the principle explicitly.

Other definitions would suggest reflections not less important.

Other definitions would suggest equally important reflections.

Such, for example, is that of the equality of two figures; two figures are equal when they can be superposed; to superpose them one must be displaced until it coincides with the other; but how shall it be displaced? If we should ask this, no doubt we should be told that it must be done without altering the shape and as a rigid solid. The vicious circle would then be evident.

Such, for instance, is the concept of the equality of two shapes; two shapes are equal when they can be placed on top of each other; to place them on top of one another, you have to move one until it matches the other; but how should it be moved? If we were to ask this, we would likely be told that it must be done without changing the shape and as a solid object. The circular reasoning would then be clear.

In fact this definition defines nothing; it would have no meaning for a being living in a world where there were only fluids. If it seems clear to us, that is because we are used to the properties of natural solids which do not differ much from those of the ideal solids, all of whose dimensions are invariable.

In fact, this definition defines nothing; it wouldn’t mean anything to a being living in a world made up entirely of fluids. If it seems clear to us, it’s because we’re familiar with the properties of natural solids, which aren’t much different from those of ideal solids, all of whose dimensions are unchanging.

Yet, imperfect as it may be, this definition implies an axiom.

Yet, as imperfect as it might be, this definition suggests a fundamental truth.

The possibility of the motion of a rigid figure is not a self-evident truth, or at least it is so only in the fashion of Euclid's postulate and not as an analytic judgment a priori would be.

The possibility of a rigid figure moving isn't an obvious truth, or at least it is only obvious in the way Euclid's postulate is, and not like an analytic judgment a priori would be.

Moreover, in studying the definitions and the demonstrations of geometry, we see that one is obliged to admit without proof not only the possibility of this motion, but some of its properties besides.[Pg 62]

Moreover, when we study the definitions and demonstrations of geometry, we have to accept without proof not just the possibility of this motion, but also some of its properties.[Pg 62]

This is at once seen from the definition of the straight line. Many defective definitions have been given, but the true one is that which is implied in all the demonstrations where the straight line enters:

This is immediately clear from the definition of a straight line. Many flawed definitions have been proposed, but the correct one is the one implied in all the proofs that involve the straight line:

"It may happen that the motion of a rigid figure is such that all the points of a line belonging to this figure remain motionless while all the points situated outside of this line move. Such a line will be called a straight line." We have designedly, in this enunciation, separated the definition from the axiom it implies.

"It might occur that the movement of a solid shape is such that all the points on a line belonging to this shape stay still while all the points outside this line move. This line will be referred to as a straight line." We have intentionally, in this statement, separated the definition from the axiom it suggests.

Many demonstrations, such as those of the cases of the equality of triangles, of the possibility of dropping a perpendicular from a point to a straight, presume propositions which are not enunciated, for they require the admission that it is possible to transport a figure in a certain way in space.

Many demonstrations, like those in the cases of triangle equality and the ability to drop a perpendicular from a point to a line, assume propositions that aren't explicitly stated, as they depend on the acceptance that it's possible to move a shape in a specific way in space.

The Fourth Geometry.—Among these implicit axioms, there is one which seems to me to merit some attention, because when it is abandoned a fourth geometry can be constructed as coherent as those of Euclid, Lobachevski and Riemann.

The Fourth Dimension.—Among these unspoken principles, there’s one that I think deserves some focus, because if it's disregarded, a fourth geometry can be developed that is just as consistent as those of Euclid, Lobachevski, and Riemann.

To prove that a perpendicular may always be erected at a point A to a straight AB, we consider a straight AC movable around the point A and initially coincident with the fixed straight AB; and we make it turn about the point A until it comes into the prolongation of AB.

To show that you can always draw a perpendicular line at a point A to a straight line AB, we take a straight line AC that can move around point A and is initially lined up with the fixed line AB; then we rotate it around point A until it extends in the same direction as AB.

Thus two propositions are presupposed: First, that such a rotation is possible, and next that it may be continued until the two straights come into the prolongation one of the other.

Thus two propositions are assumed: First, that such a rotation is possible, and next that it can continue until the two lines extend into one another.

If the first point is admitted and the second rejected, we are led to a series of theorems even stranger than those of Lobachevski and Riemann, but equally exempt from contradiction.

If we accept the first point and reject the second, we arrive at a set of theorems even more unusual than those of Lobachevski and Riemann, but equally free from contradiction.

I shall cite only one of these theorems and that not the most singular: A real straight may be perpendicular to itself.

I will mention just one of these theorems, and it's not even the most noteworthy: A real straight line may be perpendicular to itself.

Lie's Theorem.—The number of axioms implicitly introduced in the classic demonstrations is greater than necessary, and it would be interesting to reduce it to a minimum. It may first be asked whether this reduction is possible, whether the number of necessary axioms and that of imaginable geometries are not infinite.[Pg 63]

Lie's Theorem.—The number of axioms that are implicitly introduced in the classic proofs is more than what’s needed, and it would be interesting to cut it down to the bare minimum. One might first wonder if this reduction is even possible, and whether the necessary axioms and the possible geometries are infinite.[Pg 63]

A theorem of Sophus Lie dominates this whole discussion. It may be thus enunciated:

A theorem by Sophus Lie is the main focus of this entire discussion. It can be stated as follows:

Suppose the following premises are admitted:

Suppose we accept the following points:

1º Space has n dimensions;

1st Space has n dimensions;

2º The motion of a rigid figure is possible;

2º The movement of a rigid object is possible;

3º It requires p conditions to determine the position of this figure in space.

3º It requires p conditions to find out where this figure is located in space.

The number of geometries compatible with these premises will be limited.

The number of geometries that fit these premises will be limited.

I may even add that if n is given, a superior limit can be assigned to p.

I might also point out that if n is provided, a maximum value can be set for p.

If therefore the possibility of motion is admitted, there can be invented only a finite (and even a rather small) number of three-dimensional geometries.

If we accept that motion is possible, only a limited (and even quite small) number of three-dimensional geometries can be created.

Riemann's Geometries.—Yet this result seems contradicted by Riemann, for this savant constructs an infinity of different geometries, and that to which his name is ordinarily given is only a particular case.

Riemann's geometries.—However, this finding appears to be contradicted by Riemann, as this scholar creates countless different geometries, and the one commonly associated with his name is just a specific instance.

All depends, he says, on how the length of a curve is defined. Now, there is an infinity of ways of defining this length, and each of them may be the starting point of a new geometry.

All depends, he says, on how the length of a curve is defined. Now, there are countless ways to define this length, and each one can serve as the foundation for a new geometry.

That is perfectly true, but most of these definitions are incompatible with the motion of a rigid figure, which in the theorem of Lie is supposed possible. These geometries of Riemann, in many ways so interesting, could never therefore be other than purely analytic and would not lend themselves to demonstrations analogous to those of Euclid.

That’s absolutely true, but most of these definitions don’t fit with the movement of a rigid shape, which the theorem of Lie assumes is possible. These geometries of Riemann, while fascinating in many ways, could never be anything but purely analytical and wouldn’t allow for proofs similar to those of Euclid.

On the Nature of Axioms.—Most mathematicians regard Lobachevski's geometry only as a mere logical curiosity; some of them, however, have gone farther. Since several geometries are possible, is it certain ours is the true one? Experience no doubt teaches us that the sum of the angles of a triangle is equal to two right angles; but this is because the triangles we deal with are too little; the difference, according to Lobachevski, is proportional to the surface of the triangle; will it not perhaps become sensible when we shall operate on larger triangles or when our measurements shall become more precise? The Euclidean geometry would thus be only a provisional geometry.[Pg 64]

On Axioms.—Most mathematicians see Lobachevski's geometry as just an interesting logical oddity; however, some have delved deeper. Since multiple geometries are possible, can we be sure ours is the only true one? Experience certainly suggests that the sum of the angles in a triangle equals two right angles; but this is simply because the triangles we work with are too small. According to Lobachevski, the difference is proportional to the area of the triangle; might it become noticeable when we work with larger triangles or when our measurements become more accurate? In that case, Euclidean geometry would only be a temporary geometry.[Pg 64]

To discuss this opinion, we should first ask ourselves what is the nature of the geometric axioms.

To talk about this opinion, we should first ask ourselves what the nature of geometric axioms is.

Are they synthetic a priori judgments, as Kant said?

Are they synthetic a priori judgments, like Kant said?

They would then impose themselves upon us with such force that we could not conceive the contrary proposition, nor build upon it a theoretic edifice. There would be no non-Euclidean geometry.

They would then assert themselves so strongly that we couldn't even imagine an alternative idea or construct a theoretical framework around it. Non-Euclidean geometry wouldn't exist.

To be convinced of it take a veritable synthetic a priori judgment, the following, for instance, of which we have seen the preponderant rôle in the first chapter:

To be convinced of it, consider a true synthetic a priori judgment, like the one we've seen playing a significant role in the first chapter:

If a theorem is true for the number 1, and if it has been proved that it is true of n + 1 provided it is true of n, it will be true of all the positive whole numbers.

If a theorem is true for the number 1, and it has been shown that it's true for n + 1 if it's true for n, then it will be true for all positive whole numbers.

Then try to escape from that and, denying this proposition, try to found a false arithmetic analogous to non-Euclidean geometry—it can not be done; one would even be tempted at first blush to regard these judgments as analytic.

Then attempt to break free from that and, by rejecting this idea, try to establish a false arithmetic similar to non-Euclidean geometry—it can't be done; one might even initially be inclined to see these judgments as analytical.

Moreover, resuming our fiction of animals without thickness, we can hardly admit that these beings, if their minds are like ours, would adopt the Euclidean geometry which would be contradicted by all their experience.

Moreover, picking up our idea of animals without depth, we can hardly believe that these beings, if their minds are like ours, would accept Euclidean geometry, which would contradict all their experiences.

Should we therefore conclude that the axioms of geometry are experimental verities? But we do not experiment on ideal straights or circles; it can only be done on material objects. On what then could be based experiments which should serve as foundation for geometry? The answer is easy.

Should we therefore conclude that the axioms of geometry are proven truths? But we don't experiment on ideal lines or circles; we can only do that with physical objects. So, what can experiments be based on to form the foundation of geometry? The answer is simple.

We have seen above that we constantly reason as if the geometric figures behaved like solids. What geometry would borrow from experience would therefore be the properties of these bodies. The properties of light and its rectilinear propagation have also given rise to some of the propositions of geometry, and in particular those of projective geometry, so that from this point of view one would be tempted to say that metric geometry is the study of solids, and projective, that of light.

We have observed that we always think as if geometric shapes acted like three-dimensional objects. Thus, geometry would take from experience the characteristics of these bodies. The properties of light and its straight-line movement have also contributed to some of the principles of geometry, especially in projective geometry. From this perspective, one might be inclined to say that metric geometry focuses on solids, while projective geometry focuses on light.

But a difficulty remains, and it is insurmountable. If geometry were an experimental science, it would not be an exact science, it would be subject to a continual revision. Nay, it would from this very day be convicted of error, since we know that there is no rigorously rigid solid.[Pg 65]

But there's a problem that can't be solved. If geometry were an experimental science, it wouldn't be an exact science; it would constantly need revision. In fact, it would be proven wrong right now because we know that there are no perfectly rigid solids.[Pg 65]

The axioms of geometry therefore are neither synthetic a priori judgments nor experimental facts.

The axioms of geometry are neither synthetic a priori judgments nor experimental facts.

They are conventions; our choice among all possible conventions is guided by experimental facts; but it remains free and is limited only by the necessity of avoiding all contradiction. Thus it is that the postulates can remain rigorously true even though the experimental laws which have determined their adoption are only approximative.

They are conventions; our selection among all available conventions is guided by experimental facts; however, it remains free and is only restricted by the need to avoid any contradictions. This way, the postulates can stay rigorously true even though the experimental laws that led to their selection are only approximate.

In other words, the axioms of geometry (I do not speak of those of arithmetic) are merely disguised definitions.

In other words, the principles of geometry (I'm not talking about those of arithmetic) are just rephrased definitions.

Then what are we to think of that question: Is the Euclidean geometry true?

Then what should we think about that question: Is Euclidean geometry true?

It has no meaning.

It's meaningless.

As well ask whether the metric system is true and the old measures false; whether Cartesian coordinates are true and polar coordinates false. One geometry can not be more true than another; it can only be more convenient.

As well ask whether the metric system is correct and the old measurements incorrect; whether Cartesian coordinates are correct and polar coordinates incorrect. One geometry can't be more accurate than another; it can only be more convenient.

Now, Euclidean geometry is, and will remain, the most convenient:

Now, Euclidean geometry is, and will continue to be, the most convenient:

1º Because it is the simplest; and it is so not only in consequence of our mental habits, or of I know not what direct intuition that we may have of Euclidean space; it is the simplest in itself, just as a polynomial of the first degree is simpler than one of the second; the formulas of spherical trigonometry are more complicated than those of plane trigonometry, and they would still appear so to an analyst ignorant of their geometric signification.

1º Because it’s the simplest; and it’s not just because of our thought patterns or some vague intuition we might have about Euclidean space; it is inherently the simplest, just like a first-degree polynomial is simpler than a second-degree one; the formulas of spherical trigonometry are more complex than those of plane trigonometry, and they would still seem complicated to an analyst who doesn’t understand their geometric meaning.

2º Because it accords sufficiently well with the properties of natural solids, those bodies which our hands and our eyes compare and with which we make our instruments of measure.

2º Because it aligns quite well with the characteristics of natural solids, those objects that our hands and eyes examine and with which we create our measuring tools.


CHAPTER IV

Space and Geometry

Let us begin by a little paradox.

Let’s start with a little paradox.

Beings with minds like ours, and having the same senses as we, but without previous education, would receive from a suitably chosen external world impressions such that they would be led to construct a geometry other than that of Euclid and to localize the phenomena of that external world in a non-Euclidean space, or even in a space of four dimensions.

Beings with minds like ours and the same senses as we have, but without any prior education, would take in impressions from a well-chosen external world in such a way that they would end up creating a geometry different from Euclid's and would place the phenomena of that external world in a non-Euclidean space, or even in a four-dimensional space.

As for us, whose education has been accomplished by our actual world, if we were suddenly transported into this new world, we should have no difficulty in referring its phenomena to our Euclidean space. Conversely, if these beings were transported into our environment, they would be led to relate our phenomena to non-Euclidean space.

As for us, whose education has come from our real world, if we were suddenly taken to this new world, we wouldn’t have any trouble connecting its phenomena to our Euclidean space. On the other hand, if these beings were brought into our environment, they would likely relate our phenomena to non-Euclidean space.

Nay more; with a little effort we likewise could do it. A person who should devote his existence to it might perhaps attain to a realization of the fourth dimension.

No more; with a little effort, we could also do it. A person who dedicated their life to it might perhaps achieve an understanding of the fourth dimension.

Geometric Space and Perceptual Space.—It is often said the images of external objects are localized in space, even that they can not be formed except on this condition. It is also said that this space, which serves thus as a ready prepared frame for our sensations and our representations, is identical with that of the geometers, of which it possesses all the properties.

Geometric Space and Perceptual Space.—People often say that images of external objects are placed in space, and that they can only be formed under this condition. It's also said that this space, which acts as a convenient frame for our sensations and perceptions, is the same as that described by geometers, sharing all its properties.

To all the good minds who think thus, the preceding statement must have appeared quite extraordinary. But let us see whether they are not subject to an illusion that a more profound analysis would dissipate.

To everyone with good judgment who thinks this way, the previous statement must have seemed quite unusual. But let’s examine whether they’re falling for an illusion that a deeper analysis would clarify.

What, first of all, are the properties of space, properly so called? I mean of that space which is the object of geometry and which I shall call geometric space.

What, first of all, are the properties of space, properly speaking? I mean that space which is the subject of geometry and which I will refer to as geometric space.

The following are some of the most essential:

The following are some of the most important:

1º It is continuous;[Pg 67]

It is continuous;

2º It is infinite;

It's infinite;

3º It has three dimensions;

It has three dimensions.

4º It is homogeneous, that is to say, all its points are identical one with another;

4º It is uniform, meaning that all its points are the same as each other;

5º It is isotropic, that is to say, all the straights which pass through the same point are identical one with another.

5º It is isotropic, meaning that all the lines passing through the same point are identical to each other.

Compare it now to the frame of our representations and our sensations, which I may call perceptual space.

Compare it now to the framework of our representations and our sensations, which I’ll refer to as perceptual space.

Visual Space.—Consider first a purely visual impression, due to an image formed on the bottom of the retina.

Visual Space.—First, think about a purely visual impression created by an image formed on the back of the retina.

A cursory analysis shows us this image as continuous, but as possessing only two dimensions; this already distinguishes from geometric space what we may call pure visual space.

A quick look shows us this image as continuous, but only having two dimensions; this already sets it apart from geometric space, which we can refer to as pure visual space.

Besides, this image is enclosed in a limited frame.

Besides, this image is confined within a limited frame.

Finally, there is another difference not less important: this pure visual space is not homogeneous. All the points of the retina, aside from the images which may there be formed, do not play the same rôle. The yellow spot can in no way be regarded as identical with a point on the border of the retina. In fact, not only does the same object produce there much more vivid impressions, but in every limited frame the point occupying the center of the frame will never appear as equivalent to a point near one of the borders.

Finally, there's another important difference: this pure visual space is not uniform. Not all points on the retina, apart from the images that may be formed there, serve the same purpose. The yellow spot is definitely not the same as a point at the edge of the retina. In fact, not only does the same object create much stronger impressions there, but in every limited view, the point in the center of the frame will never seem equivalent to a point near one of the edges.

No doubt a more profound analysis would show us that this continuity of visual space and its two dimensions are only an illusion; it would separate it therefore still more from geometric space, but we shall not dwell on this remark.

No doubt a deeper analysis would reveal that this continuity of visual space and its two dimensions are just an illusion; it would further distinguish it from geometric space, but we won’t focus on that point.

Sight, however, enables us to judge of distances and consequently to perceive a third dimension. But every one knows that this perception of the third dimension reduces itself to the sensation of the effort at accommodation it is necessary to make, and to that of the convergence which must be given to the two eyes, to perceive an object distinctly.

Sight, however, allows us to judge distances and, as a result, perceive a third dimension. But everyone knows that this perception of the third dimension basically comes down to the sensation of the effort needed to adjust our vision and the convergence required by our two eyes to see an object clearly.

These are muscular sensations altogether different from the visual sensations which have given us the notion of the first two dimensions. The third dimension therefore will not appear to us as playing the same rôle as the other two. What may be called complete visual space is therefore not an isotropic space.[Pg 68]

These are physical feelings that are completely different from the visual sensations that have shaped our understanding of the first two dimensions. As a result, the third dimension won't seem to function the same way as the other two. What we can refer to as complete visual space is not an isotropic space.[Pg 68]

It has, it is true, precisely three dimensions, which means that the elements of our visual sensations (those at least which combine to form the notion of extension) will be completely defined when three of them are known; to use the language of mathematics, they will be functions of three independent variables.

It does indeed have exactly three dimensions, which means that the components of our visual experiences (at least those that come together to create the idea of extension) will be fully defined once three of them are known; to put it in mathematical terms, they will be functions of three independent variables.

But examine the matter a little more closely. The third dimension is revealed to us in two different ways: by the effort of accommodation and by the convergence of the eyes.

But look at the matter a bit more closely. The third dimension is shown to us in two different ways: through the effort of focusing our vision and by the way our eyes come together.

No doubt these two indications are always concordant, there is a constant relation between them, or, in mathematical terms, the two variables which measure these two muscular sensations do not appear to us as independent; or again, to avoid an appeal to mathematical notions already rather refined, we may go back to the language of the preceding chapter and enunciate the same fact as follows: If two sensations of convergence, A and B, are indistinguishable, the two sensations of accommodation, and , which respectively accompany them, will be equally indistinguishable.

No doubt these two signs are always in agreement; there's a consistent relationship between them. In mathematical terms, the two variables that measure these two muscular sensations don't seem independent to us. To avoid relying on somewhat complex mathematical ideas, we can restate this using the terminology from the previous chapter: If two sensations of convergence, A and B, are indistinguishable, then the two sensations of accommodation, and , which accompany them, will also be indistinguishable.

But here we have, so to speak, an experimental fact; a priori nothing prevents our supposing the contrary, and if the contrary takes place, if these two muscular sensations vary independently of one another, we shall have to take account of one more independent variable, and 'complete visual space' will appear to us as a physical continuum of four dimensions.

But here we have, so to speak, an experimental fact; a priori nothing prevents us from assuming the opposite, and if the opposite happens, if these two muscular sensations change independently of each other, we will need to consider one more independent variable, and 'complete visual space' will seem to us as a physical continuum of four dimensions.

We have here even, I will add, a fact of external experience. Nothing prevents our supposing that a being with a mind like ours, having the same sense organs that we have, may be placed in a world where light would only reach him after having traversed reflecting media of complicated form. The two indications which serve us in judging distances would cease to be connected by a constant relation. A being who should achieve in such a world the education of his senses would no doubt attribute four dimensions to complete visual space.

We have here, I would add, a fact of external experience. Nothing stops us from imagining a being with a mind like ours, having the same sense organs we do, placed in a world where light only reaches them after passing through complex reflective materials. The two cues we use to judge distances would no longer have a consistent relationship. A being who managed to train their senses in such a world would likely attribute four dimensions to complete visual space.

Tactile Space and Motor Space.—'Tactile space' is still more complicated than visual space and farther removed from geometric space. It is superfluous to repeat for touch the discussion I have given for sight.[Pg 69]

Tactile and Motor Space.—'Tactile space' is even more complex than visual space and is more distant from geometric space. There's no need to go over the discussion for touch that I've already covered for sight.[Pg 69]

But apart from the data of sight and touch, there are other sensations which contribute as much and more than they to the genesis of the notion of space. These are known to every one; they accompany all our movements, and are usually called muscular sensations.

But besides the information we get from sight and touch, there are other feelings that contribute just as much, if not more, to our understanding of space. Everyone knows about these; they come with all our movements, and they're usually referred to as muscular sensations.

The corresponding frame constitutes what may be called motor space.

The corresponding frame represents what can be referred to as motor space.

Each muscle gives rise to a special sensation capable of augmenting or of diminishing, so that the totality of our muscular sensations will depend upon as many variables as we have muscles. From this point of view, motor space would have as many dimensions as we have muscles.

Each muscle creates a distinct feeling that can increase or decrease, so the overall experience of our muscle sensations relies on as many factors as we have muscles. From this perspective, motor space would have as many dimensions as we have muscles.

I know it will be said that if the muscular sensations contribute to form the notion of space, it is because we have the sense of the direction of each movement and that it makes an integrant part of the sensation. If this were so, if a muscular sensation could not arise except accompanied by this geometric sense of direction, geometric space would indeed be a form imposed upon our sensibility.

I understand it will be argued that if muscle sensations help us understand space, it's because we feel the direction of each movement, which is a key part of the sensation. If that were true, and if a muscle sensation could only occur with this geometric sense of direction, then geometric space would truly be a framework imposed on our ability to sense.

But I perceive nothing at all of this when I analyze my sensations.

But I don't notice any of this when I think about my feelings.

What I do see is that the sensations which correspond to movements in the same direction are connected in my mind by a mere association of ideas. It is to this association that what we call 'the sense of direction' is reducible. This feeling therefore can not be found in a single sensation.

What I see is that the sensations linked to movements in the same direction are connected in my mind by a simple association of ideas. It's this association that we refer to as 'the sense of direction.' So, this feeling can’t be isolated in a single sensation.

This association is extremely complex, for the contraction of the same muscle may correspond, according to the position of the limbs, to movements of very different direction.

This connection is really complicated, because the same muscle contraction can result in movements that go in very different directions, depending on the position of the limbs.

Besides, it is evidently acquired; it is, like all associations of ideas, the result of a habit; this habit itself results from very numerous experiences; without any doubt, if the education of our senses had been accomplished in a different environment, where we should have been subjected to different impressions, contrary habits would have arisen and our muscular sensations would have been associated according to other laws.

Besides, it's clearly something we've learned; it's, like all connections of ideas, the result of a habit; this habit itself comes from countless experiences; without a doubt, if our sensory education had happened in a different environment, where we were exposed to different impressions, we would have developed different habits, and our physical sensations would have been linked according to different rules.

Characteristics of Perceptual Space.—Thus perceptual space, under its triple form, visual, tactile and motor, is essentially different from geometric space.[Pg 70]

Features of Perceptual Space.—So, perceptual space, in its three forms—visual, tactile, and motor—is fundamentally different from geometric space.[Pg 70]

It is neither homogeneous, nor isotropic; one can not even say that it has three dimensions.

It’s neither uniform nor isotropic; you can’t even say it has three dimensions.

It is often said that we 'project' into geometric space the objects of our external perception; that we 'localize' them.

It’s often said that we 'project' the objects we perceive in the outside world into geometric space; that we 'place' them.

Has this a meaning, and if so what?

Does this have a meaning, and if so, what is it?

Does it mean that we represent to ourselves external objects in geometric space?

Does it mean that we represent external objects to ourselves in geometric space?

Our representations are only the reproduction of our sensations; they can therefore be ranged only in the same frame as these, that is to say, in perceptual space.

Our representations are just a recreation of our sensations; they can only be placed in the same context as these, meaning in perceptual space.

It is as impossible for us to represent to ourselves external bodies in geometric space, as it is for a painter to paint on a plane canvas objects with their three dimensions.

It is just as impossible for us to visualize external objects in geometric space as it is for a painter to depict three-dimensional objects on a flat canvas.

Perceptual space is only an image of geometric space, an image altered in shape by a sort of perspective, and we can represent to ourselves objects only by bringing them under the laws of this perspective.

Perceptual space is just a representation of geometric space, a representation distorted in form by a kind of perspective, and we can understand objects only by fitting them into the rules of this perspective.

Therefore we do not represent to ourselves external bodies in geometric space, but we reason on these bodies as if they were situated in geometric space.

Therefore we do not represent to ourselves external objects in geometric space, but we reason about these objects as if they were located in geometric space.

When it is said then that we 'localize' such and such an object at such and such a point of space, what does it mean?

When we say that we 'localize' a certain object at a specific point in space, what does that mean?

It simply means that we represent to ourselves the movements it would be necessary to make to reach that object; and one may not say that to represent to oneself these movements, it is necessary to project the movements themselves in space and that the notion of space must, consequently, pre-exist.

It just means that we envision the actions needed to get to that goal; and one cannot say that in order to envision these actions, it’s necessary to project the actions themselves in space, which means that the idea of space must, therefore, already exist.

When I say that we represent to ourselves these movements, I mean only that we represent to ourselves the muscular sensations which accompany them and which have no geometric character whatever, which consequently do not at all imply the preexistence of the notion of space.

When I say that we visualize these movements, I only mean that we perceive the muscular sensations that come with them, which don’t have any geometric quality at all, and therefore don’t imply that the idea of space existed beforehand.

Change of State and Change of Position.—But, it will be said, if the idea of geometric space is not imposed upon our mind, and if, on the other hand, none of our sensations can furnish it, how could it have come into existence?

Change of State and Change of Position.—But, some might argue, if the concept of geometric space isn't something we naturally think of, and if none of our senses can provide it, how did it even come to be?

This is what we have now to examine, and it will take some time, but I can summarize in a few words the attempt at explanation that I am about to develop.[Pg 71]

This is what we need to look at now, and it will take some time, but I can summarize in a few words the explanation I'm about to present.[Pg 71]

None of our sensations, isolated, could have conducted us to the idea of space; we are led to it only in studying the laws, according to which these sensations succeed each other.

None of our feelings, on their own, could have brought us to the idea of space; we only arrive at it by examining the rules governing how these sensations follow one another.

We see first that our impressions are subject to change; but among the changes we ascertain we are soon led to make a distinction.

We first notice that our impressions can change; however, as we observe these changes, we quickly realize we need to make a distinction.

At one time we say that the objects which cause these impressions have changed state, at another time that they have changed position, that they have only been displaced.

At one point, we say that the objects causing these impressions have changed state; at another point, we say they’ve changed position, that they’ve just been moved.

Whether an object changes its state or merely its position, this is always translated for us in the same manner: by a modification in an aggregate of impressions.

Whether an object changes its state or just its position, we always interpret this in the same way: through a change in a collection of impressions.

How then could we have been led to distinguish between the two? It is easy to account for. If there has only been a change of position, we can restore the primitive aggregate of impressions by making movements which replace us opposite the mobile object in the same relative situation. We thus correct the modification that happened and we reestablish the initial state by an inverse modification.

How could we have been led to differentiate between the two? It's pretty straightforward. If there’s just been a change in position, we can go back to the original collection of impressions by moving ourselves back to face the moving object in the same relative position. In this way, we can correct the change that occurred and return to the initial state by making the opposite change.

If it is a question of sight, for example, and if an object changes its place before our eye, we can 'follow it with the eye' and maintain its image on the same point of the retina by appropriate movements of the eyeball.

If it's a matter of sight, for instance, and an object moves in front of us, we can 'follow it with our eyes' and keep its image on the same spot on the retina by moving our eyeballs accordingly.

These movements we are conscious of because they are voluntary and because they are accompanied by muscular sensations, but that does not mean that we represent them to ourselves in geometric space.

These movements are something we’re aware of because they’re voluntary and come with muscle sensations, but that doesn’t mean we visualize them in geometric space.

So what characterizes change of position, what distinguishes it from change of state, is that it can always be corrected in this way.

So what defines change of position, and what sets it apart from change of state, is that it can always be fixed this way.

It may therefore happen that we pass from the totality of impressions A to the totality B in two different ways:

It might be the case that we move from the totality of impressions A to the totality B in two different ways:

1º Involuntarily and without experiencing muscular sensations; this happens when it is the object which changes place;

1º Unintentionally and without feeling muscle sensations; this occurs when the object changes its position;

2º Voluntarily and with muscular sensations; this happens when the object is motionless, but we move so that the object has relative motion with reference to us.

2º Voluntarily and with physical sensations; this occurs when the object is still, but we move so that the object has relative motion in relation to us.

If this be so, the passage from the totality A to the totality B is only a change of position.[Pg 72]

If that's the case, moving from the totality A to the totality B is just a shift in position.[Pg 72]

It follows from this that sight and touch could not have given us the notion of space without the aid of the 'muscular sense.'

It follows from this that sight and touch couldn't have given us the idea of space without the help of the 'muscular sense.'

Not only could this notion not be derived from a single sensation or even from a series of sensations, but what is more, an immobile being could never have acquired it, since, not being able to correct by his movements the effects of the changes of position of exterior objects, he would have had no reason whatever to distinguish them from changes of state. Just as little could he have acquired it if his motions had not been voluntary or were unaccompanied by any sensations.

Not only could this idea not come from a single sensation or even a series of sensations, but what's more, a(n) immobile being could never have gained it, since, not being able to correct the effects of the changes in the positions of outside objects with its movements, it would have had no reason to differentiate them from changes in state at all. Similarly, it couldn't have acquired it if its movements weren't voluntary or were without any sensations.

Conditions of Compensation.—How is a like compensation possible, of such sort that two changes, otherwise independent of each other, reciprocally correct each other?

Compensation Terms.—How can a compensation like this be possible, where two changes that are otherwise independent from one another can correct each other?

A mind already familiar with geometry would reason as follows: Evidently, if there is to be compensation, the various parts of the external object, on the one hand, and the various sense organs, on the other hand, must be in the same relative position after the double change. And, for that to be the case, the various parts of the external object must likewise have retained in reference to each other the same relative position, and the same must be true of the various parts of our body in regard to each other.

A mind already familiar with geometry would think like this: Clearly, for there to be compensation, the different parts of the external object and the different sense organs must be in the same relative position after the double change. And for that to happen, the parts of the external object must also have kept the same relative position to each other, and the same goes for the various parts of our body in relation to each other.

In other words, the external object, in the first change, must be displaced as is a rigid solid, and so must it be with the whole of our body in the second change which corrects the first.

In other words, the external object, in the first change, must be moved like a rigid solid, and the same goes for our entire body in the second change that corrects the first.

Under these conditions, compensation may take place.

Under these circumstances, compensation may occur.

But we who as yet know nothing of geometry, since for us the notion of space is not yet formed, we can not reason thus, we can not foresee a priori whether compensation is possible. But experience teaches us that it sometimes happens, and it is from this experimental fact that we start to distinguish changes of state from changes of position.

But we who still don’t know anything about geometry, since we haven’t yet formed the idea of space, can’t reason like that; we can’t predict in advance whether compensation is possible. But experience teaches us that it sometimes happens, and it’s from this practical observation that we begin to differentiate changes in state from changes in position.

Solid Bodies and Geometry.—Among surrounding objects there are some which frequently undergo displacements susceptible of being thus corrected by a correlative movement of our own body; these are the solid bodies. The other objects,[Pg 73] whose form is variable, only exceptionally undergo like displacements (change of position without change of form). When a body changes its place and its shape, we can no longer, by appropriate movements, bring back our sense-organs into the same relative situation with regard to this body; consequently we can no longer reestablish the primitive totality of impressions.

Solid Shapes and Geometry.—Among the objects around us, there are some that often move in a way that we can correct through a corresponding movement of our own body; these are the solid bodies. The other objects,[Pg 73] which change shape, only occasionally move in the same way (changing position without changing form). When an object changes both its position and its shape, we can’t bring our sense organs back into the same relative position in relation to that object through suitable movements, and as a result, we can’t restore the original totality of our impressions.

It is only later, and as a consequence of new experiences, that we learn how to decompose the bodies of variable form into smaller elements, such that each is displaced almost in accordance with the same laws as solid bodies. Thus we distinguish 'deformations' from other changes of state; in these deformations, each element undergoes a mere change of position, which can be corrected, but the modification undergone by the aggregate is more profound and is no longer susceptible of correction by a correlative movement.

It’s only later, as a result of new experiences, that we learn how to break down variable forms into smaller elements, so that each one moves almost according to the same rules as solid objects. This way, we differentiate 'deformations' from other types of changes; in these deformations, each element just changes its position, which can be reversed, but the change experienced by the whole is deeper and can no longer be fixed by a corresponding movement.

Such a notion is already very complex and must have been relatively late in appearing; moreover it could not have arisen if the observation of solid bodies had not already taught us to distinguish changes of position.

Such an idea is already quite complex and likely emerged relatively late; also, it couldn't have come about if our observation of solid objects hadn't already taught us to recognize changes in position.

Therefore, if there were no solid bodies in nature, there would be no geometry.

So, if there were no solid objects in nature, there would be no geometry.

Another remark also deserves a moment's attention. Suppose a solid body to occupy successively the positions α and β; in its first position, it will produce on us the totality of impressions A, and in its second position the totality of impressions B. Let there be now a second solid body, having qualities entirely different from the first, for example, a different color. Suppose it to pass from the position α, where it gives us the totality of impressions , to the position β, where it gives the totality of impressions .

Another point also deserves a moment's attention. Imagine a solid object moving through positions α and β; in its first position, it will give us the complete set of impressions A, and in its second position, the complete set of impressions B. Now, consider a second solid object with qualities completely different from the first, for instance, a different color. Imagine it moving from position α, where it gives us the complete set of impressions , to position β, where it gives the complete set of impressions .

In general, the totality A will have nothing in common with the totality , nor the totality B with the totality . The transition from the totality A to the totality B and that from the totality to the totality are therefore two changes which in themselves have in general nothing in common.

In general, the totality A won't share anything in common with the totality , nor will the totality B share anything with the totality . The shift from the totality A to the totality B and the shift from the totality to the totality are therefore two changes that, by themselves, typically have nothing in common.

And yet we regard these two changes both as displacements and, furthermore, we consider them as the same displacement. How can that be?[Pg 74]

And yet we see these two changes as displacements, and we also think of them as the same displacement. How is that possible?[Pg 74]

It is simply because they can both be corrected by the same correlative movement of our body.

It’s just because they can both be fixed by the same corresponding movement of our body.

'Correlative movement' therefore constitutes the sole connection between two phenomena which otherwise we never should have dreamt of likening.

'Correlative movement' therefore constitutes the sole connection between two phenomena that we otherwise would never have thought to compare.

On the other hand, our body, thanks to the number of its articulations and muscles, may make a multitude of different movements; but all are not capable of 'correcting' a modification of external objects; only those will be capable of it in which our whole body, or at least all those of our sense-organs which come into play, are displaced as a whole, that is, without their relative positions varying, or in the fashion of a solid body.

On the other hand, our body, with its many joints and muscles, can perform a wide range of movements; however, not all of these movements can 'correct' changes in external objects. Only those movements where our entire body, or at least all the sense organs involved, move together as a whole—meaning their relative positions don’t change, like a solid object—can achieve that.

To summarize:

To sum up:

1º We are led at first to distinguish two categories of phenomena:

1º We are initially led to identify two categories of phenomena:

Some, involuntary, unaccompanied by muscular sensations, are attributed by us to external objects; these are external changes;

Some changes happen without us trying, and without any muscle sensations, we attribute them to things outside ourselves; these are external changes;

Others, opposite in character and attributed by us to the movements of our own body, are internal changes;

Others, in contrast and attributed by us to the movements of our own body, are internal changes;

2º We notice that certain changes of each of these categories may be corrected by a correlative change of the other category;

2º We see that certain changes in each of these categories can be adjusted by a corresponding change in the other category;

3º We distinguish among external changes those which have thus a correlative in the other category; these we call displacements; and just so among the internal changes, we distinguish those which have a correlative in the first category.

3º We differentiate external changes that have a corresponding element in the other category; we refer to these as displacements; similarly, among the internal changes, we identify those that have a corresponding element in the first category.

Thus are defined, thanks to this reciprocity, a particular class of phenomena which we call displacements.

Thus, thanks to this reciprocity, we define a specific class of phenomena that we call displacements.

The laws of these phenomena constitute the object of geometry.

The rules of these phenomena make up the subject of geometry.

Law of Homogeneity.—The first of these laws is the law of homogeneity.

Homogeneity Law.—The first of these laws is the law of homogeneity.

Suppose that, by an external change α, we pass from the totality of impressions A to the totality B, then that this change α is corrected by a correlative voluntary movement β, so that we are brought back to the totality A.

Suppose that due to an external change α, we move from the totality of impressions A to the totality B, then this change α is adjusted by a corresponding voluntary movement β, so that we return to the totality A.

Suppose now that another external change α´ makes us pass anew from the totality A to the totality B.

Suppose now that another external change α´ causes us to transition once again from the totality A to the totality B.

Experience teaches us that this change α´ is, like α, susceptible of being corrected by a correlative voluntary movement[Pg 75] β´ and that this movement β´ corresponds to the same muscular sensations as the movement β which corrected α.

Experience teaches us that this change α´ can, like α, be corrected by a related voluntary movement [Pg 75] β´, and this movement β´ corresponds to the same muscular sensations as the movement β that corrected α.

This fact is usually enunciated by saying that space is homogeneous and isotropic.

This fact is usually stated by saying that space is the same in all directions and uniform.

It may also be said that a movement which has once been produced may be repeated a second and a third time, and so on, without its properties varying.

It can also be said that a movement that has happened once can be repeated a second time, a third time, and so on, without changing its properties.

In the first chapter, where we discussed the nature of mathematical reasoning, we saw the importance which must be attributed to the possibility of repeating indefinitely the same operation.

In the first chapter, where we talked about the nature of mathematical reasoning, we highlighted the significance of being able to repeat the same operation indefinitely.

It is from this repetition that mathematical reasoning gets its power; it is, therefore, thanks to the law of homogeneity, that it has a hold on the geometric facts.

It’s through this repetition that mathematical reasoning gains its strength; it’s, therefore, because of the law of homogeneity that it applies to geometric facts.

For completeness, to the law of homogeneity should be added a multitude of other analogous laws, into the details of which I do not wish to enter, but which mathematicians sum up in a word by saying that displacements form 'a group.'

For completeness, the law of homogeneity should also include a number of other similar laws, which I don't want to delve into, but that mathematicians summarize with the term 'a group' to describe how displacements work.

The Non-Euclidean World.—If geometric space were a frame imposed on each of our representations, considered individually, it would be impossible to represent to ourselves an image stripped of this frame, and we could change nothing of our geometry.

The Non-Euclidean Universe.—If geometric space was just a frame placed on each of our individual representations, it would be impossible for us to imagine an image without this frame, and we wouldn’t be able to change our geometry at all.

But this is not the case; geometry is only the résumé of the laws according to which these images succeed each other. Nothing then prevents us from imagining a series of representations, similar in all points to our ordinary representations, but succeeding one another according to laws different from those to which we are accustomed.

But this is not true; geometry is simply the summary of the rules by which these images follow one another. Therefore, nothing stops us from imagining a series of representations, identical in every way to our usual representations, but following different rules than those we're used to.

We can conceive then that beings who received their education in an environment where these laws were thus upset might have a geometry very different from ours.

We can imagine that individuals who were educated in an environment where these laws were disrupted might have a very different understanding of geometry than we do.

Suppose, for example, a world enclosed in a great sphere and subject to the following laws:

Suppose, for example, a world enclosed in a huge sphere and subject to the following rules:

The temperature is not uniform; it is greatest at the center, and diminishes in proportion to the distance from the center, to sink to absolute zero when the sphere is reached in which this world is enclosed.[Pg 76]

The temperature isn’t the same everywhere; it peaks at the center and decreases as you move away from the center, reaching absolute zero when you get to the boundary of the sphere that contains this world.[Pg 76]

To specify still more precisely the law in accordance with which this temperature varies: Let R be the radius of the limiting sphere; let r be the distance of the point considered from the center of this sphere. The absolute temperature shall be proportional to R2r2.

To explain more clearly the rule that governs how this temperature changes: Let R be the radius of the limiting sphere; let r be the distance of the point in question from the center of this sphere. The absolute temperature will be proportional to R2r2.

I shall further suppose that, in this world, all bodies have the same coefficient of dilatation, so that the length of any rule is proportional to its absolute temperature.

I will also assume that, in this world, all materials have the same rate of expansion, so that the length of any measuring stick is proportional to its absolute temperature.

Finally, I shall suppose that a body transported from one point to another of different temperature is put immediately into thermal equilibrium with its new environment.

Finally, I will assume that a body moved from one point to another with a different temperature is placed immediately into thermal equilibrium with its new surroundings.

Nothing in these hypotheses is contradictory or unimaginable.

Nothing in these theories is inconsistent or hard to believe.

A movable object will then become smaller and smaller in proportion as it approaches the limit-sphere.

A moving object will get smaller and smaller as it gets closer to the limit-sphere.

Note first that, though this world is limited from the point of view of our ordinary geometry, it will appear infinite to its inhabitants.

Note first that, even though this world is limited according to our typical understanding of geometry, it will seem infinite to the people living in it.

In fact, when these try to approach the limit-sphere, they cool off and become smaller and smaller. Therefore the steps they take are also smaller and smaller, so that they can never reach the limiting sphere.

In fact, when these try to get closer to the limit-sphere, they cool down and get smaller and smaller. As a result, the steps they take also become smaller and smaller, so they can never reach the limiting sphere.

If, for us, geometry is only the study of the laws according to which rigid solids move, for these imaginary beings it will be the study of the laws of motion of solids distorted by the differences of temperature just spoken of.

If, for us, geometry is just the study of how rigid solids move, for these imaginary beings it will be the study of the motion laws of solids affected by the temperature differences just mentioned.

No doubt, in our world, natural solids likewise undergo variations of form and volume due to warming or cooling. But we neglect these variations in laying the foundations of geometry, because, besides their being very slight, they are irregular and consequently seem to us accidental.

No doubt, in our world, natural solids also change in shape and size because of heating or cooling. However, we ignore these changes when establishing the basics of geometry because, aside from being very minor, they are irregular and therefore seem random to us.

In our hypothetical world, this would no longer be the case, and these variations would follow regular and very simple laws.

In our imagined world, this wouldn't be true anymore, and these variations would adhere to straightforward and very basic rules.

Moreover, the various solid pieces of which the bodies of its inhabitants would be composed would undergo the same variations of form and volume.

Moreover, the different solid parts that make up the bodies of its inhabitants would also experience the same changes in shape and size.

I will make still another hypothesis; I will suppose light traverses media diversely refractive and such that the index of [Pg 77]refraction is inversely proportional to R2r2. It is easy to see that, under these conditions, the rays of light would not be rectilinear, but circular.

I will propose one more hypothesis; I'll assume that light passes through materials with different refractive properties and that the refractive index is inversely proportional to R2r2. It's straightforward to observe that, in this scenario, the rays of light would not travel in straight lines, but rather in circular paths.

To justify what precedes, it remains for me to show that certain changes in the position of external objects can be corrected by correlative movements of the sentient beings inhabiting this imaginary world, and that in such a way as to restore the primitive aggregate of impressions experienced by these sentient beings.

To support what has been discussed, I need to demonstrate that specific changes in the placement of external objects can be corrected through corresponding movements of the aware beings living in this imagined world, allowing the original collection of experiences felt by these beings to be restored.

Suppose in fact that an object is displaced, undergoing deformation, not as a rigid solid, but as a solid subjected to unequal dilatations in exact conformity to the law of temperature above supposed. Permit me for brevity to call such a movement a non-Euclidean displacement.

Suppose that an object is moved, changing shape, not as a rigid solid, but as a solid experiencing uneven expansions in line with the temperature law previously mentioned. For the sake of simplicity, let's refer to this kind of movement as a non-Euclidean displacement.

If a sentient being happens to be in the neighborhood, his impressions will be modified by the displacement of the object, but he can reestablish them by moving in a suitable manner. It suffices if finally the aggregate of the object and the sentient being, considered as forming a single body, has undergone one of those particular displacements I have just called non-Euclidean. This is possible if it be supposed that the limbs of these beings dilate according to the same law as the other bodies of the world they inhabit.

If a sentient being is nearby, their perceptions will be affected by the movement of the object, but they can restore those perceptions by moving appropriately. It’s enough that, in the end, the combination of the object and the sentient being, viewed as one whole, has experienced one of those specific movements I referred to as non-Euclidean. This is feasible if we assume that the limbs of these beings expand following the same principles as other bodies in the world they live in.

Although from the point of view of our ordinary geometry there is a deformation of the bodies in this displacement and their various parts are no longer in the same relative position, nevertheless we shall see that the impressions of the sentient being have once more become the same.

Although from the perspective of our everyday geometry there is a distortion of the bodies in this movement and their different parts are no longer in the same relative position, we will see that the perceptions of the conscious being have once again become identical.

In fact, though the mutual distances of the various parts may have varied, yet the parts originally in contact are again in contact. Therefore the tactile impressions have not changed.

In fact, even though the distances between the different parts may have changed, the parts that were originally touching are again touching. So, the tactile impressions haven't changed.

On the other hand, taking into account the hypothesis made above in regard to the refraction and the curvature of the rays of light, the visual impressions will also have remained the same.

On the other hand, considering the assumption made earlier about the bending and curvature of light rays, the visual impressions would also still be the same.

These imaginary beings will therefore like ourselves be led to classify the phenomena they witness and to distinguish among them the 'changes of position' susceptible of correction by a correlative voluntary movement.

These imaginary beings will, like us, categorize the events they observe and differentiate between the 'changes of position' that can be adjusted by a corresponding voluntary movement.

If they construct a geometry, it will not be, as ours is, the[Pg 78] study of the movements of our rigid solids; it will be the study of the changes of position which they will thus have distinguished and which are none other than the 'non-Euclidean displacements'; it will be non-Euclidean geometry.

If they create a geometry, it won’t be like ours, which is the[Pg 78] study of the movements of our solid objects; it will be the study of the changes in position that they will have identified, which are nothing but the 'non-Euclidean displacements'; it will be non-Euclidean geometry.

Thus beings like ourselves, educated in such a world, would not have the same geometry as ours.

Thus, beings like us, raised in such a world, wouldn't have the same understanding of geometry as we do.

The World of Four Dimensions.—We can represent to ourselves a four-dimensional world just as well as a non-Euclidean.

The 4D World.—We can imagine a four-dimensional world just as easily as a non-Euclidean one.

The sense of sight, even with a single eye, together with the muscular sensations relative to the movements of the eyeball, would suffice to teach us space of three dimensions.

The sense of sight, even with just one eye, along with the muscle sensations related to the movement of the eyeball, would be enough to teach us about three-dimensional space.

The images of external objects are painted on the retina, which is a two-dimensional canvas; they are perspectives.

The images of outside objects are captured on the retina, which is like a two-dimensional canvas; they are perspectives.

But, as eye and objects are movable, we see in succession various perspectives of the same body, taken from different points of view.

But, since the eye and objects can move, we see different perspectives of the same thing in succession, viewed from different angles.

At the same time, we find that the transition from one perspective to another is often accompanied by muscular sensations.

At the same time, we notice that switching from one viewpoint to another is often accompanied by physical sensations.

If the transition from the perspective A to the perspective B, and that from the perspective to the perspective are accompanied by the same muscular sensations, we liken them one to the other as operations of the same nature.

If the shift from perspective A to perspective B, and from perspective to perspective are accompanied by the same muscular sensations, we compare them to each other as processes of the same kind.

Studying then the laws according to which these operations combine, we recognize that they form a group, which has the same structure as that of the movements of rigid solids.

Studying the rules by which these operations combine, we see that they create a group, which has the same structure as that of the movements of rigid solids.

Now, we have seen that it is from the properties of this group we have derived the notion of geometric space and that of three dimensions.

Now, we've seen that it's from the properties of this group that we've derived the idea of geometric space and that of three dimensions.

We understand thus how the idea of a space of three dimensions could take birth from the pageant of these perspectives, though each of them is of only two dimensions, since they follow one another according to certain laws.

We can see how the concept of three-dimensional space could emerge from the display of these perspectives, even though each one only has two dimensions, since they follow one another according to certain laws.

Well, just as the perspective of a three-dimensional figure can be made on a plane, we can make that of a four-dimensional figure on a picture of three (or of two) dimensions. To a geometer this is only child's play.

Well, just like how you can represent a three-dimensional shape on a flat surface, we can represent a four-dimensional shape in a picture that shows three (or two) dimensions. For a geometer, this is just simple fun.

We can even take of the same figure several perspectives from several different points of view.[Pg 79]

We can even look at the same figure from several different perspectives.[Pg 79]

We can easily represent to ourselves these perspectives, since they are of only three dimensions.

We can easily visualize these perspectives because they only have three dimensions.

Imagine that the various perspectives of the same object succeed one another, and that the transition from one to the other is accompanied by muscular sensations.

Imagine that the different views of the same object follow one after another, and that the shift from one to the next comes with physical sensations.

We shall of course consider two of these transitions as two operations of the same nature when they are associated with the same muscular sensations.

We will definitely view two of these transitions as two operations of the same kind when they relate to the same muscle sensations.

Nothing then prevents us from imagining that these operations combine according to any law we choose, for example, so as to form a group with the same structure as that of the movements of a rigid solid of four dimensions.

Nothing then stops us from imagining that these operations can combine according to any rule we want, for instance, in a way that creates a group with the same structure as the movements of a rigid solid in four dimensions.

Here there is nothing unpicturable, and yet these sensations are precisely those which would be felt by a being possessed of a two-dimensional retina who could move in space of four dimensions. In this sense we may say the fourth dimension is imaginable.

Here, nothing is beyond imagination, and yet these feelings are exactly what someone with a two-dimensional retina would experience while moving through four-dimensional space. In this way, we can say that the fourth dimension is conceivable.

Conclusions.—We see that experience plays an indispensable rôle in the genesis of geometry; but it would be an error thence to conclude that geometry is, even in part, an experimental science.

Conclusions.—We see that experience plays a crucial role in the development of geometry; however, it would be a mistake to conclude from this that geometry is, even partially, an experimental science.

If it were experimental, it would be only approximative and provisional. And what rough approximation!

If it were experimental, it would only be rough and temporary. And what a rough approximation!

Geometry would be only the study of the movements of solids; but in reality it is not occupied with natural solids, it has for object certain ideal solids, absolutely rigid, which are only a simplified and very remote image of natural solids.

Geometry would only be the study of the movements of solids; but in reality, it doesn't deal with natural solids. Its focus is on certain ideal solids that are completely rigid, which are just a simplified and very distant representation of natural solids.

The notion of these ideal solids is drawn from all parts of our mind, and experience is only an occasion which induces us to bring it forth from them.

The idea of these perfect shapes comes from every part of our mind, and experience is just a trigger that encourages us to bring it out.

The object of geometry is the study of a particular 'group'; but the general group concept pre-exists, at least potentially, in our minds. It is imposed on us, not as form of our sense, but as form of our understanding.

The purpose of geometry is to explore a specific 'group'; however, the broad idea of a group already exists, at least in potential, in our minds. It's given to us, not as a shape from our senses, but as a concept in our understanding.

Only, from among all the possible groups, that must be chosen which will be, so to speak, the standard to which we shall refer natural phenomena.

Only, from all the possible groups, we must choose the one that will serve as the standard for referencing natural phenomena.

Experience guides us in this choice without forcing it upon[Pg 80] us; it tells us not which is the truest geometry, but which is the most convenient.

Experience guides us in this choice without imposing it on us; it tells us not which is the truest geometry, but which is the most convenient.

Notice that I have been able to describe the fantastic worlds above imagined without ceasing to employ the language of ordinary geometry.

Notice that I have been able to describe the amazing worlds above imagined without stopping the use of everyday geometry.

And, in fact, we should not have to change it if transported thither.

And actually, we shouldn't need to change it if we were taken there.

Beings educated there would doubtless find it more convenient to create a geometry different from ours, and better adapted to their impressions. As for us, in face of the same impressions, it is certain we should find it more convenient not to change our habits.

Beings educated there would definitely find it easier to create a geometry that's different from ours and better suited to their experiences. As for us, in light of the same experiences, it's clear that we would prefer not to change our habits.


CHAPTER V

Experience and Geometry

1. Already in the preceding pages I have several times tried to show that the principles of geometry are not experimental facts and that in particular Euclid's postulate can not be proven experimentally.

1. In the previous pages, I have tried multiple times to show that the principles of geometry are not based on experimental facts and that, specifically, Euclid's postulate cannot be proven through experimentation.

However decisive appear to me the reasons already given, I believe I should emphasize this point because here a false idea is profoundly rooted in many minds.

However clear the reasons I've provided seem to me, I think I should highlight this point because a false idea is deeply ingrained in many people's minds.

2. If we construct a material circle, measure its radius and circumference, and see if the ratio of these two lengths is equal to π, what shall we have done? We shall have made an experiment on the properties of the matter with which we constructed this round thing, and of that of which the measure used was made.

2. If we build a physical circle, measure its radius and circumference, and check if the ratio of these two lengths equals π, what will we have done? We will have conducted an experiment on the properties of the material used to create this round thing, as well as that of the measuring tool.

3. Geometry and Astronomy.—The question has also been put in another way. If Lobachevski's geometry is true, the parallax of a very distant star will be finite; if Riemann's is true, it will be negative. These are results which seem within the reach of experiment, and there have been hopes that astronomical observations might enable us to decide between the three geometries.

3. Geometry and Astronomy.—The question has also been posed in another way. If Lobachevski's geometry is correct, the parallax of a very distant star will be finite; if Riemann's is accurate, it will be negative. These are outcomes that seem feasible to test, and there have been hopes that astronomical observations might help us determine which of the three geometries is true.

But in astronomy 'straight line' means simply 'path of a ray of light.'

But in astronomy, 'straight line' just means 'the path of a ray of light.'

If therefore negative parallaxes were found, or if it were demonstrated that all parallaxes are superior to a certain limit, two courses would be open to us; we might either renounce Euclidean geometry, or else modify the laws of optics and suppose that light does not travel rigorously in a straight line.

If negative parallaxes were found, or if it were proven that all parallaxes exceed a certain limit, we would have two options: we could either abandon Euclidean geometry, or modify the laws of optics and assume that light doesn't always travel in a straight line.

It is needless to add that all the world would regard the latter solution as the more advantageous.

It's unnecessary to say that everyone would see the latter solution as the better option.

The Euclidean geometry has, therefore, nothing to fear from fresh experiments.

The Euclidean geometry has nothing to fear from new experiments.

4. Is the position tenable, that certain phenomena, possible in Euclidean space, would be impossible in non-Euclidean space,[Pg 82] so that experience, in establishing these phenomena, would directly contradict the non-Euclidean hypothesis? For my part I think no such question can be put. To my mind it is precisely equivalent to the following, whose absurdity is patent to all eyes: are there lengths expressible in meters and centimeters, but which can not be measured in fathoms, feet and inches, so that experience, in ascertaining the existence of these lengths, would directly contradict the hypothesis that there are fathoms divided into six feet?

4. Can we argue that certain phenomena, which are possible in Euclidean space, would be impossible in non-Euclidean space,[Pg 82] so that our experiences in identifying these phenomena would directly contradict the non-Euclidean theory? I believe that such a question is not valid. To me, it's exactly the same as asking, which is clearly absurd to everyone: are there lengths that can be expressed in meters and centimeters, but cannot be measured in fathoms, feet, and inches, so that our experiences in determining the presence of these lengths would directly contradict the idea that there are fathoms divided into six feet?

Examine the question more closely. I suppose that the straight line possesses in Euclidean space any two properties which I shall call A and B; that in non-Euclidean space it still possesses the property A, but no longer has the property B; finally I suppose that in both Euclidean and non-Euclidean space the straight line is the only line having the property A.

Examine the question more closely. I suppose that the straight line has two properties in Euclidean space, which I will call A and B; that in non-Euclidean space it still has property A, but no longer has property B; finally, I suppose that in both Euclidean and non-Euclidean space, the straight line is the only line that has property A.

If this were so, experience would be capable of deciding between the hypothesis of Euclid and that of Lobachevski. It would be ascertained that a definite concrete object, accessible to experiment, for example, a pencil of rays of light, possesses the property A; we should conclude that it is rectilinear, and then investigate whether or not it has the property B.

If this were the case, experience would be able to choose between Euclid's hypothesis and Lobachevski's. We would determine that a specific concrete object, which can be tested, like a beam of light, has the property A; we would then conclude that it is straight and investigate whether or not it has the property B.

But this is not so; no property exists which, like this property A, can be an absolute criterion enabling us to recognize the straight line and to distinguish it from every other line.

But this is not the case; there is no property that, like this property A, can serve as an absolute standard for identifying the straight line and distinguishing it from any other line.

Shall we say, for instance: "the following is such a property: the straight line is a line such that a figure of which this line forms a part can be moved without the mutual distances of its points varying and so that all points of this line remain fixed"?

Shall we say, for example: "the following is such a property: the straight line is a line such that a shape that includes this line can be moved without changing the distances between any of its points, and so that all points of this line stay in place"?

This, in fact, is a property which, in Euclidean or non-Euclidean space, belongs to the straight and belongs only to it. But how shall we ascertain experimentally whether it belongs to this or that concrete object? It will be necessary to measure distances, and how shall one know that any concrete magnitude which I have measured with my material instrument really represents the abstract distance?

This is actually a property that, whether in Euclidean or non-Euclidean space, applies only to straight lines. But how can we determine through experiments if it applies to this specific object or that one? We will need to measure distances, but how can we be sure that the concrete measurement I took with my physical instrument truly reflects the abstract distance?

We have only pushed back the difficulty.

We have only postponed the challenge.

In reality the property just enunciated is not a property of the straight line alone, it is a property of the straight line and[Pg 83] distance. For it to serve as absolute criterion, we should have to be able to establish not only that it does not also belong to a line other than the straight and to distance, but in addition that it does not belong to a line other than the straight and to a magnitude other than distance. Now this is not true.

In reality, the property just mentioned isn't exclusive to straight lines; it's a property of both straight lines and distance. For it to be a definitive criterion, we would need to demonstrate not just that it isn't shared with any other line besides the straight one and distance, but also that it isn't shared with any other line besides the straight one and any other measurement besides distance. But that's not the case.

It is therefore impossible to imagine a concrete experiment which can be interpreted in the Euclidean system and not in the Lobachevskian system, so that I may conclude:

It is therefore impossible to envision a specific experiment that can be interpreted in the Euclidean system and not in the Lobachevskian system, so I can conclude:

No experience will ever be in contradiction to Euclid's postulate; nor, on the other hand, will any experience ever contradict the postulate of Lobachevski.

No experience will ever conflict with Euclid's postulate; nor will any experience contradict Lobachevski's postulate.

5. But it is not enough that the Euclidean (or non-Euclidean) geometry can never be directly contradicted by experience. Might it not happen that it can accord with experience only by violating the principle of sufficient reason or that of the relativity of space?

5. But it's not enough that Euclidean (or non-Euclidean) geometry can never be directly contradicted by experience. Could it happen that it only aligns with experience by breaking the principle of sufficient reason or the relativity of space?

I will explain myself: consider any material system; we shall have to regard, on the one hand, 'the state' of the various bodies of this system (for instance, their temperature, their electric potential, etc.), and, on the other hand, their position in space; and among the data which enable us to define this position we shall, moreover, distinguish the mutual distances of these bodies, which define their relative positions, from the conditions which define the absolute position of the system and its absolute orientation in space.

I will explain myself: consider any physical system; we need to look at, on one hand, 'the state' of the different parts of this system (like their temperature, electric potential, etc.), and, on the other hand, their location in space; and among the information that helps us define this location, we will also differentiate between the distances between these parts, which define their relative positions, and the factors that define the absolute position of the system and its overall orientation in space.

The laws of the phenomena which will happen in this system will depend on the state of these bodies and their mutual distances; but, because of the relativity and passivity of space, they will not depend on the absolute position and orientation of the system.

The laws of the phenomena that will occur in this system will depend on the state of these bodies and their distances from each other; however, due to the relativity and passivity of space, they will not depend on the absolute position and orientation of the system.

In other words, the state of the bodies and their mutual distances at any instant will depend solely on the state of these same bodies and on their mutual distances at the initial instant, but will not at all depend on the absolute initial position of the system or on its absolute initial orientation. This is what for brevity I shall call the law of relativity.

In other words, the condition of the bodies and how far apart they are at any moment will only rely on the condition of those same bodies and their distances from each other at the starting point, and it won't depend at all on the system's absolute starting position or its absolute initial orientation. For short, I’ll refer to this as the law of relativity.

Hitherto I have spoken as a Euclidean geometer. As I have said, an experience, whatever it be, admits of an interpretation on the Euclidean hypothesis; but it admits of one equally on[Pg 84] the non-Euclidean hypothesis. Well, we have made a series of experiments; we have interpreted them on the Euclidean hypothesis, and we have recognized that these experiments thus interpreted do not violate this 'law of relativity.'

So far, I've been talking like a Euclidean geometer. As I mentioned, any experience can be interpreted based on the Euclidean approach, but it can also be interpreted just as well based on the non-Euclidean approach. We've conducted a number of experiments; we've looked at them from the Euclidean perspective, and we've found that these experiments, when interpreted this way, don't contradict the 'law of relativity.'

We now interpret them on the non-Euclidean hypothesis: this is always possible; only the non-Euclidean distances of our different bodies in this new interpretation will not generally be the same as the Euclidean distances in the primitive interpretation.

We now understand them based on the non-Euclidean hypothesis: this is always an option; however, the non-Euclidean distances of our different bodies in this new interpretation will not typically be the same as the Euclidean distances in the original interpretation.

Will our experiments, interpreted in this new manner, still be in accord with our 'law of relativity'? And if there were not this accord, should we not have also the right to say experience had proven the falsity of the non-Euclidean geometry?

Will our experiments, viewed in this new way, still align with our 'law of relativity'? And if they don't align, shouldn't we also have the right to claim that experience has shown non-Euclidean geometry to be false?

It is easy to see that this is an idle fear; in fact, to apply the law of relativity in all rigor, it must be applied to the entire universe. For if only a part of this universe were considered, and if the absolute position of this part happened to vary, the distances to the other bodies of the universe would likewise vary, their influence on the part of the universe considered would consequently augment or diminish, which might modify the laws of the phenomena happening there.

It’s clear that this is just an unfounded worry; actually, to fully apply the law of relativity, it has to encompass the whole universe. If we were to look at just a portion of the universe, and the absolute position of that part changed, the distances to the other bodies in the universe would change as well. This would affect their influence on the part of the universe being considered, which could then alter the laws of the phenomena occurring there.

But if our system is the entire universe, experience is powerless to give information about its absolute position and orientation in space. All that our instruments, however perfected they may be, can tell us will be the state of the various parts of the universe and their mutual distances.

But if our system is the entire universe, experience can’t provide any information about its absolute position and orientation in space. No matter how advanced our instruments are, all they can tell us is the condition of different parts of the universe and how far apart they are from each other.

So our law of relativity may be thus enunciated:

So our law of relativity can be stated like this:

The readings we shall be able to make on our instruments at any instant will depend only on the readings we could have made on these same instruments at the initial instant.

The readings we can take from our instruments at any moment will depend only on the readings we could have taken from those same instruments at the starting point.

Now such an enunciation is independent of every interpretation of experimental facts. If the law is true in the Euclidean interpretation, it will also be true in the non-Euclidean interpretation.

Now, this statement stands apart from any interpretation of experimental facts. If the law holds true in the Euclidean interpretation, it will also be valid in the non-Euclidean interpretation.

Allow me here a short digression. I have spoken above of the data which define the position of the various bodies of the system; I should likewise have spoken of those which define their velocities; I should then have had to distinguish the velocities with which the mutual distances of the different bodies vary;[Pg 85] and, on the other hand, the velocities of translation and rotation of the system, that is to say, the velocities with which its absolute position and orientation vary.

Allow me to take a brief detour. I’ve mentioned the data that define the positions of the various bodies in the system; I should also mention the data that define their velocities. Then, I would need to distinguish between the velocities at which the distances between the different bodies change,[Pg 85] and, on the other hand, the translation and rotation velocities of the system, meaning the rates at which its overall position and orientation change.

To fully satisfy the mind, the law of relativity should be expressible thus:

To fully satisfy the mind, the law of relativity should be expressed like this:

The state of bodies and their mutual distances at any instant, as well as the velocities with which these distances vary at this same instant, will depend only on the state of those bodies and their mutual distances at the initial instant, and the velocities with which these distances vary at this initial instant, but they will not depend either upon the absolute initial position of the system, or upon its absolute orientation, or upon the velocities with which this absolute position and orientation varied at the initial instant.

The condition of bodies and how far apart they are at any moment, as well as the speeds at which those distances change at that moment, will only rely on the condition of those bodies and their distances at the starting moment, and the speeds at which those distances changed at that starting moment. However, they will not depend on the absolute initial location of the system, its absolute orientation, or the speeds at which that absolute location and orientation changed at the starting moment.

Unhappily the law thus enunciated is not in accord with experiments, at least as they are ordinarily interpreted.

Unfortunately, the law as stated doesn't match up with experiments, at least not how they are usually understood.

Suppose a man be transported to a planet whose heavens were always covered with a thick curtain of clouds, so that he could never see the other stars; on that planet he would live as if it were isolated in space. Yet this man could become aware that it turned, either by measuring its oblateness (done ordinarily by the aid of astronomic observations, but capable of being done by purely geodetic means), or by repeating the experiment of Foucault's pendulum. The absolute rotation of this planet could therefore be made evident.

Imagine a man is taken to a planet where the sky is always covered by a thick layer of clouds, so he can never see the other stars; on that planet, he would feel completely isolated in space. Still, this man could realize that the planet is rotating, either by measuring its shape (which is usually done with astronomical observations but can also be done using purely geodetic methods) or by conducting Foucault's pendulum experiment. The planet's rotation could thus be clearly demonstrated.

That is a fact which shocks the philosopher, but which the physicist is compelled to accept.

That's a fact that surprises the philosopher, but the physicist has to accept it.

We know that from this fact Newton inferred the existence of absolute space; I myself am quite unable to adopt this view. I shall explain why in Part III. For the moment it is not my intention to enter upon this difficulty.

We know that from this fact, Newton inferred the existence of absolute space; I personally cannot accept this view. I will explain why in Part III. For now, I don’t intend to dive into this issue.

Therefore I must resign myself, in the enunciation of the law of relativity, to including velocities of every kind among the data which define the state of the bodies.

Therefore, I must accept that, in stating the law of relativity, I need to include velocities of all kinds among the information that defines the condition of the bodies.

However that may be, this difficulty is the same for Euclid's geometry as for Lobachevski's; I therefore need not trouble myself with it, and have only mentioned it incidentally.[Pg 86]

However that may be, this issue is the same for Euclid's geometry as for Lobachevski's; I therefore don’t need to worry about it and have only brought it up in passing.[Pg 86]

What is important is the conclusion: experiment can not decide between Euclid and Lobachevski.

What matters is the conclusion: experiments can’t choose between Euclid and Lobachevsky.

To sum up, whichever way we look at it, it is impossible to discover in geometric empiricism a rational meaning.

To sum up, no matter how we examine it, it's impossible to find a rational meaning in geometric empiricism.

6. Experiments only teach us the relations of bodies to one another; none of them bears or can bear on the relations of bodies with space, or on the mutual relations of different parts of space.

6. Experiments only show us how bodies relate to each other; none of them provides insight into how bodies relate to space, or how different parts of space relate to each other.

"Yes," you reply, "a single experiment is insufficient, because it gives me only a single equation with several unknowns; but when I shall have made enough experiments I shall have equations enough to calculate all my unknowns."

"Yes," you reply, "one experiment isn't enough because it gives me just one equation with multiple unknowns; but once I've done enough experiments, I'll have enough equations to figure out all my unknowns."

To know the height of the mainmast does not suffice for calculating the age of the captain. When you have measured every bit of wood in the ship you will have many equations, but you will know his age no better. All your measurements bearing only on your bits of wood can reveal to you nothing except concerning these bits of wood. Just so your experiments, however numerous they may be, bearing only on the relations of bodies to one another, will reveal to us nothing about the mutual relations of the various parts of space.

To know the height of the main mast isn’t enough to figure out the captain’s age. Even if you measure every piece of wood in the ship, you’ll have plenty of equations, but you still won’t know how old he is. All your measurements related to those pieces of wood will only tell you about the wood itself. Similarly, no matter how many experiments you conduct regarding how bodies relate to one another, they won’t tell us anything about the relationships between different parts of space.

7. Will you say that if the experiments bear on the bodies, they bear at least upon the geometric properties of the bodies? But, first, what do you understand by geometric properties of the bodies? I assume that it is a question of the relations of the bodies with space; these properties are therefore inaccessible to experiments which bear only on the relations of the bodies to one another. This alone would suffice to show that there can be no question of these properties.

7. Are you saying that if the experiments involve the bodies, they at least relate to the geometric properties of the bodies? But first, what do you mean by the geometric properties of the bodies? I take it you’re referring to the relationships between the bodies and space; these properties are not something that can be tested by experiments that only focus on the relationships between the bodies themselves. This alone would be enough to demonstrate that there’s no question of these properties.

Still let us begin by coming to an understanding about the sense of the phrase: geometric properties of bodies. When I say a body is composed of several parts, I assume that I do not enunciate therein a geometric property, and this would remain true even if I agreed to give the improper name of points to the smallest parts I consider.

Still, let’s start by agreeing on what the phrase "geometric properties of bodies" means. When I say a body is made up of several parts, I assume that I'm not stating a geometric property, and this would still be the case even if I consented to call the smallest parts I consider "points," even though that's not the proper name for them.

When I say that such a part of such a body is in contact with such a part of such another body, I enunciate a proposition which concerns the mutual relations of these two bodies and not their relations with space.[Pg 87]

When I say that one part of a body is touching a part of another body, I'm making a statement about how these two bodies relate to each other, not how they relate to space.[Pg 87]

I suppose you will grant me these are not geometric properties; at least I am sure you will grant me these properties are independent of all knowledge of metric geometry.

I guess you’ll agree that these aren’t geometric properties; at the very least, I’m sure you’ll agree that these properties don’t depend on any understanding of metric geometry.

This presupposed, I imagine that we have a solid body formed of eight slender iron rods, OA, OB, OC, OD, OE, OF, OG, OH, united at one of their extremities O. Let us besides have a second solid body, for example a bit of wood, to be marked with three little flecks of ink which I shall call α, β, γ. I further suppose it ascertained that αβγ may be brought into contact with AGO (I mean α with A, and at the same time β with G and γ with O), then that we may bring successively into contact αβγ with BGO, CGO, DGO, EGO, FGO, then with AHO, BHO, CHO, DHO, EHO, FHO, then αγ successively with AB, BC, CD, DE, EF, FA.

This assumed, I envision we have a solid structure made up of eight thin iron rods, OA, OB, OC, OD, OE, OF, OG, OH, connected at one end O. Additionally, let's have a second solid object, like a piece of wood, marked with three small dots of ink which I will refer to as α, β, γ. I further suppose that we've confirmed that αβγ can be brought into contact with AGO (meaning α with A, and at the same time β with G and γ with O), then that we can successively bring αβγ into contact with BGO, CGO, DGO, EGO, FGO, and then with AHO, BHO, CHO, DHO, EHO, FHO, then αγ successively with AB, BC, CD, DE, EF, FA.

These are determinations we may make without having in advance any notion about form or about the metric properties of space. They in no wise bear on the 'geometric properties of bodies.' And these determinations will not be possible if the bodies experimented upon move in accordance with a group having the same structure as the Lobachevskian group (I mean according to the same laws as solid bodies in Lobachevski's geometry). They suffice therefore to prove that these bodies move in accordance with the Euclidean group, or at least that they do not move according to the Lobachevskian group.

These are decisions we can make without any prior understanding of the shape or the measurement properties of space. They don’t relate to the 'geometric properties of objects.' Additionally, these determinations won’t be possible if the objects being tested move according to a group that has the same structure as the Lobachevskian group (I mean under the same laws as solid objects in Lobachevski's geometry). Therefore, they are enough to demonstrate that these objects move according to the Euclidean group, or at least that they do not move according to the Lobachevskian group.

That they are compatible with the Euclidean group is easy to see. For they could be made if the body αβγ was a rigid solid of our ordinary geometry presenting the form of a right-angled triangle, and if the points ABCDEFGH were the summits of a polyhedron formed of two regular hexagonal pyramids of our ordinary geometry, having for common base ABCDEF and for apices the one G and the other H.

That they are compatible with the Euclidean group is obvious. They could be created if the body αβγ were a rigid shape in our usual geometry that looks like a right-angled triangle, and if the points ABCDEFGH were the peaks of a polyhedron made up of two regular hexagonal pyramids in our usual geometry, sharing a common base of ABCDEF and having vertices G and H as the tips.

Suppose now that in place of the preceding determination it is observed that as above αβγ can be successively applied to AGO, BGO, CGO, DGO, EGO, AHO, BHO, CHO, DHO, EHO, FHO, then that αβ (and no longer αγ) can be successively applied to AB, BC, CD, DE, EF and FA.

Suppose now that instead of the earlier determination, we see that, as mentioned above, αβγ can be applied in sequence to AGO, BGO, CGO, DGO, EGO, AHO, BHO, CHO, DHO, EHO, FHO. Then, αβ (and no longer αγ) can be applied in sequence to AB, BC, CD, DE, EF, and FA.

These are determinations which could be made if non-Euclidean geometry were true, if the bodies αβγ and OABCDEFGH were rigid solids, and if the first were a right-angled triangle[Pg 88] and the second a double regular hexagonal pyramid of suitable dimensions.

These are decisions that could be made if non-Euclidean geometry were accurate, if the bodies αβγ and OABCDEFGH were solid and rigid, and if the first was a right-angled triangle[Pg 88] and the second was a double regular hexagonal pyramid with appropriate dimensions.

Therefore these new determinations are not possible if the bodies move according to the Euclidean group; but they become so if it be supposed that the bodies move according to the Lobachevskian group. They would suffice, therefore (if one made them), to prove that the bodies in question do not move according to the Euclidean group.

Therefore, these new determinations aren’t possible if the bodies move according to the Euclidean group; however, they become possible if we assume that the bodies move according to the Lobachevskian group. Thus, they would be enough (if we established them) to prove that the bodies in question do not move according to the Euclidean group.

Thus, without making any hypothesis about form, about the nature of space, about the relations of bodies to space, and without attributing to bodies any geometric property, I have made observations which have enabled me to show in one case that the bodies experimented upon move according to a group whose structure is Euclidean, in the other case that they move according to a group whose structure is Lobachevskian.

Thus, without making any assumptions about shape, the nature of space, or how objects relate to space, and without giving any geometric properties to those objects, I've made observations that have allowed me to demonstrate in one instance that the objects in the experiment move according to a framework that is Euclidean, and in the other instance that they move according to a framework that is Lobachevskian.

And one may not say that the first aggregate of determinations would constitute an experiment proving that space is Euclidean, and the second an experiment proving that space is non-Euclidean.

And one cannot say that the first collection of determinations would make up an experiment proving that space is Euclidean, and the second an experiment proving that space is non-Euclidean.

In fact one could imagine (I say imagine) bodies moving so as to render possible the second series of determinations. And the proof is that the first mechanician met could construct such bodies if he cared to take the pains and make the outlay. You will not conclude from that, however, that space is non-Euclidean.

In fact, one could envision (I say envision) bodies moving in a way that makes the second series of determinations possible. And the proof is that the first mechanic met could create such bodies if they were willing to put in the effort and expense. However, you should not conclude from that that space is non-Euclidean.

Nay, since the ordinary solid bodies would continue to exist when the mechanician had constructed the strange bodies of which I have just spoken, it would be necessary to conclude that space is at the same time Euclidean and non-Euclidean.

No, since the usual solid objects would still exist when the mechanic had created the strange objects I've just mentioned, it would be necessary to conclude that space is both Euclidean and non-Euclidean at the same time.

Suppose, for example, that we have a great sphere of radius R and that the temperature decreases from the center to the surface of this sphere according to the law of which I have spoken in describing the non-Euclidean world.

Suppose, for example, that we have a large sphere with a radius R and that the temperature drops from the center to the surface of this sphere according to the law I mentioned while describing the non-Euclidean world.

We might have bodies whose expansion would be negligible and which would act like ordinary rigid solids; and, on the other hand, bodies very dilatable and which would act like non-Euclidean solids. We might have two double pyramids OABCDEFGH and O´A´B´C´D´E´F´G´H´ and two triangles αβγ and α´β´γ´. The first double pyramid might be rectilinear and the second[Pg 89] curvilinear; the triangle αβγ might be made of inexpansible matter and the other of a very dilatable matter.

We could have bodies that barely expand and behave like regular rigid solids, and on the other hand, bodies that are highly expandable and act like non-Euclidean solids. Imagine two double pyramids OABCDEFGH and O´A´B´C´D´E´F´G´H´, along with two triangles αβγ and α´β´γ´. The first double pyramid could be straight-edged while the second [Pg 89] is curved; the triangle αβγ could be made of material that doesn’t expand, while the other could be made of a material that expands a lot.

It would then be possible to make the first observations with the double pyramid OAH and the triangle αβγ, and the second with the double pyramid O´A´H´ and the triangle α´β´γ´. And then experiment would seem to prove first that the Euclidean geometry is true and then that it is false.

It would then be possible to make the first observations with the double pyramid OAH and the triangle αβγ, and the second with the double pyramid O´A´H´ and the triangle α´β´γ´. And then experiments would seem to prove first that Euclidean geometry is true and then that it is false.

Experiments therefore have a bearing, not on space, but on bodies.

Experiments are related, not to space, but to objects.

Supplement

Add-on

8. To complete the matter, I ought to speak of a very delicate question, which would require long development; I shall confine myself to summarizing here what I have expounded in the Revue de Métaphysique et de Morale and in The Monist. When we say space has three dimensions, what do we mean?

8. To wrap things up, I need to address a very sensitive question that needs a lot more exploration; I’ll just summarize what I laid out in the Revue de Métaphysique et de Morale and in The Monist. When we say space has three dimensions, what do we actually mean?

We have seen the importance of those 'internal changes' revealed to us by our muscular sensations. They may serve to characterize the various attitudes of our body. Take arbitrarily as origin one of these attitudes A. When we pass from this initial attitude to any other attitude B, we feel a series of muscular sensations, and this series S will define B. Observe, however, that we shall often regard two series S and as defining the same attitude B (since the initial and final attitudes A and B remaining the same, the intermediary attitudes and the corresponding sensations may differ). How then shall we recognize the equivalence of these two series? Because they may serve to compensate the same external change, or more generally because, when it is a question of compensating an external change, one of the series can be replaced by the other. Among these series, we have distinguished those which of themselves alone can compensate an external change, and which we have called 'displacements.' As we can not discriminate between two displacements which are too close together, the totality of these displacements presents the characteristics of a physical continuum; experience teaches us that they are those of a physical continuum of six dimensions; but we do not yet know how many dimensions space itself has, we must first solve another question.

We have seen how important those 'internal changes' are, revealed to us by our muscle sensations. They help define the different attitudes of our body. Let's arbitrarily choose one of these attitudes A as our starting point. When we move from this initial attitude to any other attitude B, we experience a series of muscle sensations, and this series S will characterize B. However, it's important to note that we often see two series S and as defining the same attitude B (since the initial and final attitudes A and B stay the same, the intermediate attitudes and their corresponding sensations may differ). So, how do we recognize the equivalence of these two series? Because they can both compensate for the same external change, or more generally, when it comes to compensating for an external change, one series can be switched out for the other. Among these series, we've identified those that alone can compensate for an external change, which we call 'displacements.' Since we can't distinguish between two displacements that are too close together, the totality of these displacements shows the properties of a physical continuum; experience tells us that they have the characteristics of a six-dimensional physical continuum; but we still don't know how many dimensions space itself has, as we first need to solve another question.

What is a point of space? Everybody thinks he knows, but[Pg 90] that is an illusion. What we see when we try to represent to ourselves a point of space is a black speck on white paper, a speck of chalk on a blackboard, always an object. The question should therefore be understood as follows:

What is a point in space? Everyone thinks they know, but[Pg 90] that's an illusion. What we see when we try to visualize a point in space is a black dot on white paper, a chalk mark on a blackboard, always an object. So, the question should be understood like this:

What do I mean when I say the object B is at the same point that the object A occupied just now? Or further, what criterion will enable me to apprehend this?

What do I mean when I say that object B is in the same place that object A was just now? Or, more importantly, what standard will help me understand this?

I mean that, although I have not budged (which my muscular sense tells me), my first finger which just now touched the object A touches at present the object B. I could have used other criteria; for instance another finger or the sense of sight. But the first criterion is sufficient; I know that if it answers yes, all the other criteria will give the same response. I know it by experience, I can not know it a priori. For the same reason I say that touch can not be exercised at a distance; this is another way of enunciating the same experimental fact. And if, on the contrary, I say that sight acts at a distance, it means that the criterion furnished by sight may respond yes while the others reply no.

I mean that, even though I haven’t moved (which my body tells me), my index finger that just touched object A is now touching object B. I could have used other senses; for example, another finger or my sense of sight. But the first sense is enough; I know that if it confirms yes, all the other senses will give the same answer. I know this from experience, not a priori. For the same reason, I say that touch can’t be experienced from a distance; this is just another way of stating the same experimental fact. Conversely, when I say that sight works at a distance, it means that the answer from sight might be yes while the others say no.

And in fact, the object, although moved away, may form its image at the same point of the retina. Sight responds yes, the object has remained at the same point and touch answers no, because my finger which just now touched the object, at present touches it no longer. If experience had shown us that one finger may respond no when the other says yes, we should likewise say that touch acts at a distance.

And actually, the object, even when it's moved away, can still create its image on the same spot of the retina. Vision confirms that the object is still in the same place, while touch contradicts that because my finger, which just touched the object, is no longer touching it now. If our experience had shown us that one finger can say no while the other says yes, we might also say that touch operates over a distance.

In short, for each attitude of my body, my first finger determines a point, and this it is, and this alone, which defines a point of space.

In short, for every position of my body, my index finger marks a spot, and it's this, and only this, that defines a point in space.

To each attitude corresponds thus a point; but it often happens that the same point corresponds to several different attitudes (in this case we say our finger has not budged, but the rest of the body has moved). We distinguish, therefore, among the changes of attitude those where the finger does not budge. How are we led thereto? It is because often we notice that in these changes the object which is in contact with the finger remains in contact with it.

To each attitude corresponds a specific point; however, it's common for the same point to relate to several different attitudes (in this situation, we say our finger hasn't moved, but the rest of our body has shifted). Therefore, we differentiate among the changes in attitude where the finger stays still. How do we come to this conclusion? It's often because we observe that in these changes, the object touching the finger remains in contact with it.

Range, therefore, in the same class all the attitudes obtainable from each other by one of the changes we have thus distinguished.[Pg 91] To all the attitudes of the class will correspond the same point of space. Therefore to each class will correspond a point and to each point a class. But one may say that what experience arrives at is not the point, it is this class of changes or, better, the corresponding class of muscular sensations.

Range, then, includes all the attitudes that can be achieved from one another through the changes we’ve identified.[Pg 91] Each class of attitudes will correspond to the same point in space. So for each class, there’s a point, and for each point, there’s a class. However, one could argue that what experience actually reaches is not the point itself, but this class of changes or, more accurately, the related class of muscular sensations.

And when we say space has three dimensions, we simply mean that the totality of these classes appears to us with the characteristics of a physical continuum of three dimensions.

And when we say space has three dimensions, we just mean that all of these classes show up to us like a physical three-dimensional continuum.

One might be tempted to conclude that it is experience which has taught us how many dimensions space has. But in reality here also our experiences have bearing, not on space, but on our body and its relations with the neighboring objects. Moreover they are excessively crude.

One might be tempted to think that experience has shown us how many dimensions space has. But in reality, our experiences relate not to space itself but to our bodies and their relationships with nearby objects. Furthermore, these experiences are rather simplistic.

In our mind pre-existed the latent idea of a certain number of groups—those whose theory Lie has developed. Which group shall we choose, to make of it a sort of standard with which to compare natural phenomena? And, this group chosen, which of its sub-groups shall we take to characterize a point of space? Experience has guided us by showing us which choice best adapts itself to the properties of our body. But its rôle is limited to that.

In our minds, there was already a hidden idea of a certain number of groups—those that Lie's theory has developed. Which group should we choose to serve as a kind of standard for comparing natural phenomena? And once we choose that group, which of its sub-groups should we select to define a point in space? Experience has helped us by showing which choice fits best with the properties of our body. But its role is limited to that.

Heritage Journey

It has often been said that if individual experience could not create geometry the same is not true of ancestral experience. But what does that mean? Is it meant that we could not experimentally demonstrate Euclid's postulate, but that our ancestors have been able to do it? Not in the least. It is meant that by natural selection our mind has adapted itself to the conditions of the external world, that it has adopted the geometry most advantageous to the species: or in other words the most convenient. This is entirely in conformity with our conclusions; geometry is not true, it is advantageous.

It’s often said that while personal experience can’t shape geometry, ancestral experience can. But what does that really mean? Is it suggesting that we can’t prove Euclid's postulate through experimentation, but our ancestors could? Not at all. It means that our minds have adapted through natural selection to fit the conditions of the external world, and that we’ve adopted the geometry that is most beneficial to our species: in other words, the most convenient. This aligns perfectly with our conclusions; geometry isn’t about truth, it’s about advantage.


PART III

FORCE


CHAPTER VI

Classic Mechanics

The English teach mechanics as an experimental science; on the continent it is always expounded as more or less a deductive and a priori science. The English are right, that goes without saying; but how could the other method have been persisted in so long? Why have the continental savants who have sought to get out of the ruts of their predecessors been usually unable to free themselves completely?

The English teach mechanics as an experimental science; on the continent, it is usually presented as more of a deductive and a priori science. The English are right, that’s for sure; but how could the other method have lasted so long? Why have continental scholars who tried to break away from their predecessors often been unable to fully free themselves?

On the other hand, if the principles of mechanics are only of experimental origin, are they not therefore only approximate and provisional? Might not new experiments some day lead us to modify or even to abandon them?

On the other hand, if the principles of mechanics come solely from experiments, are they not just approximate and temporary? Could new experiments someday cause us to change or even discard them?

Such are the questions which naturally obtrude themselves, and the difficulty of solution comes principally from the fact that the treatises on mechanics do not clearly distinguish between what is experiment, what is mathematical reasoning, what is convention, what is hypothesis.

Such are the questions that naturally come to mind, and the difficulty in finding answers mainly arises from the fact that the discussions on mechanics do not clearly differentiate between what is an experiment, what is mathematical reasoning, what is convention, and what is hypothesis.

That is not all:

That's not all:

1º There is no absolute space and we can conceive only of relative motions; yet usually the mechanical facts are enunciated as if there were an absolute space to which to refer them.

1º There is no such thing as absolute space, and we can only understand relative motions; however, mechanical facts are often stated as if there were an absolute space to use as a reference.

2º There is no absolute time; to say two durations are equal is an assertion which has by itself no meaning and which can acquire one only by convention.

2º There is no absolute time; saying two durations are equal is a statement that, by itself, has no meaning and can only gain meaning through convention.

3º Not only have we no direct intuition of the equality of two durations, but we have not even direct intuition of the[Pg 93] simultaneity of two events occurring in different places: this I have explained in an article entitled La mesure du temps.[3]

3º Not only do we not have a direct understanding of the equality of two durations, but we also don't have a direct understanding of the simultaneity of two events happening in different locations: I've explained this in an article titled La mesure du temps.[3]

4º Finally, our Euclidean geometry is itself only a sort of convention of language; mechanical facts might be enunciated with reference to a non-Euclidean space which would be a guide less convenient than, but just as legitimate as, our ordinary space; the enunciation would thus become much more complicated, but it would remain possible.

4º Finally, our Euclidean geometry is really just a way of using language; mechanical facts could be expressed in terms of a non-Euclidean space, which would be less convenient than our usual space but just as valid. The expression would become much more complex, but it would still be possible.

Thus absolute space, absolute time, geometry itself, are not conditions which impose themselves on mechanics; all these things are no more antecedent to mechanics than the French language is logically antecedent to the verities one expresses in French.

Thus, absolute space, absolute time, and geometry aren’t conditions that impose themselves on mechanics; all of these are no more prior to mechanics than the French language is logically prior to the truths expressed in French.

We might try to enunciate the fundamental laws of mechanics in a language independent of all these conventions; we should thus without doubt get a better idea of what these laws are in themselves; this is what M. Andrade has attempted to do, at least in part, in his Leçons de mécanique physique.

We could try to express the basic laws of mechanics in a way that's not tied to any of these conventions; this would definitely give us a clearer understanding of what these laws are in their essence. This is what M. Andrade has tried to do, at least to some extent, in his Leçons de mécanique physique.

The enunciation of these laws would become of course much more complicated, because all these conventions have been devised expressly to abridge and simplify this enunciation.

The explanation of these laws would obviously become much more complicated, since all these conventions were specifically designed to shorten and simplify this explanation.

As for me, save in what concerns absolute space, I shall ignore all these difficulties; not that I fail to appreciate them, far from that; but we have sufficiently examined them in the first two parts of the book.

As for me, except for everything related to absolute space, I will ignore all these difficulties; not that I don’t recognize them, far from it; but we have already looked at them in detail in the first two parts of the book.

I shall therefore admit, provisionally, absolute time and Euclidean geometry.

I will therefore accept, provisionally, absolute time and Euclidean geometry.

The Principle of Inertia.—A body acted on by no force can only move uniformly in a straight line.

The Law of Inertia.—An object that is not subjected to any force can only move in a straight line at a constant speed.

Is this a truth imposed a priori upon the mind? If it were so, how should the Greeks have failed to recognize it? How could they have believed that motion stops when the cause which gave birth to it ceases? Or again that every body if nothing prevents, will move in a circle, the noblest of motions?

Is this a truth that is forced a priori on the mind? If it were, how could the Greeks have missed it? How could they have thought that motion stops when the cause that created it ends? Or that every object, if there's nothing stopping it, will move in a circle, the greatest of motions?

If it is said that the velocity of a body can not change if there is no reason for it to change, could it not be maintained just as well that the position of this body can not change, or that the[Pg 94] curvature of its trajectory can not change, if no external cause intervenes to modify them?

If it's said that the speed of an object can't change without a reason for it to change, can't it also be said that the object's position can't change, or that the[Pg 94] curve of its path can't change, unless an external factor comes in to alter them?

Is the principle of inertia, which is not an a priori truth, therefore an experimental fact? But has any one ever experimented on bodies withdrawn from the action of every force? and, if so, how was it known that these bodies were subjected to no force? The example ordinarily cited is that of a ball rolling a very long time on a marble table; but why do we say it is subjected to no force? Is this because it is too remote from all other bodies to experience any appreciable action from them? Yet it is not farther from the earth than if it were thrown freely into the air; and every one knows that in this case it would experience the influence of gravity due to the attraction of the earth.

Is the principle of inertia, which isn't an a priori truth, actually an experimental fact? But has anyone truly conducted experiments on objects that are completely free from the influence of any force? And if they did, how would we know that these objects weren't affected by any force? The common example is of a ball rolling for a long time on a marble table; but why do we claim it's not influenced by any force? Is it because it’s too far away from other objects to feel their effects? Still, it’s not any farther from the Earth than if it were tossed freely into the air; and everyone knows that in that case, it would still be affected by gravity due to the Earth's attraction.

Teachers of mechanics usually pass rapidly over the example of the ball; but they add that the principle of inertia is verified indirectly by its consequences. They express themselves badly; they evidently mean it is possible to verify various consequences of a more general principle, of which that of inertia is only a particular case.

Teachers of mechanics often quickly move past the example of the ball; however, they mention that the principle of inertia is confirmed indirectly through its effects. They don't communicate this clearly; what they really mean is that it's possible to verify various consequences of a broader principle, of which inertia is just one specific instance.

I shall propose for this general principle the following enunciation:

I will propose the following statement for this general principle:

The acceleration of a body depends only upon the position of this body and of the neighboring bodies and upon their velocities.

The acceleration of an object depends solely on its position, the positions of nearby objects, and their velocities.

Mathematicians would say the movements of all the material molecules of the universe depend on differential equations of the second order.

Mathematicians would say that the movement of all the material molecules in the universe relies on second-order differential equations.

To make it clear that this is really the natural generalization of the law of inertia, I shall beg you to permit me a bit of fiction. The law of inertia, as I have said above, is not imposed upon us a priori; other laws would be quite as compatible with the principle of sufficient reason. If a body is subjected to no force, in lieu of supposing its velocity not to change, it might be supposed that it is its position or else its acceleration which is not to change.

To clarify that this is truly just a natural extension of the law of inertia, I’d like to ask you to allow me a little imaginative reasoning. The law of inertia, as I mentioned earlier, isn’t something we just accept without question; other laws could also fit well with the principle of sufficient reason. If an object isn’t acted upon by any force, instead of assuming its velocity remains constant, one could also assume that either its position or its acceleration stays the same.

Well, imagine for an instant that one of these two hypothetical laws is a law of nature and replaces our law of inertia. What would be its natural generalization? A moment's thought will show us.[Pg 95]

Well, just think for a moment that one of these two hypothetical laws is a law of nature and takes the place of our law of inertia. What would its natural generalization be? A moment's reflection will reveal that.[Pg 95]

In the first case, we must suppose that the velocity of a body depends only upon its position and upon that of the neighboring bodies; in the second case that the change of acceleration of a body depends only upon the position of this body and of the neighboring bodies, upon their velocities and upon their accelerations.

In the first scenario, we have to assume that a body's speed only depends on its location and the positions of nearby bodies; in the second scenario, that a body's change in acceleration only depends on its own position and the positions of nearby bodies, as well as their speeds and accelerations.

Or to speak the language of mathematics, the differential equations of motion would be of the first order in the first case, and of the third order in the second case.

Or to put it in mathematical terms, the motion differential equations would be first order in the first case and third order in the second case.

Let us slightly modify our fiction. Suppose a world analogous to our solar system, but where, by a strange chance, the orbits of all the planets are without eccentricity and without inclination. Suppose further that the masses of these planets are too slight for their mutual perturbations to be sensible. Astronomers inhabiting one of these planets could not fail to conclude that the orbit of a star can only be circular and parallel to a certain plane; the position of a star at a given instant would then suffice to determine its velocity and its whole path. The law of inertia which they would adopt would be the first of the two hypothetical laws I have mentioned.

Let’s make a small adjustment to our story. Imagine a world similar to our solar system, but by some weird happenstance, all the planets' orbits are perfectly circular and perfectly aligned. Let's also assume the masses of these planets are so small that their gravitational effects on each other are negligible. Astronomers living on one of these planets would inevitably conclude that a star's orbit must be circular and in line with a specific plane; knowing a star's position at any moment would be enough to figure out its speed and entire trajectory. The law of inertia they would adopt would be the first of the two theoretical laws I've mentioned.

Imagine now that this system is some day traversed with great velocity by a body of vast mass, coming from distant constellations. All the orbits would be profoundly disturbed. Still our astronomers would not be too greatly astonished; they would very well divine that this new star was alone to blame for all the mischief. "But," they would say, "when it is gone, order will of itself be reestablished; no doubt the distances of the planets from the sun will not revert to what they were before the cataclysm, but when the perturbing star is gone, the orbits will again become circular."

Imagine that one day, a massive body from faraway star systems travels through this system at a high speed. The orbits would be significantly disrupted. However, our astronomers wouldn’t be too surprised; they would quickly figure out that this new star was the cause of all the chaos. "But," they would say, "once it passes, everything will return to normal on its own; surely the distances of the planets from the sun won’t go back to what they were before the upheaval, but after the disruptive star is gone, the orbits will become circular again."

It would only be when the disturbing body was gone and when nevertheless the orbits, in lieu of again becoming circular, became elliptic, that these astronomers would become conscious of their error and the necessity of remaking all their mechanics.

It wasn't until the unsettling body disappeared and, instead of returning to circular orbits, they became elliptical that these astronomers would realize their mistake and the need to redo all their mechanics.

I have dwelt somewhat upon these hypotheses because it seems to me one can clearly comprehend what our generalized law of inertia really is only in contrasting it with a contrary hypothesis.

I have spent some time on these ideas because it seems to me that one can truly understand what our overall law of inertia is only by comparing it with an opposing idea.

Well, now, has this generalized law of inertia been verified by[Pg 96] experiment, or can it be? When Newton wrote the Principia he quite regarded this truth as experimentally acquired and demonstrated. It was so in his eyes, not only through the anthropomorphism of which we shall speak further on, but through the work of Galileo. It was so even from Kepler's laws themselves; in accordance with these laws, in fact, the path of a planet is completely determined by its initial position and initial velocity; this is just what our generalized law of inertia requires.

Well, has this general law of inertia been proven by[Pg 96]experiment, or is it possible to prove it? When Newton wrote the Principia, he considered this truth to be tested and confirmed. He believed this not only because of the anthropomorphism we'll discuss later, but also due to Galileo's work. It was also evident from Kepler's laws; according to these laws, the path of a planet is entirely determined by its starting position and initial velocity, which is exactly what our general law of inertia states.

For this principle to be only in appearance true, for one to have cause to dread having some day to replace it by one of the analogous principles I have just now contrasted with it, would be necessary our having been misled by some amazing chance, like that which, in the fiction above developed, led into error our imaginary astronomers.

For this principle to seem true only on the surface, and for someone to fear having to eventually replace it with one of the similar principles I just compared it to, it would require us to have been deceived by some incredible coincidence, like the one that, in the story I mentioned earlier, misled our fictional astronomers.

Such a hypothesis is too unlikely to delay over. No one will believe that such coincidences can happen; no doubt the probability of two eccentricities being both precisely null, to within errors of observation, is not less than the probability of one being precisely equal to 0.1, for instance, and the other to 0.2, to within errors of observation. The probability of a simple event is not less than that of a complicated event; and yet, if the first happens, we shall not consent to attribute it to chance; we should not believe that nature had acted expressly to deceive us. The hypothesis of an error of this sort being discarded, it may therefore be admitted that in so far as astronomy is concerned, our law has been verified by experiment.

Such a hypothesis is too unlikely to dwell on. No one will believe that such coincidences can occur; the chances of two unusual things both being exactly null, within the limits of observational errors, are definitely not less than the chances of one being exactly 0.1, for example, and the other 0.2, within those same errors. The probability of a simple event isn't less than that of a complicated one; yet, if the first occurs, we won't agree to attribute it to chance; we shouldn't believe that nature had acted just to mislead us. With the idea of this kind of error ruled out, it's reasonable to say that as far as astronomy goes, our law has been confirmed by experiments.

But astronomy is not the whole of physics.

But astronomy isn't the entire field of physics.

May we not fear lest some day a new experiment should come to falsify the law in some domain of physics? An experimental law is always subject to revision; one should always expect to see it replaced by a more precise law.

May we not be afraid that someday a new experiment could prove a law wrong in some area of physics? An experimental law is always open to change; we should always be ready to see it replaced by a more accurate law.

Yet no one seriously thinks that the law we are speaking of will ever be abandoned or amended. Why? Precisely because it can never be subjected to a decisive test.

Yet no one truly believes that the law we’re talking about will ever be discarded or changed. Why? Because it can never undergo a definitive test.

First of all, in order that this trial should be complete, it would be necessary that after a certain time all the bodies in the universe should revert to their initial positions with their initial[Pg 97] velocities. It might then be seen whether, starting from this moment, they would resume their original paths.

First of all, for this trial to be complete, it would be necessary for all the bodies in the universe to return to their original positions with their initial[Pg 97] velocities after a certain time. It could then be observed whether, starting from that moment, they would follow their original paths again.

But this test is impossible, it can be only partially applied, and, however well it is made, there will always be some bodies which will not revert to their initial positions; thus every derogation of the law will easily find its explanation.

But this test is impossible; it can only be partially applied. No matter how well it's done, there will always be some cases that won't revert to their initial positions. Therefore, any violation of the law will easily find its justification.

This is not all; in astronomy we see the bodies whose motions we study and we usually assume that they are not subjected to the action of other invisible bodies. Under these conditions our law must indeed be either verified or not verified.

This isn't everything; in astronomy we observe the objects whose movements we study, and we typically assume that they aren't influenced by other unseen bodies. Given these conditions, our law must either be confirmed or not confirmed.

But it is not the same in physics; if the physical phenomena are due to motions, it is to the motions of molecules which we do not see. If then the acceleration of one of the bodies we see appears to us to depend on something else besides the positions or velocities of other visible bodies or of invisible molecules whose existence we have been previously led to admit, nothing prevents our supposing that this something else is the position or the velocity of other molecules whose presence we have not before suspected. The law will find itself safeguarded.

But it’s different in physics; if physical phenomena result from movements, they come from the movements of molecules that we can’t see. So, if the acceleration of one of the visible bodies seems to depend on something else, aside from the positions or speeds of other visible bodies or of unseen molecules we’ve already acknowledged, there's nothing stopping us from suggesting that this something else is the position or the speed of other molecules we haven't noticed before. The law will still hold true.

Permit me to employ mathematical language a moment to express the same thought under another form. Suppose we observe n molecules and ascertain that their 3n coordinates satisfy a system of 3n differential equations of the fourth order (and not of the second order as the law of inertia would require). We know that by introducing 3n auxiliary variables, a system of 3n equations of the fourth order can be reduced to a system of 6n equations of the second order. If then we suppose these 3n auxiliary variables represent the coordinates of n invisible molecules, the result is again in conformity with the law of inertia.

Let me use some math language for a moment to express the same idea in a different way. Imagine we have n molecules and find out that their 3n coordinates fit a set of 3n fourth-order differential equations (instead of the second order like the law of inertia would suggest). We know that by adding 3n extra variables, a set of 3n fourth-order equations can be simplified to a set of 6n second-order equations. If we then assume these 3n extra variables represent the coordinates of n invisible molecules, the outcome still aligns with the law of inertia.

To sum up, this law, verified experimentally in some particular cases, may unhesitatingly be extended to the most general cases, since we know that in these general cases experiment no longer is able either to confirm or to contradict it.

To sum up, this law, proven through experiments in specific instances, can confidently be applied to broader situations, since we know that in these general cases, experiments can no longer confirm or contradict it.

The Law of Acceleration.—The acceleration of a body is equal to the force acting on it divided by its mass. Can this law be verified by experiment? For that it would be necessary to[Pg 98] measure the three magnitudes which figure in the enunciation: acceleration, force and mass.

The Acceleration Law.—The acceleration of an object is equal to the force acting on it divided by its mass. Can we test this law through experimentation? To do so, we would need to[Pg 98] measure the three quantities mentioned: acceleration, force, and mass.

I assume that acceleration can be measured, for I pass over the difficulty arising from the measurement of time. But how measure force, or mass? We do not even know what they are.

I assume that we can measure acceleration since I'm skipping over the challenge of measuring time. But how do we measure force or mass? We don't even know what they actually are.

What is mass? According to Newton, it is the product of the volume by the density. According to Thomson and Tait, it would be better to say that density is the quotient of the mass by the volume. What is force? It is, replies Lagrange, that which moves or tends to move a body. It is, Kirchhoff will say, the product of the mass by the acceleration. But then, why not say the mass is the quotient of the force by the acceleration?

What is mass? According to Newton, it's the product of volume and density. Thomson and Tait would argue it's better to say that density is the mass divided by volume. What is force? Lagrange answers that it's what moves or tries to move an object. Kirchhoff adds that it's the mass multiplied by acceleration. So, why not say that mass is the force divided by acceleration?

These difficulties are inextricable.

These challenges are interconnected.

When we say force is the cause of motion, we talk metaphysics, and this definition, if one were content with it, would be absolutely sterile. For a definition to be of any use, it must teach us to measure force; moreover that suffices; it is not at all necessary that it teach us what force is in itself, nor whether it is the cause or the effect of motion.

When we say that force is what causes motion, we’re diving into metaphysics, and if someone is satisfied with this definition, it ends up being completely unproductive. For a definition to be useful, it needs to help us measure force; that’s all that really matters. It’s not necessary for it to explain what force is in itself, or whether it causes motion or is the result of it.

We must therefore first define the equality of two forces. When shall we say two forces are equal? It is, we are told, when, applied to the same mass, they impress upon it the same acceleration, or when, opposed directly one to the other, they produce equilibrium. This definition is only a sham. A force applied to a body can not be uncoupled to hook it up to another body, as one uncouples a locomotive to attach it to another train. It is therefore impossible to know what acceleration such a force, applied to such a body, would impress upon such another body, if it were applied to it. It is impossible to know how two forces which are not directly opposed would act, if they were directly opposed.

We need to start by defining what we mean by the equality of two forces. When can we say two forces are equal? We’re told it’s when, applied to the same mass, they give it the same acceleration or when, opposing each other directly, they achieve balance. But this definition is misleading. A force applied to one object can’t be separated to connect it to another object, like unhooking a locomotive to attach it to a different train. So, it’s impossible to determine what acceleration that force, applied to that object, would exert on a different object, if it were applied to it. It’s also impossible to predict how two forces that aren’t opposing each other would behave, if they were directly opposing.

It is this definition we try to materialize, so to speak, when we measure a force with a dynamometer, or in balancing it with a weight. Two forces F and , which for simplicity I will suppose vertical and directed upward, are applied respectively to two bodies C and ; I suspend the same heavy body P first to the body C, then to the body ; if equilibrium is produced in both cases, I shall conclude that the two forces F and are equal to[Pg 99] one another, since they are each equal to the weight of the body P.

It is this definition we try to bring to life when we measure a force with a dynamometer or balance it with a weight. Two forces F and , which for simplicity I'll assume are vertical and directed upward, are applied to two bodies C and ; I first hang the same heavy body P from body C, then from body ; if equilibrium is achieved in both situations, I can conclude that the two forces F and are equal to[Pg 99] each other, since they are both equal to the weight of body P.

But am I sure the body P has retained the same weight when I have transported it from the first body to the second? Far from it; I am sure of the contrary; I know the intensity of gravity varies from one point to another, and that it is stronger, for instance, at the pole than at the equator. No doubt the difference is very slight and, in practise, I shall take no account of it; but a properly constructed definition should have mathematical rigor; this rigor is lacking. What I say of weight would evidently apply to the force of the resiliency of a dynamometer, which the temperature and a multitude of circumstances may cause to vary.

But can I be sure that the body P has kept the same weight when I move it from one place to another? Not at all; I’m actually certain of the opposite; I know that the intensity of gravity changes from one location to another, and it’s stronger, for instance, at the poles than at the equator. The difference is probably very small and in practice, I won’t really consider it; but a well-defined concept should be mathematically precise; that precision is missing. What I’m saying about weight would clearly also apply to the force of a dynamometer’s resistance, which can change due to temperature and various other factors.

This is not all; we can not say the weight of the body P may be applied to the body C and directly balance the force F. What is applied to the body C is the action A of the body P on the body C; the body P is submitted on its part, on the one hand, to its weight; on the other hand, to the reaction R of the body C on P. Finally, the force F is equal to the force A, since it balances it; the force A is equal to R, in virtue of the principle of the equality of action and reaction; lastly, the force R is equal to the weight of P, since it balances it. It is from these three equalities we deduce as consequence the equality of F and the weight of P.

This is not all; we cannot say the weight of body P can be applied to body C and directly balance the force F. What is applied to body C is the action A of body P on body C; body P is, on one hand, subject to its weight, and on the other hand, to the reaction R of body C on P. Finally, the force F is equal to force A, since it balances it; force A is equal to R, based on the principle of action and reaction being equal; lastly, force R is equal to the weight of P, since it balances it. From these three equalities, we deduce the equality of F and the weight of P.

We are therefore obliged in the definition of the equality of the two forces to bring in the principle of the equality of action and reaction; on this account, this principle must no longer be regarded as an experimental law, but as a definition.

We are therefore required in defining the equality of the two forces to include the principle of action and reaction; for this reason, this principle should no longer be seen as an experimental law, but as a definition.

For recognizing the equality of two forces here, we are then in possession of two rules: equality of two forces which balance; equality of action and reaction. But, as we have seen above, these two rules are insufficient; we are obliged to have recourse to a third rule and to assume that certain forces, as, for instance, the weight of a body, are constant in magnitude and direction. But this third rule, as I have said, is an experimental law; it is only approximately true; it is a bad definition.

To recognize when two forces are equal, we have two principles: the equality of balancing forces and the equality of action and reaction. However, as we've discussed, these two principles aren't enough; we need a third principle and must assume that certain forces, like the weight of an object, are constant in both size and direction. But this third principle, as I mentioned, is an experimental law; it’s only roughly correct; it's a poor definition.

We are therefore reduced to Kirchhoff's definition; force is equal to the mass multiplied by the acceleration. This 'law of Newton' in its turn ceases to be regarded as an experimental law, it is now only a definition. But this definition is still insufficient,[Pg 100] for we do not know what mass is. It enables us doubtless to calculate the relation of two forces applied to the same body at different instants; it teaches us nothing about the relation of two forces applied to two different bodies.

We are therefore left with Kirchhoff's definition; force is equal to mass multiplied by acceleration. This 'law of Newton' is no longer seen as an experimental law; it has become just a definition. However, this definition is still not enough,[Pg 100] because we still don’t know what mass is. It certainly allows us to calculate the relationship between two forces applied to the same body at different times, but it doesn't explain the relationship between two forces applied to two different bodies.

To complete it, it is necessary to go back anew to Newton's third law (equality of action and reaction), regarded again, not as an experimental law, but as a definition. Two bodies A and B act one upon the other; the acceleration of A multiplied by the mass of A is equal to the action of B upon A; in the same way, the product of the acceleration of B by its mass is equal to the reaction of A upon B. As, by definition, action is equal to reaction, the masses of A and B are in the inverse ratio of their accelerations. Here we have the ratio of these two masses defined, and it is for experiment to verify that this ratio is constant.

To finalize this, we need to return once more to Newton's third law (the equality of action and reaction), viewed not just as an experimental law but as a definition. Two bodies A and B influence each other; the acceleration of A multiplied by the mass of A equals the action of B on A; similarly, the product of the acceleration of B and its mass equals the reaction of A on B. Since action is defined to equal reaction, the masses of A and B are inversely proportional to their accelerations. Here, we establish the ratio of these two masses, and it's up to experiments to prove that this ratio remains constant.

That would be all very well if the two bodies A and B alone were present and removed from the action of the rest of the world. This is not at all the case; the acceleration of A is not due merely to the action of B, but to that of a multitude of other bodies C, D,... To apply the preceding rule, it is therefore necessary to separate the acceleration of A into many components, and discern which of these components is due to the action of B.

That would be great if only the two bodies A and B were involved and isolated from everything else. However, that's not the case; the acceleration of A isn't just caused by B, but by many other bodies C, D, and so on. To use the previous rule, it's essential to break down the acceleration of A into several components and identify which of these components are influenced by B.

This separation would still be possible, if we should assume that the action of C upon A is simply adjoined to that of B upon A, without the presence of the body C modifying the action of B upon A; or the presence of B modifying the action of C upon A; if we should assume, consequently, that any two bodies attract each other, that their mutual action is along their join and depends only upon their distance apart; if, in a word, we assume the hypothesis of central forces.

This separation would still be possible if we assume that the effect of C on A is simply added to that of B on A, without C affecting how B acts on A; or B affecting how C acts on A; if we assume, therefore, that any two bodies attract each other, that their mutual interaction occurs along the line connecting them and depends only on the distance between them; if, in short, we accept the hypothesis of central forces.

You know that to determine the masses of the celestial bodies we use a wholly different principle. The law of gravitation teaches us that the attraction of two bodies is proportional to their masses; if r is their distance apart, m and their masses, k a constant, their attraction will be kmm´/r2.

You know that to find the masses of celestial bodies, we use a completely different principle. The law of gravitation tells us that the attraction between two bodies is proportional to their masses; if r is the distance between them, m and are their masses, and k is a constant, their attraction will be kmm´/r2.

What we are measuring then is not mass, the ratio of force to acceleration, but the attracting mass; it is not the inertia of the body, but its attracting force.

What we're measuring, then, isn't mass, the relation between force and acceleration, but the attracting mass; it's not the body's inertia, but its attracting force.

This is an indirect procedure, whose employment is not[Pg 101] theoretically indispensable. It might very well have been that attraction was inversely proportional to the square of the distance without being proportional to the product of the masses, that it was equal to f/r2, but without our having f = kmm´.

This is an indirect procedure, which is not[Pg 101] theoretically necessary. It could have been that attraction was inversely proportional to the square of the distance without being proportional to the product of the masses, that it was equal to f/r2, but without our having f = kmm´.

If it were so, we could nevertheless, by observation of the relative motions of the heavenly bodies, measure the masses of these bodies.

If that were the case, we could still, by observing the relative motions of the heavenly bodies, measure the masses of these objects.

But have we the right to admit the hypothesis of central forces? Is this hypothesis rigorously exact? Is it certain it will never be contradicted by experiment? Who would dare affirm that? And if we must abandon this hypothesis, the whole edifice so laboriously erected will crumble.

But do we have the right to accept the idea of central forces? Is this idea completely accurate? Can we be sure it will never be disproven by experiments? Who would dare to claim that? And if we have to give up this idea, the entire structure that we've worked so hard to build will collapse.

We have no longer the right to speak of the component of the acceleration of A due to the action of B. We have no means of distinguishing it from that due to the action of C or of another body. The rule for the measurement of masses becomes inapplicable.

We can no longer talk about the part of the acceleration of A caused by B. We can't tell it apart from the acceleration caused by C or any other body. The rule for measuring masses no longer applies.

What remains then of the principle of the equality of action and reaction? If the hypothesis of central forces is rejected, this principle should evidently be enunciated thus: the geometric resultant of all the forces applied to the various bodies of a system isolated from all external action will be null. Or, in other words, the motion of the center of gravity of this system will be rectilinear and uniform.

What’s left of the principle of action and reaction? If we reject the idea of central forces, this principle should clearly be stated like this: the overall effect of all the forces acting on the various bodies in a system that is isolated from any outside influences will be zero. In other words, the motion of the center of gravity of this system will be straight and steady.

There it seems we have a means of defining mass; the position of the center of gravity evidently depends on the values attributed to the masses; it will be necessary to dispose of these values in such a way that the motion of the center of gravity may be rectilinear and uniform; this will always be possible if Newton's third law is true, and possible in general only in a single way.

There we have a way of defining mass; the location of the center of gravity clearly depends on the values assigned to the masses. We need to arrange these values so that the movement of the center of gravity can be straight and uniform. This will always be achievable if Newton's third law holds true, and it will generally only be possible in one specific way.

But there exists no system isolated from all external action; all the parts of the universe are subject more or less to the action of all the other parts. The law of the motion of the center of gravity is rigorously true only if applied to the entire universe.

But there is no system completely separated from outside influences; all parts of the universe are influenced to some extent by all the other parts. The law of the motion of the center of gravity is strictly accurate only when applied to the whole universe.

But then, to get from it the values of the masses, it would be necessary to observe the motion of the center of gravity of the universe. The absurdity of this consequence is manifest; we know only relative motions; the motion of the center of gravity of the universe will remain for us eternally unknown.[Pg 102]

But to find out the masses' values, we would need to observe the movement of the universe's center of gravity. The ridiculousness of this idea is clear; we only understand relative motions, and the movement of the universe's center of gravity will always be unknown to us.[Pg 102]

Therefore nothing remains and our efforts have been fruitless; we are driven to the following definition, which is only an avowal of powerlessness: masses are coefficients it is convenient to introduce into calculations.

Therefore, nothing is left, and our efforts have been in vain; we are led to the following definition, which is simply an admission of helplessness: masses are factors that it's useful to bring into calculations.

We could reconstruct all mechanics by attributing different values to all the masses. This new mechanics would not be in contradiction either with experience or with the general principles of dynamics (principle of inertia, proportionality of forces to masses and to accelerations, equality of action and reaction, rectilinear and uniform motion of the center of gravity, principle of areas).

We could rebuild all of mechanics by assigning different values to all the masses. This new mechanics wouldn't contradict either experience or the general principles of dynamics (the principle of inertia, the proportionality of forces to masses and accelerations, the equality of action and reaction, the straight-line and uniform motion of the center of gravity, and the principle of areas).

Only the equations of this new mechanics would be less simple. Let us understand clearly: it would only be the first terms which would be less simple, that is those experience has already made us acquainted with; perhaps one could alter the masses by small quantities without the complete equations gaining or losing in simplicity.

Only the equations of this new mechanics would be less simple. Let’s be clear: it would only be the initial terms that would be less simple, meaning those we are already familiar with from experience; maybe one could adjust the masses by small amounts without the complete equations becoming more or less simple.

Hertz has raised the question whether the principles of mechanics are rigorously true. "In the opinion of many physicists," he says, "it is inconceivable that the remotest experience should ever change anything in the immovable principles of mechanics; and yet, what comes from experience may always be rectified by experience." After what we have just said, these fears will appear groundless.

Hertz has questioned whether the principles of mechanics are absolutely true. "Many physicists believe," he says, "it's hard to imagine that any distant experience could ever alter the unchanging principles of mechanics; and yet, what we gain from experience can always be corrected by further experience." Given what we've just discussed, these concerns will seem unfounded.

The principles of dynamics at first appeared to us as experimental truths; but we have been obliged to use them as definitions. It is by definition that force is equal to the product of mass by acceleration; here, then, is a principle which is henceforth beyond the reach of any further experiment. It is in the same way by definition that action is equal to reaction.

The principles of dynamics initially seemed like experimental facts; however, we have had to treat them as definitions. It is by definition that force equals mass times acceleration; thus, this is a principle that is no longer subject to further experimentation. Similarly, action is equal to reaction by definition.

But then, it will be said, these unverifiable principles are absolutely devoid of any significance; experiment can not contradict them; but they can teach us nothing useful; then what is the use of studying dynamics?

But then, it will be said, these unverifiable principles have no real significance; experiments can’t contradict them; but they don’t teach us anything useful; so what’s the point of studying dynamics?

This over-hasty condemnation would be unjust. There is not in nature any system perfectly isolated, perfectly removed from all external action; but there are systems almost isolated.

This rushed judgment would be unfair. In nature, there is no system that is perfectly isolated, completely cut off from all external influences; however, there are systems that are almost isolated.

If such a system be observed, one may study not only the[Pg 103] relative motion of its various parts one in reference to another, but also the motion of its center of gravity in reference to the other parts of the universe. We ascertain then that the motion of this center of gravity is almost rectilinear and uniform, in conformity with Newton's third law.

If such a system is observed, one can study not only the[Pg 103] relative motion of its various parts with respect to each other, but also the motion of its center of gravity in relation to the other parts of the universe. We find that the motion of this center of gravity is almost straight and consistent, in accordance with Newton's third law.

That is an experimental truth, but it can not be invalidated by experience; in fact, what would a more precise experiment teach us? It would teach us that the law was only almost true; but that we knew already.

That is a tested truth, but it can't be disproven by experience; in fact, what would a more precise experiment show us? It would show us that the law was only nearly true; but we already knew that.

We can now understand how experience has been able to serve as basis for the principles of mechanics and yet will never be able to contradict them.

We can now see how experience has provided the foundation for the principles of mechanics and yet will never be able to refute them.

Anthropomorphic Mechanics.—"Kirchhoff," it will be said, "has only acted in obedience to the general tendency of mathematicians toward nominalism; from this his ability as a physicist has not saved him. He wanted a definition of force, and he took for it the first proposition that presented itself; but we need no definition of force: the idea of force is primitive, irreducible, indefinable; we all know what it is, we have a direct intuition of it. This direct intuition comes from the notion of effort, which is familiar to us from infancy."

Personified Mechanics.—"People will say that Kirchhoff only followed the general trend among mathematicians towards nominalism; this didn't protect him as a physicist. He sought a definition of force and grabbed the first idea that came to him. However, we don’t actually need a definition of force: the concept of force is basic, fundamental, and cannot be defined. We all understand what it is; we have an immediate sense of it. This immediate sense comes from our understanding of effort, something we’ve known since we were kids."

But first, even though this direct intuition made known to us the real nature of force in itself, it would be insufficient as a foundation for mechanics; it would besides be wholly useless. What is of importance is not to know what force is, but to know how to measure it.

But first, even though this direct intuition revealed the true nature of force itself, it wouldn’t be enough as a foundation for mechanics; it would also be completely pointless. What really matters isn’t knowing what force is, but knowing how to measure it.

Whatever does not teach us to measure it is as useless to mechanics as is, for instance, the subjective notion of warmth and cold to the physicist who is studying heat. This subjective notion can not be translated into numbers, therefore it is of no use; a scientist whose skin was an absolutely bad conductor of heat and who, consequently, would never have felt either sensations of cold or sensations of warmth, could read a thermometer just as well as any one else, and that would suffice him for constructing the whole theory of heat.

Whatever doesn't help us measure it is as useless to mechanics as the subjective ideas of warmth and cold are to a physicist studying heat. These subjective ideas can't be turned into numbers, so they aren’t useful; a scientist whose skin was a terrible conductor of heat and, as a result, never felt cold or warm sensations could read a thermometer just as well as anyone else, and that would be enough for him to build the entire theory of heat.

Now this immediate notion of effort is of no use to us for measuring force; it is clear, for instance, that I should feel more[Pg 104] fatigue in lifting a weight of fifty kilos than a man accustomed to carry burdens.

Now this direct idea of effort isn't useful for measuring force; it's obvious, for example, that I'd feel more [Pg 104] tired lifting a fifty-kilo weight than someone used to carrying heavy loads.

But more than that: this notion of effort does not teach us the real nature of force; it reduces itself finally to a remembrance of muscular sensations, and it will hardly be maintained that the sun feels a muscular sensation when it draws the earth.

But more than that: this idea of effort doesn’t show us the true nature of force; it ultimately boils down to a memory of muscle sensations, and it’s hard to argue that the sun experiences a muscle sensation when it pulls the earth.

All that can there be sought is a symbol, less precise and less convenient than the arrows the geometers use, but just as remote from the reality.

All that can be found there is a symbol, less precise and less convenient than the arrows used by mathematicians, but just as far removed from reality.

Anthropomorphism has played a considerable historic rôle in the genesis of mechanics; perhaps it will still at times furnish a symbol which will appear convenient to some minds; but it can not serve as foundation for anything of a truly scientific or philosophic character.

Anthropomorphism has played a significant historical role in the development of mechanics; it might still sometimes provide a symbol that seems useful to some people, but it cannot be the basis for anything genuinely scientific or philosophical.

'The School of the Thread.'—M. Andrade, in his Leçons de mécanique physique, has rejuvenated anthropomorphic mechanics. To the school of mechanics to which Kirchhoff belongs, he opposes that which he bizarrely calls the school of the thread.

'The Thread School.'—M. Andrade, in his Leçons de mécanique physique, has refreshed anthropomorphic mechanics. He contrasts the mechanics school that Kirchhoff is part of with what he oddly refers to as the school of the thread.

This school tries to reduce everything to "the consideration of certain material systems of negligible mass, envisaged in the state of tension and capable of transmitting considerable efforts to distant bodies, systems of which the ideal type is the thread."

This school aims to simplify everything to “the consideration of specific material systems of minimal mass, imagined in a state of tension and able to transfer significant forces to distant objects, systems where the ideal type is the thread.”

A thread which transmits any force is slightly elongated under the action of this force; the direction of the thread tells us the direction of the force, whose magnitude is measured by the elongation of the thread.

A thread that carries any force stretches a bit when this force is applied; the direction of the thread indicates the direction of the force, and its strength is measured by how much the thread stretches.

One may then conceive an experiment such as this. A body A is attached to a thread; at the other extremity of the thread any force acts which varies until the thread takes an elongation α; the acceleration of the body A is noted; A is detached and the body B attached to the same thread; the same force or another force acts anew, and is made to vary until the thread takes again the elongation α; the acceleration of the body B is noted. The experiment is then renewed with both A and B, but so that the thread takes the elongation ßβ. The four observed accelerations should be proportional. We have thus an experimental verification of the law of acceleration above enunciated.

One can then imagine an experiment like this. A body A is attached to a thread; at the other end of the thread, a force is applied that changes until the thread stretches to a length α; the acceleration of body A is recorded; then A is removed and body B is attached to the same thread; the same force or a different force is applied again, and it is adjusted until the thread stretches to the same length α; the acceleration of body B is recorded. The experiment is then repeated with both A and B, but this time so that the thread stretches to a length ßβ. The four recorded accelerations should be proportional. This gives us an experimental confirmation of the previously stated law of acceleration.

Or still better, a body is submitted to the simultaneous action[Pg 105] of several identical threads in equal tension, and by experiment it is sought what must be the orientations of all these threads that the body may remain in equilibrium. We have then an experimental verification of the law of the composition of forces.

Or even better, a body is subjected to the simultaneous action[Pg 105] of multiple identical threads under equal tension, and through experimentation, we look for the orientations of all these threads that allow the body to maintain equilibrium. This gives us an experimental confirmation of the law of force composition.

But, after all, what have we done? We have defined the force to which the thread is subjected by the deformation undergone by this thread, which is reasonable enough; we have further assumed that if a body is attached to this thread, the effort transmitted to it by the thread is equal to the action this body exercises on this thread; after all, we have therefore used the principle of the equality of action and reaction, in considering it, not as an experimental truth, but as the very definition of force.

But really, what have we accomplished? We have defined the force acting on the thread based on the deformation that the thread experiences, which makes sense; we have also assumed that if a body is attached to this thread, the effort transferred to it by the thread is equal to the force that this body exerts on the thread; ultimately, we have applied the principle of action and reaction, treating it not as a proven fact, but as the very definition of force.

This definition is just as conventional as Kirchhoff's, but far less general.

This definition is just as traditional as Kirchhoff's, but much more limited.

All forces are not transmitted by threads (besides, to be able to compare them, they would all have to be transmitted by identical threads). Even if it should be conceded that the earth is attached to the sun by some invisible thread, at least it would be admitted that we have no means of measuring its elongation.

All forces aren’t transmitted through threads (besides, to compare them, they’d all have to be transmitted by the same type of threads). Even if we were to accept that the earth is connected to the sun by some invisible thread, at the very least, it would be acknowledged that we have no way of measuring its stretching.

Nine times out of ten, consequently, our definition would be at fault; no sort of sense could be attributed to it, and it would be necessary to fall back on Kirchhoff's.

Nine times out of ten, our definition would be wrong; it wouldn't make any sense, and we would have to rely on Kirchhoff's instead.

Why then take this détour? You admit a certain definition of force which has a meaning only in certain particular cases. In these cases you verify by experiment that it leads to the law of acceleration. On the strength of this experiment, you then take the law of acceleration as a definition of force in all the other cases.

Why take this detour then? You acknowledge a specific definition of force that only makes sense in certain situations. In these situations, you confirm through experimentation that it leads to the law of acceleration. Based on this experiment, you then use the law of acceleration as a definition of force in all other situations.

Would it not be simpler to consider the law of acceleration as a definition in all cases, and to regard the experiments in question, not as verifications of this law, but as verifications of the principle of reaction, or as demonstrating that the deformations of an elastic body depend only on the forces to which this body is subjected?

Wouldn't it be easier to think of the law of acceleration as a definition in every situation, and to view the experiments as not just tests of this law, but as tests of the principle of reaction, or as showing that the deformations of an elastic body are determined solely by the forces acting on that body?

And this is without taking into account that the conditions under which your definition could be accepted are never fulfilled except imperfectly, that a thread is never without mass, that it is never removed from every force except the reaction of the bodies attached to its extremities.[Pg 106]

And this doesn't even consider that the conditions for your definition to be accepted are rarely met perfectly, that a thread always has mass, and that it's never free from all forces except for the reaction of the bodies at its ends.[Pg 106]

Andrade's ideas are nevertheless very interesting; if they do not satisfy our logical craving, they make us understand better the historic genesis of the fundamental ideas of mechanics. The reflections they suggest show us how the human mind has raised itself from a naïve anthropomorphism to the present conceptions of science.

Andrade's ideas are really intriguing; even if they don't fully satisfy our logical needs, they help us grasp the historical development of the essential concepts in mechanics. The thoughts they provoke illustrate how the human mind has evolved from a simple anthropomorphism to today's scientific understanding.

We see at the start a very particular and in sum rather crude experiment; at the finish, a law perfectly general, perfectly precise, the certainty of which we regard as absolute. This certainty we ourselves have bestowed upon it voluntarily, so to speak, by looking upon it as a convention.

We see at the beginning a unique and overall quite rough experiment; by the end, a law that is completely general, completely precise, whose certainty we consider absolute. This certainty is something we have willingly given it, so to speak, by viewing it as a convention.

Are the law of acceleration, the rule of the composition of forces then only arbitrary conventions? Conventions, yes; arbitrary, no; they would be if we lost sight of the experiments which led the creators of the science to adopt them, and which, imperfect as they may be, suffice to justify them. It is well that from time to time our attention is carried back to the experimental origin of these conventions.

Are the laws of acceleration and the rules for combining forces just arbitrary conventions? They’re conventions, yes, but not arbitrary; they would be if we overlooked the experiments that led the founders of the science to establish them. Those experiments, though imperfect, are enough to validate these rules. It's important to occasionally remind ourselves of the experimental origins of these conventions.


CHAPTER VII

Relative and Absolute Motion

The Principle of Relative Motion.—The attempt has sometimes been made to attach the law of acceleration to a more general principle. The motion of any system must obey the same laws, whether it be referred to fixed axes, or to movable axes carried along in a rectilinear and uniform motion. This is the principle of relative motion, which forces itself upon us for two reasons: first, the commonest experience confirms it, and second, the contrary hypothesis is singularly repugnant to the mind.

The Principle of Relative Motion.—At times, people have tried to connect the law of acceleration to a broader principle. The motion of any system must follow the same rules, whether it's based on fixed axes or movable ones that travel in a straight and constant motion. This is the principle of relative motion, which is evident for two reasons: first, our everyday experiences support it, and second, the opposing idea is quite hard to accept.

Assume it then, and consider a body subjected to a force; the relative motion of this body, in reference to an observer moving with a uniform velocity equal to the initial velocity of the body, must be identical to what its absolute motion would be if it started from rest. We conclude hence that its acceleration can not depend upon its absolute velocity; the attempt has even been made to derive from this a demonstration of the law of acceleration.

Assume this, and think about a body exposed to a force; the relative movement of this body, from the perspective of an observer moving at a constant speed equal to the body's initial speed, must be the same as what its motion would be if it started from a standstill. Therefore, we conclude that its acceleration cannot depend on its absolute speed; there has even been an effort to derive a proof of the law of acceleration from this.

There long were traces of this demonstration in the regulations for the degree B. ès Sc. It is evident that this attempt is idle. The obstacle which prevented our demonstrating the law of acceleration is that we had no definition of force; this obstacle subsists in its entirety, since the principle invoked has not furnished us the definition we lacked.

There have long been signs of this demonstration in the rules for the B. ès Sc. degree. It's clear that this attempt is pointless. The reason we couldn't demonstrate the law of acceleration is that we didn't have a definition of force; this issue still exists completely, since the principle we referred to hasn't given us the definition we needed.

The principle of relative motion is none the less highly interesting and deserves study for its own sake. Let us first try to enunciate it in a precise manner.

The principle of relative motion is still very interesting and deserves to be studied for its own sake. Let's start by stating it clearly.

We have said above that the accelerations of the different bodies forming part of an isolated system depend only on their relative velocities and positions, and not on their absolute velocities and positions, provided the movable axes to which the relative motion is referred move uniformly in a straight line. Or, if[Pg 108] we prefer, their accelerations depend only on the differences of their velocities and the differences of their coordinates, and not on the absolute values of these velocities and coordinates.

We mentioned earlier that the accelerations of the different bodies in an isolated system depend only on their relative velocities and positions, not on their absolute velocities and positions, as long as the moving axes that reference the relative motion move uniformly in a straight line. Alternatively, we could say that their accelerations depend only on the differences in their velocities and the differences in their coordinates, rather than on the absolute values of those velocities and coordinates.

If this principle is true for relative accelerations, or rather for differences of acceleration, in combining it with the law of reaction we shall thence deduce that it is still true of absolute accelerations.

If this principle applies to relative accelerations, or rather differences in acceleration, by combining it with the law of reaction, we can conclude that it also holds true for absolute accelerations.

It then remains to be seen how we may demonstrate that the differences of the accelerations depend only on the differences of the velocities and of the coordinates, or, to speak in mathematical language, that these differences of coordinates satisfy differential equations of the second order.

It then remains to be seen how we can show that the differences in accelerations depend only on the differences in velocities and coordinates, or, in mathematical terms, that these differences in coordinates satisfy second-order differential equations.

Can this demonstration be deduced from experiments or from a priori considerations?

Can this demonstration be figured out from experiments or from a priori reasoning?

Recalling what we have said above, the reader can answer for himself.

Recalling what we mentioned earlier, the reader can respond for themselves.

Thus enunciated, in fact, the principle of relative motion singularly resembles what I called above the generalized principle of inertia; it is not altogether the same thing, since it is a question of the differences of coordinates and not of the coordinates themselves. The new principle teaches us therefore something more than the old, but the same discussion is applicable and would lead to the same conclusions; it is unnecessary to return to it.

Thus stated, the principle of relative motion closely resembles what I previously referred to as the generalized principle of inertia; it’s not exactly the same, since it deals with differences in coordinates rather than the coordinates themselves. The new principle therefore teaches us something beyond the old one, but the same discussion applies and would lead to the same conclusions; there's no need to revisit it.

Newton's Argument.—Here we encounter a very important and even somewhat disconcerting question. I have said the principle of relative motion was for us not solely a result of experiment and that a priori every contrary hypothesis would be repugnant to the mind.

Newton's Argument.—Here we face a very important and somewhat unsettling question. I’ve stated that the principle of relative motion isn’t just based on experiments, and that a priori, any opposing hypothesis would be unacceptable to our understanding.

But then, why is the principle true only if the motion of the movable axes is rectilinear and uniform? It seems that it ought to impose itself upon us with the same force, if this motion is varied, or at any rate if it reduces to a uniform rotation. Now, in these two cases, the principle is not true. I will not dwell long on the case where the motion of the axes is rectilinear without being uniform; the paradox does not bear a moment's examination. If I am on board, and if the train, striking any[Pg 109] obstacle, stops suddenly, I shall be thrown against the seat in front of me, although I have not been directly subjected to any force. There is nothing mysterious in that; if I have undergone the action of no external force, the train itself has experienced an external impact. There can be nothing paradoxical in the relative motion of two bodies being disturbed when the motion of one or the other is modified by an external cause.

But then, why is this principle true only when the motion of the movable axes is straight and constant? It seems like it should apply just as strongly when the motion is varied, or at least when it becomes a consistent rotation. In these two cases, however, the principle doesn't hold. I won’t spend much time on the scenario where the motion of the axes is straight but not constant; the paradox doesn't stand up to scrutiny. If I’m on a train and it suddenly stops after hitting an obstacle, I'll be thrown against the seat in front of me, even though I haven’t been directly acted upon by any force. There's nothing mysterious about that; if I haven’t experienced any external force, the train itself has faced an outside impact. There’s nothing paradoxical about the relative motion of two bodies being disturbed when the motion of one or the other is changed by an external factor.

I will pause longer on the case of relative motions referred to axes which rotate uniformly. If the heavens were always covered with clouds, if we had no means of observing the stars, we nevertheless might conclude that the earth turns round; we could learn this from its flattening or again by the Foucault pendulum experiment.

I will take a moment to focus on the situation of relative motions in relation to axes that rotate consistently. Even if the sky was always overcast and we had no way to see the stars, we could still conclude that the Earth is rotating. We could figure this out from its flattened shape or by conducting the Foucault pendulum experiment.

And yet, in this case, would it have any meaning, to say the earth turns round? If there is no absolute space, can one turn without turning in reference to something else? and, on the other hand, how could we admit Newton's conclusion and believe in absolute space?

And yet, in this case, would it mean anything to say the earth is spinning? If there’s no such thing as absolute space, can one spin without spinning in relation to something else? And, on the other hand, how can we accept Newton's conclusion and still believe in absolute space?

But it does not suffice to ascertain that all possible solutions are equally repugnant to us; we must analyze, in each case, the reasons for our repugnance, so as to make our choice intelligently. The long discussion which follows will therefore be excused.

But it's not enough to just realize that all possible solutions are equally unappealing to us; we need to examine, in each case, the reasons behind our distaste, so we can make our choice wisely. The lengthy discussion that follows will therefore be justified.

Let us resume our fiction: thick clouds hide the stars from men, who can not observe them and are ignorant even of their existence; how shall these men know the earth turns round?

Let’s get back to our story: dense clouds cover the stars from people, who can’t see them and don’t even know they exist; how will these people know that the earth is spinning?

Even more than our ancestors, no doubt, they will regard the ground which bears them as fixed and immovable; they will await much longer the advent of a Copernicus. But in the end the Copernicus would come—how?

Even more than our ancestors, there's no doubt they will see the ground beneath them as solid and unchangeable; they will wait much longer for the arrival of a Copernicus. But eventually, the Copernicus will come—how?

The students of mechanics in this world would not at first be confronted with an absolute contradiction. In the theory of relative motion, besides real forces, two fictitious forces are met which are called ordinary and compound centrifugal force. Our imaginary scientists could therefore explain everything by regarding these two forces as real, and they would not see therein any contradiction of the generalized principle of inertia, for these forces would depend, the one on the relative positions of[Pg 110] the various parts of the system, as real attractions do, the other on their relative velocities, as real frictions do.

The mechanics students in this world wouldn’t initially face a complete contradiction. In the theory of relative motion, alongside real forces, there are two fictitious forces known as ordinary and compound centrifugal force. Our imaginary scientists could explain everything by treating these two forces as real, and they wouldn’t see this as contradicting the generalized principle of inertia. One force would depend on the relative positions of[Pg 110] the different parts of the system, similar to how real attractions work, while the other would depend on their relative velocities, like real frictions do.

Many difficulties, however, would soon awaken their attention; if they succeeded in realizing an isolated system, the center of gravity of this system would not have an almost rectilinear path. They would invoke, to explain this fact, the centrifugal forces which they would regard as real, and which they would attribute no doubt to the mutual actions of the bodies. Only they would not see these forces become null at great distances, that is to say in proportion as the isolation was better realized; far from it; centrifugal force increases indefinitely with the distance.

Many challenges, however, would soon capture their attention; if they managed to create an isolated system, the center of gravity of this system wouldn't follow a nearly straight path. They would explain this phenomenon using centrifugal forces, which they would consider to be real, and they would likely attribute them to the mutual interactions of the bodies. They just wouldn’t recognize that these forces become negligible at great distances, meaning as the isolation improves; quite the opposite; centrifugal force actually increases indefinitely with distance.

This difficulty would seem to them already sufficiently great; and yet it would not stop them long; they would soon imagine some very subtile medium, analogous to our ether, in which all bodies would be immersed and which would exert a repellent action upon them.

This difficulty might seem quite significant to them; however, it wouldn’t hold them back for long; they would quickly come up with some very subtle medium, similar to our ether, in which all objects would be immersed and that would exert a repellent force on them.

But this is not all. Space is symmetric, and yet the laws of motion would not show any symmetry; they would have to distinguish between right and left. It would be seen for instance that cyclones turn always in the same sense, whereas by reason of symmetry these winds should turn indifferently in one sense and in the other. If our scientists by their labor had succeeded in rendering their universe perfectly symmetric, this symmetry would not remain, even though there was no apparent reason why it should be disturbed in one sense rather than in the other.

But that’s not all. Space is symmetrical, yet the laws of motion don’t reflect that symmetry; they have to differentiate between right and left. For example, cyclones always rotate in the same direction, but because of symmetry, those winds should spin equally in either direction. If our scientists had managed to make their universe perfectly symmetrical through their work, that symmetry wouldn’t hold, even if there was no clear reason for it to be disrupted in one direction over the other.

They would get themselves out of the difficulty doubtless, they would invent something which would be no more extraordinary than the glass spheres of Ptolemy, and so it would go on, complications accumulating, until the long-expected Copernicus sweeps them all away at a single stroke, saying: It is much simpler to assume the earth turns round.

They would find a way out of the problem, no doubt; they would come up with something just as ordinary as Ptolemy's glass spheres, and it would keep going like that, with complications piling up, until the long-awaited Copernicus clears everything away in one go, stating: It's much simpler to think of the earth as rotating.

And just as our Copernicus said to us: It is more convenient to suppose the earth turns round, since thus the laws of astronomy are expressible in a much simpler language; this one would say: It is more convenient to suppose the earth turns round, since thus the laws of mechanics are expressible in a much simpler language.

And just like our Copernicus told us: It's easier to assume the earth revolves around the sun, because this way, the laws of astronomy can be explained in a much simpler way; one could say: It's easier to assume the earth revolves around the sun, because this way, the laws of mechanics can be explained in a much simpler way.

This does not preclude maintaining that absolute space, that[Pg 111] is to say the mark to which it would be necessary to refer the earth to know whether it really moves, has no objective existence. Hence, this affirmation: 'the earth turns round' has no meaning, since it can be verified by no experiment; since such an experiment, not only could not be either realized or dreamed by the boldest Jules Verne, but can not be conceived of without contradiction; or rather these two propositions: 'the earth turns round,' and, 'it is more convenient to suppose the earth turns round' have the same meaning; there is nothing more in the one than in the other.

This doesn't rule out the idea that absolute space, which[Pg 111] is the reference point we need to determine whether the earth truly moves, doesn't exist objectively. Therefore, the statement 'the earth turns round' is meaningless, as it can't be verified by any experiment; such an experiment couldn't even be imagined by the most adventurous Jules Verne, and it can't be thought of without contradiction. In fact, these two statements: 'the earth turns round,' and 'it's more convenient to assume the earth turns round' mean the same thing; there's nothing more in one than in the other.

Perhaps one will not be content even with that, and will find it already shocking that among all the hypotheses, or rather all the conventions we can make on this subject, there is one more convenient than the others.

Perhaps one might not be satisfied even with that, and might find it already surprising that among all the hypotheses, or rather all the conventions we can create on this topic, there is one that is more convenient than the others.

But if it has been admitted without difficulty when it was a question of the laws of astronomy, why should it be shocking in that which concerns mechanics?

But if it's been easily accepted when it comes to the laws of astronomy, why should it be surprising when it relates to mechanics?

We have seen that the coordinates of bodies are determined by differential equations of the second order, and that so are the differences of these coordinates. This is what we have called the generalized principle of inertia and the principle of relative motion. If the distances of these bodies were determined likewise by equations of the second order, it seems that the mind ought to be entirely satisfied. In what measure does the mind get this satisfaction and why is it not content with it?

We have observed that the positions of objects are defined by second-order differential equations, and so are the changes in these positions. This is what we refer to as the generalized principle of inertia and the principle of relative motion. If the distances between these objects were also determined by second-order equations, it seems that the mind should be completely satisfied. To what extent does the mind find this satisfaction, and why isn't it fully content with it?

To account for this, we had better take a simple example. I suppose a system analogous to our solar system, but where one can not perceive fixed stars foreign to this system, so that astronomers can observe only the mutual distances of the planets and the sun, and not the absolute longitudes of the planets. If we deduce directly from Newton's law the differential equations which define the variation of these distances, these equations will not be of the second order. I mean that if, besides Newton's law, one knew the initial values of these distances and of their derivatives with respect to the time, that would not suffice to determine the values of these same distances at a subsequent instant. There would still be lacking one datum, and this datum might be for instance what astronomers call the area-constant.[Pg 112]

To understand this, let's consider a simple example. Imagine a system like our solar system, but where you can't see distant fixed stars outside this system, so astronomers can only observe the distances between the planets and the sun, not the absolute positions of the planets. If we directly derive from Newton's law the differential equations that describe how these distances change, these equations won't be second-order. What I mean is that even if we have Newton's law and the initial values for these distances and their rates of change over time, that wouldn't be enough to figure out the values of these distances at a later time. We would still need one more piece of information, which could be what astronomers refer to as the area-constant.[Pg 112]

But here two different points of view may be taken; we may distinguish two sorts of constants. To the eyes of the physicist the world reduces to a series of phenomena, depending, on the one hand, solely upon the initial phenomena; on the other hand, upon the laws which bind the consequents to the antecedents. If then observation teaches us that a certain quantity is a constant, we shall have the choice between two conceptions.

But here we can look at it in two different ways; we can identify two types of constants. To a physicist, the world is just a series of phenomena, based, on one hand, only on the initial phenomena, and on the other hand, on the laws that connect the results to the causes. If observation shows us that a certain quantity is a constant, we will have two perspectives to choose from.

Either we shall assume that there is a law requiring this quantity not to vary, but that by chance, at the beginning of the ages, it had, rather than another, this value it has been forced to keep ever since. This quantity might then be called an accidental constant.

Either we can assume that there's a law that keeps this quantity from changing, but that by chance, at the beginning of time, it happened to have this value instead of another, which it has been forced to maintain ever since. This quantity could then be called an accidental constant.

Or else we shall assume, on the contrary, that there is a law of nature which imposes upon this quantity such a value and not such another.

Or else we'll assume, on the other hand, that there is a natural law that gives this quantity a specific value and not another one.

We shall then have what we may call an essential constant.

We will then have what we can call an essential constant.

For example, in virtue of Newton's laws, the duration of the revolution of the earth must be constant. But if it is 366 sidereal days and something over, and not 300 or 400, this is in consequence of I know not what initial chance. This is an accidental constant. If, on the contrary, the exponent of the distance which figures in the expression of the attractive force is equal to −2 and not to −3, this is not by chance, but because Newton's law requires it. This is an essential constant.

For example, according to Newton's laws, the length of time it takes for the Earth to complete one revolution must be consistent. However, if it's 366 sidereal days plus a bit more, and not 300 or 400 days, that's due to some initial chance that I don't understand. This is an accidental constant. On the other hand, if the exponent of the distance in the formula for gravitational force is -2 instead of -3, that's not by chance; it's because Newton's law demands it. This is an essential constant.

I know not whether this way of giving chance its part is legitimate in itself, and whether this distinction is not somewhat artificial; it is certain at least that, so long as nature shall have secrets, this distinction will be in application extremely arbitrary and always precarious.

I don’t know if this way of allowing chance to play its role is valid in itself, and whether this distinction is somewhat artificial; what’s clear is that as long as nature has secrets, this distinction will be very arbitrary and always uncertain.

As to the area-constant, we are accustomed to regard it as accidental. Is it certain our imaginary astronomers would do the same? If they could have compared two different solar systems, they would have the idea that this constant may have several different values; but my very supposition in the beginning was that their system should appear as isolated, and that they should observe no star foreign to it. Under these conditions, they would see only one single constant which would have a single value absolutely invariable; they would be led without any doubt to regard it as an essential constant.[Pg 113]

Regarding the area constant, we usually think of it as something random. Would our imaginary astronomers feel the same way? If they could compare two different solar systems, they might believe that this constant could have several different values. However, my initial assumption was that their system would appear isolated, and they would not observe any stars outside of it. In that case, they would only see one constant, which would have one unchanging value; they would undoubtedly consider it an essential constant.[Pg 113]

A word in passing to forestall an objection: the inhabitants of this imaginary world could neither observe nor define the area-constant as we do, since the absolute longitudes escape them; that would not preclude their being quickly led to notice a certain constant which would introduce itself naturally into their equations and which would be nothing but what we call the area-constant.

A quick note to address a potential objection: the people in this imagined world couldn't measure or identify the area constant like we do, since they can't see absolute longitudes; however, that wouldn't stop them from noticing a certain constant that would naturally appear in their equations, which would be what we refer to as the area constant.

But then see what would happen. If the area-constant is regarded as essential, as depending upon a law of nature, to calculate the distances of the planets at any instant it will suffice to know the initial values of these distances and those of their first derivatives. From this new point of view, the distances will be determined by differential equations of the second order.

But then look at what would happen. If we consider the area-constant to be crucial and based on a natural law, to calculate the distances of the planets at any moment, it will be enough to know the initial values of these distances and their first derivatives. From this new perspective, the distances will be determined by second-order differential equations.

Yet would the mind of these astronomers be completely satisfied? I do not believe so; first, they would soon perceive that in differentiating their equations and thus raising their order, these equations became much simpler. And above all they would be struck by the difficulty which comes from symmetry. It would be necessary to assume different laws, according as the aggregate of the planets presented the figure of a certain polyhedron or of the symmetric polyhedron, and one would escape from this consequence only by regarding the area-constant as accidental.

Yet would the minds of these astronomers be fully satisfied? I don’t think so; first, they would quickly notice that by differentiating their equations and increasing their order, these equations became much simpler. And above all, they would be struck by the challenges that arise from symmetry. It would be necessary to assume different laws depending on whether the group of planets formed the shape of a certain polyhedron or a symmetric polyhedron, and one could only avoid this conclusion by seeing the area-constant as coincidental.

I have taken a very special example, since I have supposed astronomers who did not at all consider terrestrial mechanics, and whose view was limited to the solar system. Our universe is more extended than theirs, as we have fixed stars, but still it too is limited, and so we might reason on the totality of our universe as the astronomers on their solar system.

I’ve chosen a very unique example, considering astronomers who completely ignore terrestrial mechanics and focus only on the solar system. Our universe is bigger than theirs, since we have fixed stars, but it’s still limited. We could think about our entire universe in the same way the astronomers did with their solar system.

Thus we see that finally we should be led to conclude that the equations which define distances are of an order superior to the second. Why should we be shocked at that, why do we find it perfectly natural for the series of phenomena to depend upon the initial values of the first derivatives of these distances, while we hesitate to admit that they may depend on the initial values of the second derivatives? This can only be because of the habits of mind created in us by the constant study of the generalized principle of inertia and its consequences.[Pg 114]

So, we can finally conclude that the equations that define distances are of a higher order than the second. Why should we be surprised by this? Why do we accept that the series of phenomena depend on the initial values of the first derivatives of these distances, but hesitate to acknowledge that they might also depend on the initial values of the second derivatives? This hesitation can only come from the mindset shaped by our constant study of the generalized principle of inertia and its implications.[Pg 114]

The values of the distances at any instant depend upon their initial values, upon those of their first derivatives and also upon something else. What is this something else?

The values of the distances at any moment depend on their initial values, on the values of their first derivatives, and also on something else. What is this something else?

If we will not admit that this may be simply one of the second derivatives, we have only the choice of hypotheses. Either it may be supposed, as is ordinarily done, that this something else is the absolute orientation of the universe in space, or the rapidity with which this orientation varies; and this supposition may be correct; it is certainly the most convenient solution for geometry; it is not the most satisfactory for the philosopher, because this orientation does not exist.

If we don't acknowledge that this could just be one of the second derivatives, we only have a choice of hypotheses. We can either assume, as is usually done, that this something else is the absolute orientation of the universe in space, or the speed at which this orientation changes; and this assumption might be right; it’s definitely the most convenient solution for geometry; it’s not the most satisfying for the philosopher, because this orientation doesn’t actually exist.

Or it may be supposed that this something else is the position or the velocity of some invisible body; this has been done by certain persons who have even called it the body alpha, although we are doomed never to know anything of this body but its name. This is an artifice entirely analogous to that of which I spoke at the end of the paragraph devoted to my reflections on the principle of inertia.

Or it might be assumed that this something else refers to the position or speed of some invisible object; some people have even named it body alpha, although we are destined to know nothing about this object except its name. This is a trick completely similar to what I mentioned at the end of the paragraph where I reflected on the principle of inertia.

But, after all, the difficulty is artificial. Provided the future indications of our instruments can depend only on the indications they have given us or would have given us formerly, this is all that is necessary. Now as to this we may rest easy.

But, in the end, the difficulty is made up. As long as the future signals from our instruments rely only on the signals they’ve given us or would have given us in the past, that’s all we need. Now, we can be confident about that.


CHAPTER VIII

Energy and Thermodynamics

Energetics.—The difficulties inherent in the classic mechanics have led certain minds to prefer a new system they call energetics.

Energy Science.—The challenges found in classical mechanics have prompted some thinkers to favor a new system they refer to as energetics.

Energetics took its rise as an outcome of the discovery of the principle of the conservation of energy. Helmholtz gave it its final form.

Energetics emerged from the discovery of the principle of energy conservation. Helmholtz refined it to its final form.

It begins by defining two quantities which play the fundamental rôle in this theory. They are kinetic energy, or vis viva, and potential energy.

It starts by defining two key quantities that are central to this theory. They are kinetic energy, or vis viva, and potential energy.

All the changes which bodies in nature can undergo are regulated by two experimental laws:

All the changes that bodies in nature can go through are governed by two experimental laws:

1º The sum of kinetic energy and potential energy is constant. This is the principle of the conservation of energy.

1º The total kinetic energy plus potential energy remains constant. This is the principle of energy conservation.

2º If a system of bodies is at A at the time t0 and at B at the time t1, it always goes from the first situation to the second in such a way that the mean value of the difference between the two sorts of energy, in the interval of time which separates the two epochs t0 and t1, may be as small as possible.

2º If a system of bodies is at A at time t0 and at B at time t1, it always transitions from the first state to the second in such a way that the mean value of the difference between the two types of energy during the time interval between t0 and t1 can be minimized as much as possible.

This is Hamilton's principle, which is one of the forms of the principle of least action.

This is Hamilton's principle, which is one of the versions of the principle of least action.

The energetic theory has the following advantages over the classic theory:

The energetic theory has these advantages over the classic theory:

1º It is less incomplete; that is to say, Hamilton's principle and that of the conservation of energy teach us more than the fundamental principles of the classic theory, and exclude certain motions not realized in nature and which would be compatible with the classic theory:

1º It's less incomplete; in other words, Hamilton's principle and the conservation of energy provide us with more insights than the basic principles of classical theory, and they rule out certain motions that don't occur in nature, which would still fit within classical theory:

2º It saves us the hypothesis of atoms, which it was almost impossible to avoid with the classic theory.

2º It spares us the assumption of atoms, which was nearly unavoidable with the classical theory.

But it raises in its turn new difficulties:

But it brings up new challenges:

The definitions of the two sorts of energy would raise difficulties almost as great as those of force and mass in the first[Pg 116] system. Yet they may be gotten over more easily, at least in the simplest cases.

The definitions of the two types of energy would create challenges almost as significant as those of force and mass in the first[Pg 116] system. However, they might be resolved more easily, at least in the simplest situations.

Suppose an isolated system formed of a certain number of material points; suppose these points subjected to forces depending only on their relative position and their mutual distances, and independent of their velocities. In virtue of the principle of the conservation of energy, a function of forces must exist.

Suppose there's an isolated system made up of a specific number of material points; suppose these points are subject to forces that depend only on their relative positions and the distances between them, and that these forces are independent of their speeds. According to the principle of energy conservation, there must be a function that describes these forces.

In this simple case the enunciation of the principle of the conservation of energy is of extreme simplicity. A certain quantity, accessible to experiment, must remain constant. This quantity is the sum of two terms; the first depends only on the position of the material points and is independent of their velocities; the second is proportional to the square of these velocities. This resolution can take place only in a single way.

In this straightforward case, stating the principle of energy conservation is very simple. A specific quantity, which can be measured through experiments, must stay constant. This quantity is the sum of two components: the first one depends solely on the position of the material points and does not depend on their speeds; the second is proportional to the square of those speeds. This resolution can only happen in one specific way.

The first of these terms, which I shall call U, will be the potential energy; the second, which I shall call T, will be the kinetic energy.

The first of these terms, which I’ll refer to as U, will be the potential energy; the second, which I’ll refer to as T, will be the kinetic energy.

It is true that if T + U is a constant, so is any function of T + U,

It’s true that if T + U is a constant, then any function of T + U is also a constant.

Φ (T + U).

Φ (T + U).

But this function Φ (T + U) will not be the sum of two terms the one independent of the velocities, the other proportional to the square of these velocities. Among the functions which remain constant there is only one which enjoys this property, that is T + U (or a linear function of T + U, which comes to the same thing, since this linear function may always be reduced to T + U by change of unit and of origin). This then is what we shall call energy; the first term we shall call potential energy and the second kinetic energy. The definition of the two sorts of energy can therefore be carried through without any ambiguity.

But this function Φ (T + U) won't just be the sum of two terms—one that doesn't depend on velocities and the other that's proportional to the square of those velocities. Among the functions that stay constant, there's only one that has this property, which is T + U (or a linear function of T + U, which is essentially the same since we can always convert this linear function back to T + U by changing the unit and origin). This is what we will refer to as energy; we'll call the first term potential energy and the second kinetic energy. So, the definitions of these two types of energy can be clearly established.

It is the same with the definition of the masses. Kinetic energy, or vis viva, is expressed very simply by the aid of the masses and the relative velocities of all the material points with reference to one of them. These relative velocities are accessible to observation, and, when we know the expression of the kinetic energy as function of these relative velocities, the coefficients of this expression will give us the masses.[Pg 117]

It’s the same with how we define mass. Kinetic energy, or vis viva, is clearly expressed through the masses and the relative speeds of all the material points in relation to one of them. We can observe these relative speeds, and once we have the formula for kinetic energy based on these relative speeds, the coefficients in that formula will reveal the masses.[Pg 117]

Thus, in this simple case, the fundamental ideas may be defined without difficulty. But the difficulties reappear in the more complicated cases and, for instance, if the forces, in lieu of depending only on the distances, depend also on the velocities. For example, Weber supposes the mutual action of two electric molecules to depend not only on their distance, but on their velocity and their acceleration. If material points should attract each other according to an analogous law, U would depend on the velocity, and might contain a term proportional to the square of the velocity.

Thus, in this simple case, the basic ideas can be defined easily. However, the challenges arise in more complex situations, such as when the forces depend not just on the distances but also on the velocities. For example, Weber suggests that the interaction between two electric molecules is influenced not only by their distance but also by their velocity and acceleration. If material points were to attract each other based on a similar principle, U would depend on the velocity and could include a term that is proportional to the square of the velocity.

Among the terms proportional to the squares of the velocities, how distinguish those which come from T or from U? Consequently, how distinguish the two parts of energy?

Among the terms related to the squares of the speeds, how do we differentiate those that come from T or from U? Therefore, how do we separate the two components of energy?

But still more; how define energy itself? We no longer have any reason to take as definition T + U rather than any other function of T + U, when the property which characterized T + U has disappeared, that, namely, of being the sum of two terms of a particular form.

But still more; how do we define energy itself? We no longer have any reason to define it as T + U instead of any other function of T + U, when the characteristic that defined T + U has disappeared, which is that it was the sum of two terms of a specific form.

But this is not all; it is necessary to take account, not only of mechanical energy properly so called, but of the other forms of energy, heat, chemical energy, electric energy, etc. The principle of the conservation of energy should be written:

But this isn't everything; we need to consider not just mechanical energy itself, but also other forms of energy like heat, chemical energy, electrical energy, and so on. The principle of energy conservation should be stated as:

T + U + Q = const.

T + U + Q = constant.

where T would represent the sensible kinetic energy, U the potential energy of position, depending only on the position of the bodies, Q the internal molecular energy, under the thermal, chemic or electric form.

where T represents the sensible kinetic energy, U is the potential energy of position, which depends only on the location of the bodies, and Q denotes the internal molecular energy, in thermal, chemical, or electric form.

All would go well if these three terms were absolutely distinct, if T were proportional to the square of the velocities, U independent of these velocities and of the state of the bodies, Q independent of the velocities and of the positions of the bodies and dependent only on their internal state.

All would be fine if these three terms were completely separate, if T were proportional to the square of the velocities, U were not affected by these velocities or the state of the bodies, and Q were not influenced by the velocities or positions of the bodies, only depending on their internal state.

The expression for the energy could be resolved only in one single way into three terms of this form.

The expression for the energy could only be broken down in one specific way into three terms like this.

But this is not the case; consider electrified bodies; the electrostatic energy due to their mutual action will evidently depend upon their charge, that is to say, on their state; but it will equally[Pg 118] depend upon their position. If these bodies are in motion, they will act one upon another electrodynamically and the electrodynamic energy will depend not only upon their state and their position, but upon their velocities.

But this isn't true; take electrified objects, for example. The electrostatic energy from their interaction clearly depends on their charge, meaning their state; but it will also[Pg 118] depend on their position. If these objects are in motion, they will interact with each other electrodynamically, and the electrodynamic energy will depend not just on their state and their position, but also on their velocities.

We therefore no longer have any means of making the separation of the terms which should make part of T, of U and of Q, and of separating the three parts of energy.

We no longer have any way to separate the terms that should belong to T, U, and Q, and to divide the three parts of energy.

If (T + U + Q) is constant so is any function Φ (T + U + Q).

If (T + U + Q) is constant, then any function Φ (T + U + Q) is also constant.

If T + U + Q were of the particular form I have above considered, no ambiguity would result; among the functions Φ (T + U + Q) which remain constant, there would only be one of this particular form, and that I should convene to call energy.

If T + U + Q were in the specific form I mentioned earlier, there would be no confusion; among the functions Φ (T + U + Q) that stay constant, there would only be one of this specific form, which I would refer to as energy.

But as I have said, this is not rigorously the case; among the functions which remain constant, there is none which can be put rigorously under this particular form; hence, how choose among them the one which should be called energy? We no longer have anything to guide us in our choice.

But as I've mentioned, this isn't strictly true; among the functions that stay the same, there's none that can be precisely categorized in this way; so, how do we pick which one should be called energy? We no longer have anything to help us make that choice.

There only remains for us one enunciation of the principle of the conservation of energy: There is something which remains constant. Under this form it is in its turn out of the reach of experiment and reduces to a sort of tautology. It is clear that if the world is governed by laws, there will be quantities which will remain constant. Like Newton's laws, and, for an analogous reason, the principle of the conservation of energy, founded on experiment, could no longer be invalidated by it.

There’s just one way to state the principle of conservation of energy: Something stays constant. In this form, it’s also beyond experimentation and becomes somewhat of a tautology. It’s clear that if the universe follows laws, there will be quantities that stay constant. Much like Newton's laws, and for similar reasons, the principle of conservation of energy, which is based on experiments, can’t be disproven by them.

This discussion shows that in passing from the classic to the energetic system progress has been made; but at the same time it shows this progress is insufficient.

This discussion highlights that moving from the classic to the energetic system has led to progress; however, it also indicates that this progress is not enough.

Another objection seems to me still more grave: the principle of least action is applicable to reversible phenomena; but it is not at all satisfactory in so far as irreversible phenomena are concerned; the attempt by Helmholtz to extend it to this kind of phenomena did not succeed and could not succeed; in this regard everything remains to be done. The very statement of the principle of least action has something about it repugnant to the mind. To go from one point to another, a material molecule, acted upon by no force, but required to move on a surface, will take the geodesic line, that is to say, the shortest path.[Pg 119]

Another objection seems even more serious to me: the principle of least action applies to reversible phenomena, but it’s not at all satisfactory when it comes to irreversible phenomena; Helmholtz's attempt to apply it to these kinds of phenomena failed and couldn’t succeed; in this respect, there’s still a lot of work to be done. The very idea of the principle of least action has something unsettling about it. When moving from one point to another, a material molecule, not acted upon by any force but required to move on a surface, will follow the geodesic line, which means the shortest path.[Pg 119]

This molecule seems to know the point whither it is to go, to foresee the time it would take to reach it by such and such a route, and then to choose the most suitable path. The statement presents the molecule to us, so to speak, as a living and free being. Clearly it would be better to replace it by an enunciation less objectionable, and where, as the philosophers would say, final causes would not seem to be substituted for efficient causes.

This molecule seems to know where it needs to go, to anticipate how long it would take to get there along different routes, and then to select the best path. This description portrays the molecule, so to speak, as a living and free entity. It would definitely be better to express this in a way that’s less controversial, where, as philosophers would say, ultimate purposes don’t seem to replace actual causes.

Thermodynamics.[4]—The rôle of the two fundamental principles of thermodynamics in all branches of natural philosophy becomes daily more important. Abandoning the ambitious theories of forty years ago, which were encumbered by molecular hypotheses, we are trying to-day to erect upon thermodynamics alone the entire edifice of mathematical physics. Will the two principles of Mayer and of Clausius assure to it foundations solid enough for it to last some time? No one doubts it; but whence comes this confidence?

Thermodynamics.[4]—The role of the two fundamental principles of thermodynamics in all areas of natural science is becoming increasingly significant. Moving away from the ambitious theories of forty years ago, which were complicated by molecular assumptions, we are now attempting to build the entire framework of mathematical physics solely on thermodynamics. Will the two principles of Mayer and Clausius provide a strong enough foundation for it to stand the test of time? No one doubts it; but where does this confidence come from?

An eminent physicist said to me one day à propos of the law of errors: "All the world believes it firmly, because the mathematicians imagine that it is a fact of observation, and the observers that it is a theorem of mathematics." It was long so for the principle of the conservation of energy. It is no longer so to-day; no one is ignorant that this is an experimental fact.

An influential physicist once told me about the law of errors: "Everyone believes it strongly because mathematicians think it's an observational fact, while observers see it as a mathematical theorem." This was the case for the principle of conservation of energy for a long time. Not anymore; today, everyone knows that this is an experimental fact.

But then what gives us the right to attribute to the principle itself more generality and more precision than to the experiments which have served to demonstrate it? This is to ask whether it is legitimate, as is done every day, to generalize empirical data, and I shall not have the presumption to discuss this question, after so many philosophers have vainly striven to solve it. One thing is certain; if this power were denied us, science could not exist or, at least, reduced to a sort of inventory, to the ascertaining of isolated facts, it would have no value for us, since it could give no satisfaction to our craving for order and harmony and since it would be at the same time incapable of foreseeing. As the circumstances which have preceded any fact will probably never be simultaneously reproduced, a first generalization[Pg 120] is already necessary to foresee whether this fact will be reproduced again after the least of these circumstances shall be changed.

But what gives us the right to attribute more generality and precision to the principle itself than to the experiments that demonstrated it? This raises the question of whether it’s legitimate, as happens every day, to generalize empirical data, and I won’t presume to tackle this issue after so many philosophers have tried in vain to resolve it. One thing is certain: if we were denied this power, science couldn’t exist, or at least it would be reduced to a mere listing of isolated facts, which wouldn’t hold any value for us because it wouldn’t satisfy our desire for order and harmony and would also be incapable of making predictions. Since the circumstances leading to any fact will likely never be reproduced in the exact same way, the first generalization[Pg 120] is essential to anticipate whether this fact will reappear once even the slightest of those circumstances changes.

But every proposition may be generalized in an infinity of ways. Among all the generalizations possible, we must choose, and we can only choose the simplest. We are therefore led to act as if a simple law were, other things being equal, more probable than a complicated law.

But every statement can be generalized in countless ways. Among all the possible generalizations, we need to make a choice, and the simplest option is usually the best. So, we end up treating a simple law as more likely to be true than a complex one, all else being equal.

Half a century ago this was frankly confessed, and it was proclaimed that nature loves simplicity; she has since too often given us the lie. To-day we no longer confess this tendency, and we retain only so much of it as is indispensable if science is not to become impossible.

Half a century ago, this was openly acknowledged, and it was stated that nature favors simplicity; since then, it has often proven otherwise. Today, we no longer admit to this tendency, and we keep only as much of it as is necessary for science to remain viable.

In formulating a general, simple and precise law on the basis of experiments relatively few and presenting certain divergences, we have therefore only obeyed a necessity from which the human mind can not free itself.

In creating a general, straightforward, and clear law based on relatively few experiments that show some differences, we have simply followed a necessity that the human mind cannot escape.

But there is something more, and this is why I dwell upon the point.

But there’s more to it, and that’s why I keep emphasizing this point.

No one doubts that Mayer's principle is destined to survive all the particular laws from which it was obtained, just as Newton's law has survived Kepler's laws, from which it sprang, and which are only approximative if account be taken of perturbations.

No one questions that Mayer's principle is set to outlast all the specific laws it was derived from, just like Newton's law has outlasted Kepler's laws, from which it originated, and which are only approximate when considering disturbances.

Why does this principle occupy thus a sort of privileged place among all the physical laws? There are many little reasons for it.

Why does this principle hold a special status among all the physical laws? There are several minor reasons for that.

First of all it is believed that we could not reject it or even doubt its absolute rigor without admitting the possibility of perpetual motion; of course we are on our guard at such a prospect, and we think ourselves less rash in affirming Mayer's principle than in denying it.

First of all, it's believed that we couldn't dismiss it or even question its complete validity without accepting the possibility of perpetual motion; naturally, we're cautious about that idea, and we feel it's less reckless to affirm Mayer's principle than to deny it.

That is perhaps not wholly accurate; the impossibility of perpetual motion implies the conservation of energy only for reversible phenomena.

That might not be completely accurate; the impossibility of perpetual motion suggests the conservation of energy only applies to reversible processes.

The imposing simplicity of Mayer's principle likewise contributes to strengthen our faith. In a law deduced immediately from experiment, like Mariotte's, this simplicity would rather[Pg 121] seem to us a reason for distrust; but here this is no longer the case; we see elements, at first sight disparate, arrange themselves in an unexpected order and form a harmonious whole; and we refuse to believe that an unforeseen harmony may be a simple effect of chance. It seems that our conquest is the dearer to us the more effort it has cost us, or that we are the surer of having wrested her true secret from nature the more jealously she has hidden it from us.

The striking simplicity of Mayer's principle also strengthens our confidence. In a law directly derived from experiments, like Mariotte's, this simplicity might seem like a reason to be skeptical; however, that’s not the case here. We observe elements that initially seem unrelated arranging themselves in an unexpected way and forming a harmonious whole. We refuse to believe that this unanticipated harmony is just a random coincidence. It appears that our achievements are more valuable to us the more effort it took to achieve them, or that we feel more confident we've uncovered nature's true secret the more she has tried to keep it hidden from us.

But those are only little reasons; to establish Mayer's law as an absolute principle, a more profound discussion is necessary. But if this be attempted, it is seen that this absolute principle is not even easy to state.

But those are just minor reasons; to establish Mayer's law as a definitive principle, a deeper discussion is needed. However, if this is tried, it becomes clear that this absolute principle isn't even easy to articulate.

In each particular case it is clearly seen what energy is and at least a provisional definition of it can be given; but it is impossible to find a general definition for it.

In each specific case, it's clear what energy is, and we can at least provide a temporary definition; however, it's impossible to come up with a general definition for it.

If we try to enunciate the principle in all its generality and apply it to the universe, we see it vanish, so to speak, and nothing is left but this: There is something which remains constant.

If we attempt to state the principle in its entirety and relate it to the universe, we see it disappear, so to speak, and all that remains is this: There is something that stays constant.

But has even this any meaning? In the determinist hypothesis, the state of the universe is determined by an extremely great number n of parameters which I shall call x1, x2, ... xn. As soon as the values of these n parameters at any instant are known, their derivatives with respect to the time are likewise known and consequently the values of these same parameters at a preceding or subsequent instant can be calculated. In other words, these n parameters satisfy n differential equations of the first order.

But does this even have any meaning? In the determinist hypothesis, the state of the universe is determined by a very large number n of parameters which I’ll call x1, x2, ... xn. As soon as the values of these n parameters at any moment are known, their rates of change over time are also known, and therefore the values of these same parameters at a previous or future moment can be calculated. In other words, these n parameters satisfy n first-order differential equations.

These equations admit of n − 1 integrals and consequently there are n − 1 functions of x1, x2,... xn, which remain constant. If then we say there is something which remains constant, we only utter a tautology. We should even be puzzled to say which among all our integrals should retain the name of energy.

These equations allow for n − 1 integrals, which means there are n − 1 functions of x1, x2,... xn that stay constant. If we then say there’s something that stays constant, we’re just repeating ourselves. We might even find it confusing to determine which of our integrals should be called energy.

Besides, Mayer's principle is not understood in this sense when it is applied to a limited system. It is then assumed that p of our parameters vary independently, so that we only have np relations, generally linear, between our n parameters and their derivatives.[Pg 122]

Besides, Mayer's principle isn't understood this way when applied to a limited system. It's assumed that p of our parameters change independently, so we only have np relationships, usually linear, between our n parameters and their derivatives.[Pg 122]

To simplify the enunciation, suppose that the sum of the work of the external forces is null, as well as that of the quantities of heat given off to the outside. Then the signification of our principle will be:

To make it easier to explain, let's assume that the total work done by external forces is zero, and so is the amount of heat released to the outside. Then the meaning of our principle will be:

There is a combination of these n − p relations whose first member is an exact differential; and then this differential vanishing in virtue of our n − p relations, its integral is a constant and this integral is called energy.

There is a combination of these n − p relations where the first element is an exact differential; and since this differential goes to zero due to our n − p relations, its integral is a constant, and this integral is referred to as energy.

But how can it be possible that there are several parameters whose variations are independent? That can only happen under the influence of external forces (although we have supposed, for simplicity, that the algebraic sum of the effects of these forces is null). In fact, if the system were completely isolated from all external action, the values of our n parameters at a given instant would suffice to determine the state of the system at any subsequent instant, provided always we retain the determinist hypothesis; we come back therefore to the same difficulty as above.

But how can it be possible for there to be several parameters that change independently? That can only happen under the influence of external forces (even though we’ve assumed, for simplicity, that the total effect of these forces is zero). In fact, if the system were completely isolated from any external influence, the values of our n parameters at a certain moment would be enough to determine the state of the system at any future moment, as long as we stick to the deterministic assumption; we find ourselves facing the same issue as before.

If the future state of the system is not entirely determined by its present state, this is because it depends besides upon the state of bodies external to the system. But then is it probable that there exist between the parameters xi, which define the state of the system, equations independent of this state of the external bodies? and if in certain cases we believe we can find such, is this not solely in consequence of our ignorance and because the influence of these bodies is too slight for our experimenting to detect it?

If the future state of the system isn’t completely determined by its current state, it’s because it also depends on the condition of external factors. But is it likely that there are equations among the parameters xi, which define the system’s state, that are independent of the external bodies' states? And if we think we can find such equations in some cases, isn’t it just because we don’t fully understand the situation and the influence of these bodies is too small for us to detect in our experiments?

If the system is not regarded as completely isolated, it is probable that the rigorously exact expression of its internal energy will depend on the state of the external bodies. Again, I have above supposed the sum of the external work was null, and if we try to free ourselves from this rather artificial restriction, the enunciation becomes still more difficult.

If the system isn't seen as completely isolated, it's likely that the precise expression of its internal energy will rely on the conditions of the external bodies. I previously assumed that the total external work was zero, and if we attempt to move beyond this somewhat artificial limitation, the statement becomes even more complicated.

To formulate Mayer's principle in an absolute sense, it is therefore necessary to extend it to the whole universe, and then we find ourselves face to face with the very difficulty we sought to avoid.

To express Mayer's principle in an absolute way, we need to apply it to the entire universe, and then we encounter the very problem we were trying to escape.

In conclusion, using ordinary language, the law of the[Pg 123] conservation of energy can have only one signification, which is that there is a property common to all the possibilities; but on the determinist hypothesis there is only a single possibility, and then the law has no longer any meaning.

In conclusion, put simply, the law of the[Pg 123] conservation of energy can only mean one thing: there is a property shared by all possibilities. However, under the determinist view, there’s just one possibility, and in that case, the law becomes meaningless.

On the indeterminist hypothesis, on the contrary, it would have a meaning, even if it were taken in an absolute sense; it would appear as a limitation imposed upon freedom.

On the indeterminist hypothesis, on the other hand, it would have a meaning, even if it were taken in an absolute sense; it would seem like a limitation placed on freedom.

But this word reminds me that I am digressing and am on the point of leaving the domain of mathematics and physics. I check myself therefore and will stress of all this discussion only one impression, that Mayer's law is a form flexible enough for us to put into it almost whatever we wish. By that I do not mean it corresponds to no objective reality, nor that it reduces itself to a mere tautology, since, in each particular case, and provided one does not try to push to the absolute, it has a perfectly clear meaning.

But this word reminds me that I'm getting off track and about to leave the realm of math and physics. So I’ll rein myself in and emphasize just one takeaway from all this discussion: Mayer's law is flexible enough for us to adapt in various ways. I don’t mean to suggest it doesn’t reflect any objective reality or that it’s just a pointless repetition; in every specific instance, as long as we don’t push it to extremes, it has a perfectly clear meaning.

This flexibility is a reason for believing in its permanence, and as, on the other hand, it will disappear only to lose itself in a higher harmony, we may work with confidence, supporting ourselves upon it, certain beforehand that our labor will not be lost.

This flexibility is a reason to believe in its permanence, and since it will only fade away to become part of a greater harmony, we can work with confidence, relying on it, knowing in advance that our efforts won't be in vain.

Almost everything I have just said applies to the principle of Clausius. What distinguishes it is that it is expressed by an inequality. Perhaps it will be said it is the same with all physical laws, since their precision is always limited by errors of observation. But they at least claim to be first approximations, and it is hoped to replace them little by little by laws more and more precise. If, on the other hand, the principle of Clausius reduces to an inequality, this is not caused by the imperfection of our means of observation, but by the very nature of the question.

Almost everything I just mentioned relates to the principle of Clausius. What sets it apart is that it's stated as an inequality. Some might argue that this applies to all physical laws, since their accuracy is always affected by observational errors. However, they at least aim to be initial approximations, and there's hope to gradually replace them with more precise laws. In contrast, the principle of Clausius being an inequality isn't due to the limitations of our observational methods, but rather it stems from the inherent nature of the issue itself.

General Conclusions on Section Three

The principles of mechanics, then, present themselves to us under two different aspects. On the one hand, they are truths founded on experiment and approximately verified so far as concerns almost isolated systems. On the other hand, they are[Pg 124] postulates applicable to the totality of the universe and regarded as rigorously true.

The principles of mechanics present themselves to us in two different ways. On one hand, they are truths based on experiments and are roughly confirmed when it comes to almost isolated systems. On the other hand, they are[Pg 124] postulates that apply to the entirety of the universe and are considered strictly true.

If these postulates possess a generality and a certainty which are lacking to the experimental verities whence they are drawn, this is because they reduce in the last analysis to a mere convention which we have the right to make, because we are certain beforehand that no experiment can ever contradict it.

If these principles have a level of generality and certainty that the experimental facts they come from lack, it's because, when you break it down, they boil down to a simple convention that we are allowed to establish, since we know in advance that no experiment can ever prove it wrong.

This convention, however, is not absolutely arbitrary; it does not spring from our caprice; we adopt it because certain experiments have shown us that it would be convenient.

This convention, however, isn't completely arbitrary; it doesn't come from our whims; we choose it because certain experiments have shown us that it would be practical.

Thus is explained how experiment could make the principles of mechanics, and yet why it can not overturn them.

This explains how experiments can establish the principles of mechanics, and yet why they cannot change them.

Compare with geometry: The fundamental propositions of geometry, as for instance Euclid's postulate, are nothing more than conventions, and it is just as unreasonable to inquire whether they are true or false as to ask whether the metric system is true or false.

Compare with geometry: The basic statements of geometry, like Euclid's postulate, are just conventions, and it's just as unreasonable to question whether they are true or false as it is to ask whether the metric system is true or false.

Only, these conventions are convenient, and it is certain experiments which have taught us that.

Only, these conventions are convenient, and certain experiments have shown us that.

At first blush, the analogy is complete; the rôle of experiment seems the same. One will therefore be tempted to say: Either mechanics must be regarded as an experimental science, and then the same must hold for geometry; or else, on the contrary, geometry is a deductive science, and then one may say as much of mechanics.

At first glance, the comparison seems perfect; the role of experiment appears to be the same. One might be tempted to say: Either mechanics should be seen as an experimental science, in which case geometry must be viewed the same way; or, on the other hand, if geometry is a deductive science, then the same could be said for mechanics.

Such a conclusion would be illegitimate. The experiments which have led us to adopt as more convenient the fundamental conventions of geometry bear on objects which have nothing in common with those geometry studies; they bear on the properties of solid bodies, on the rectilinear propagation of light. They are experiments of mechanics, experiments of optics; they can not in any way be regarded as experiments of geometry. And even the principal reason why our geometry seems convenient to us is that the different parts of our body, our eye, our limbs, have the properties of solid bodies. On this account, our fundamental experiments are preeminently physiological experiments, which bear, not on space which is the object the geometer must[Pg 125] study, but on his body, that is to say, on the instrument he must use for this study.

Such a conclusion would be invalid. The experiments that have led us to find the basic principles of geometry more practical are focused on objects that have nothing in common with what geometry studies; they concern the properties of solid bodies and the straight-line propagation of light. These are experiments in mechanics and optics; they should not be considered experiments in geometry. Moreover, the main reason our geometry seems useful to us is that the various parts of our body, like our eyes and limbs, share the properties of solid bodies. Therefore, our fundamental experiments are primarily physiological experiments related not to the space that a geometer must study, but to his body, which is the tool he needs for this study.

On the contrary, the fundamental conventions of mechanics, and the experiments which prove to us that they are convenient, bear on exactly the same objects or on analogous objects. The conventional and general principles are the natural and direct generalization of the experimental and particular principles.

On the other hand, the basic rules of mechanics and the experiments that show us they are useful focus on the same objects or similar ones. The conventional and general principles are the natural and straightforward conclusions drawn from the experimental and specific principles.

Let it not be said that thus I trace artificial frontiers between the sciences; that if I separate by a barrier geometry properly so called from the study of solid bodies, I could just as well erect one between experimental mechanics and the conventional mechanics of the general principles. In fact, who does not see that in separating these two sciences I mutilate them both, and that what will remain of conventional mechanics when it shall be isolated will be only a very small thing and can in no way be compared to that superb body of doctrine called geometry?

Let’s not say that I'm creating artificial boundaries between the sciences; that if I separate geometry from the study of solid objects, I could easily create a divide between experimental mechanics and the standard mechanics based on general principles. In reality, who doesn’t see that by separating these two fields, I am diminishing both? What remains of conventional mechanics once it is isolated will be just a tiny fraction and cannot be compared to the magnificent body of knowledge that is geometry.

One sees now why the teaching of mechanics should remain experimental.

One can see now why the teaching of mechanics should stay hands-on.

Only thus can it make us comprehend the genesis of the science, and that is indispensable for the complete understanding of the science itself.

Only this way can we understand the origins of the science, and that is essential for fully grasping the science itself.

Besides, if we study mechanics, it is to apply it; and we can apply it only if it remains objective. Now, as we have seen, what the principles gain in generality and certainty they lose in objectivity. It is, therefore, above all with the objective side of the principles that we must be familiarized early, and that can be done only by going from the particular to the general, instead of the inverse.

Besides, if we study mechanics, it’s to put it to use; and we can only do that if it stays objective. Now, as we've seen, the more general and certain the principles become, the less objective they are. So, it’s essential to first get familiar with the objective aspect of these principles, and we can only achieve that by moving from the specific to the general, rather than the other way around.

The principles are conventions and disguised definitions. Yet they are drawn from experimental laws; these laws have, so to speak, been exalted into principles to which our mind attributes an absolute value.

The principles are conventions and hidden definitions. However, they come from experimental laws; these laws have, so to speak, been elevated to principles that our minds assign an absolute value to.

Some philosophers have generalized too far; they believed the principles were the whole science and consequently that the whole science was conventional.

Some philosophers have overgeneralized; they thought the principles were the entirety of the science and therefore that the entire field was just a matter of convention.

This paradoxical doctrine, called nominalism, will not bear examination.[Pg 126]

This contradictory belief, known as nominalism, doesn't hold up under scrutiny.[Pg 126]

How can a law become a principle? It expressed a relation between two real terms A and B. But it was not rigorously true, it was only approximate. We introduce arbitrarily an intermediary term C more or less fictitious, and C is by definition that which has with A exactly the relation expressed by the law.

How can a law turn into a principle? It shows a relationship between two real terms A and B. But it wasn't completely accurate; it was just an approximation. We randomly introduce an intermediary term C, which is somewhat imaginary, and C is by definition the one that has exactly the relationship with A as expressed by the law.

Then our law is separated into an absolute and rigorous principle which expresses the relation of A to C and an experimental law, approximate and subject to revision, which expresses the relation of C to B. It is clear that, however far this partition is pushed, some laws will always be left remaining.

Then our law is divided into a strict and definitive principle that shows the relationship between A and C, and an experimental law that is approximate and open to change, which describes the relationship between C and B. It's evident that no matter how much we break this down, there will always be some laws that remain.

We go to enter now the domain of laws properly so called.

We are now entering the realm of actual laws.


PART IV

NATURE


CHAPTER IX

Physics Hypotheses

The Rôle of Experiment and Generalization.—Experiment is the sole source of truth. It alone can teach us anything new; it alone can give us certainty. These are two points that can not be questioned.

The Role of Experimentation and Generalization.—Experiment is the only source of truth. It’s the only way we can learn anything new; it’s the only way we can achieve certainty. These are two points that cannot be questioned.

But then, if experiment is everything, what place will remain for mathematical physics? What has experimental physics to do with such an aid, one which seems useless and perhaps even dangerous?

But then, if experiments are all that matter, what role will mathematical physics play? What relevance does experimental physics have for such a tool, which appears to be useless and maybe even risky?

And yet mathematical physics exists, and has done unquestionable service. We have here a fact that must be explained.

And yet mathematical physics exists and has undeniably provided valuable contributions. We have a fact here that needs explaining.

The explanation is that merely to observe is not enough. We must use our observations, and to do that we must generalize. This is what men always have done; only as the memory of past errors has made them more and more careful, they have observed more and more, and generalized less and less.

The explanation is that just observing isn't enough. We need to use our observations, and to do that, we must generalize. This is what people have always done; however, as the memory of past mistakes has made them increasingly careful, they have observed more and more but generalized less and less.

Every age has ridiculed the one before it, and accused it of having generalized too quickly and too naïvely. Descartes pitied the Ionians; Descartes, in his turn, makes us smile. No doubt our children will some day laugh at us.

Every generation has laughed at the one before it, claiming it generalized too quickly and too naively. Descartes felt sorry for the Ionians; now Descartes makes us chuckle. No doubt our kids will one day laugh at us.

But can we not then pass over immediately to the goal? Is not this the means of escaping the ridicule that we foresee? Can we not be content with just the bare experiment?

But can’t we just go straight to the goal? Isn’t this the way to avoid the ridicule we expect? Can’t we be satisfied with just the basic attempt?

No, that is impossible; it would be to mistake utterly the true nature of science. The scientist must set in order. Science is built up with facts, as a house is with stones. But a collection of facts is no more a science than a heap of stones is a house.[Pg 128]

No, that's impossible; it completely misunderstands the true nature of science. A scientist must organize their findings. Science is constructed with facts, just as a house is built with stones. But a collection of facts is no more a science than a pile of stones is a house.[Pg 128]

And above all the scientist must foresee. Carlyle has somewhere said something like this: "Nothing but facts are of importance. John Lackland passed by here. Here is something that is admirable. Here is a reality for which I would give all the theories in the world." Carlyle was a fellow countryman of Bacon; but Bacon would not have said that. That is the language of the historian. The physicist would say rather: "John Lackland passed by here; that makes no difference to me, for he never will pass this way again."

And above all, the scientist must anticipate. Carlyle once said something like this: "Only facts matter. John Lackland came through here. That’s something impressive. Here’s a reality I’d trade all the theories in the world for." Carlyle was from the same country as Bacon, but Bacon wouldn’t have put it that way. That’s the historian's perspective. The physicist would say instead: "John Lackland came through here; that doesn’t matter to me, because he’ll never come this way again."

We all know that there are good experiments and poor ones. The latter will accumulate in vain; though one may have made a hundred or a thousand, a single piece of work by a true master, by a Pasteur, for example, will suffice to tumble them into oblivion. Bacon would have well understood this; it is he who invented the phrase Experimentum crucis. But Carlyle would not have understood it. A fact is a fact. A pupil has read a certain number on his thermometer; he has taken no precaution; no matter, he has read it, and if it is only the fact that counts, here is a reality of the same rank as the peregrinations of King John Lackland. Why is the fact that this pupil has made this reading of no interest, while the fact that a skilled physicist had made another reading might be on the contrary very important? It is because from the first reading we could not infer anything. What then is a good experiment? It is that which informs us of something besides an isolated fact; it is that which enables us to foresee, that is, that which enables us to generalize.

We all understand that some experiments are valuable while others are not. The less valuable ones will pile up uselessly; even if someone conducts a hundred or a thousand of them, a single piece of work by a true master, like Pasteur, for instance, can overshadow them all. Bacon would have recognized this; he coined the term Experimentum crucis. But Carlyle wouldn’t have gotten it. A fact is a fact. A student read a certain number on his thermometer; he took no precautions; it doesn't matter, he recorded it, and if only the fact matters, this reality is on the same level as the travels of King John Lackland. Why is it that the fact that this student made this reading is insignificant, while the fact that an experienced physicist made another reading could be very important? It’s because from the first reading, we can’t infer anything. So, what defines a good experiment? It’s one that tells us something beyond a mere isolated fact; it’s one that allows us to predict, in other words, one that enables us to generalize.

For without generalization foreknowledge is impossible. The circumstances under which one has worked will never reproduce themselves all at once. The observed action then will never recur; the only thing that can be affirmed is that under analogous circumstances an analogous action will be produced. In order to foresee, then, it is necessary to invoke at least analogy, that is to say, already then to generalize.

For without generalization, predicting the future is impossible. The specific circumstances in which someone has worked will never come together exactly the same way again. The observed action will never happen again; all that can be stated is that under similar circumstances, a similar action will occur. Therefore, to anticipate, it is essential to rely at least on analogy, meaning one must generalize even at that point.

No matter how timid one may be, still it is necessary to interpolate. Experiment gives us only a certain number of isolated points. We must unite these by a continuous line. This is a veritable generalization. But we do more; the curve that we shall trace will pass between the observed points and near these points;[Pg 129] it will not pass through these points themselves. Thus one does not restrict himself to generalizing the experiments, but corrects them; and the physicist who should try to abstain from these corrections and really be content with the bare experiment, would be forced to enunciate some very strange laws.

No matter how shy someone might be, it's still necessary to fill in the gaps. Experiments give us only a limited number of separate data points. We need to connect these with a smooth line. This is true generalization. But we do even more; the curve we draw will sit between the observed points and close to them; it won't go through the points themselves. So rather than just generalizing the experiments, we also refine them. A physicist who tries to avoid these corrections and simply stick with the raw data would end up stating some very odd laws.[Pg 129]

The bare facts, then, would not be enough for us; and that is why we must have science ordered, or rather organized.

The simple facts alone wouldn't be sufficient for us; that's why we need science to be structured, or rather organized.

It is often said experiments must be made without a preconceived idea. That is impossible. Not only would it make all experiment barren, but that would be attempted which could not be done. Every one carries in his mind his own conception of the world, of which he can not so easily rid himself. We must, for instance, use language; and our language is made up only of preconceived ideas and can not be otherwise. Only these are unconscious preconceived ideas, a thousand times more dangerous than the others.

It’s often said that experiments should be conducted without any preconceived notions. That’s impossible. Not only would it make every experiment pointless, but it would also attempt something that can’t be done. Everyone has their own understanding of the world in their mind, which is not easy to shake off. For example, we have to use language; and our language consists only of preconceived ideas and cannot be otherwise. These are simply unconscious preconceived ideas, which are a thousand times more dangerous than the others.

Shall we say that if we introduce others, of which we are fully conscious, we shall only aggravate the evil? I think not. I believe rather that they will serve as counterbalances to each other—I was going to say as antidotes; they will in general accord ill with one another—they will come into conflict with one another, and thereby force us to regard things under different aspects. This is enough to emancipate us. He is no longer a slave who can choose his master.

Shall we say that if we introduce others, of which we are fully conscious, we will only make the situation worse? I think not. I believe instead that they will balance each other out—I was going to say act as antidotes; they will generally be in disharmony with one another—they will clash, and in doing so, compel us to see things from different perspectives. This is enough to free us. He is no longer a slave who can choose his master.

Thus, thanks to generalization, each fact observed enables us to foresee a great many others; only we must not forget that the first alone is certain, that all others are merely probable. No matter how solidly founded a prediction may appear to us, we are never absolutely sure that experiment will not contradict it, if we undertake to verify it. The probability, however, is often so great that practically we may be content with it. It is far better to foresee even without certainty than not to foresee at all.

Thus, thanks to generalization, every observed fact allows us to anticipate many others; we just have to remember that the first one is the only certain fact, and all the others are just likely. No matter how solid a prediction seems to us, we can never be absolutely sure that testing it won’t prove us wrong. However, the likelihood is often so high that, in practice, we can be satisfied with it. It's far better to predict even without certainty than to not predict at all.

One must, then, never disdain to make a verification when opportunity offers. But all experiment is long and difficult; the workers are few; and the number of facts that we need to foresee is immense. Compared with this mass the number of direct verifications that we can make will never be anything but a negligible quantity.[Pg 130]

One should never hesitate to verify when the chance arises. However, conducting experiments is time-consuming and challenging; there aren't many people working on it, and the number of facts we need to anticipate is huge. In comparison to this vast amount, the number of direct verifications we can perform will always be just a tiny fraction.[Pg 130]

Of this few that we can directly attain, we must make the best use; it is very necessary to get from every experiment the greatest possible number of predictions, and with the highest possible degree of probability. The problem is, so to speak, to increase the yield of the scientific machine.

Of the few things we can directly achieve, we must make the most of them; it’s essential to get the greatest number of predictions from each experiment, with the highest degree of certainty. The challenge is essentially to boost the output of the scientific process.

Let us compare science to a library that ought to grow continually. The librarian has at his disposal for his purchases only insufficient funds. He ought to make an effort not to waste them.

Let’s compare science to a library that should keep expanding. The librarian has limited funds for buying new materials. He needs to make sure not to waste them.

It is experimental physics that is entrusted with the purchases. It alone, then, can enrich the library.

It’s experimental physics that is responsible for the purchases. It alone can enhance the library.

As for mathematical physics, its task will be to make out the catalogue. If the catalogue is well made, the library will not be any richer, but the reader will be helped to use its riches.

As for mathematical physics, its job will be to create the catalog. If the catalog is well done, the library won't be any richer, but it will help the reader utilize its resources.

And even by showing the librarian the gaps in his collections, it will enable him to make a judicious use of his funds; which is all the more important because these funds are entirely inadequate.

And by pointing out the gaps in his collections to the librarian, it will help him use his budget wisely; this is even more crucial since these funds are completely insufficient.

Such, then, is the rôle of mathematical physics. It must direct generalization in such a manner as to increase what I just now called the yield of science. By what means it can arrive at this, and how it can do it without danger, is what remains for us to investigate.

Such is the role of mathematical physics. It must guide generalization to enhance what I just referred to as the output of science. The methods it can use to achieve this, and how it can do so safely, is what we need to explore.

The Unity of Nature.—Let us notice, first of all, that every generalization implies in some measure the belief in the unity and simplicity of nature. As to the unity there can be no difficulty. If the different parts of the universe were not like the members of one body, they would not act on one another, they would know nothing of one another; and we in particular would know only one of these parts. We do not ask, then, if nature is one, but how it is one.

The Oneness of Nature.—First of all, let’s recognize that every generalization relies, to some extent, on believing in the unity and simplicity of nature. There’s no doubt about the unity. If the various parts of the universe weren’t like the members of a single body, they wouldn’t interact with each other, and they wouldn’t be aware of one another; and we, in particular, would only know one of these parts. So, we don't question whether nature is unified, but rather how it is unified.

As for the second point, that is not such an easy matter. It is not certain that nature is simple. Can we without danger act as if it were?

As for the second point, that’s not an easy issue. It’s not clear that nature is straightforward. Can we safely treat it as if it were?

There was a time when the simplicity of Mariotte's law was an argument invoked in favor of its accuracy; when Fresnel himself, after having said in a conversation with Laplace that nature was not concerned about analytical difficulties, felt himself obliged to make explanations, in order not to strike too hard at prevailing opinion.[Pg 131]

There was a time when the simplicity of Mariotte's law was used as a reason to support its accuracy; when Fresnel himself, after mentioning in a conversation with Laplace that nature didn't care about analytical challenges, felt he needed to provide explanations so as not to offend popular opinion.[Pg 131]

To-day ideas have greatly changed; and yet, those who do not believe that natural laws have to be simple, are still often obliged to act as if they did. They could not entirely avoid this necessity without making impossible all generalization, and consequently all science.

Today, ideas have changed a lot; however, those who don’t believe that natural laws need to be simple still often have to act as if they do. They can’t completely avoid this necessity without making generalization, and therefore all science, impossible.

It is clear that any fact can be generalized in an infinity of ways, and it is a question of choice. The choice can be guided only by considerations of simplicity. Let us take the most commonplace case, that of interpolation. We pass a continuous line, as regular as possible, between the points given by observation. Why do we avoid points making angles and too abrupt turns? Why do we not make our curve describe the most capricious zig-zags? It is because we know beforehand, or believe we know, that the law to be expressed can not be so complicated as all that.

It’s obvious that any fact can be generalized in countless ways, and it comes down to choice. That choice should be influenced solely by simplicity. Let’s consider the most basic example: interpolation. We draw a smooth line as consistently as possible between the points observed. Why do we avoid creating angles and sharp turns? Why don’t we let our curve take on the most unpredictable zig-zags? It’s because we know, or think we know, that the law we’re trying to express can’t be that complicated.

We may calculate the mass of Jupiter from either the movements of its satellites, or the perturbations of the major planets, or those of the minor planets. If we take the averages of the determinations obtained by these three methods, we find three numbers very close together, but different. We might interpret this result by supposing that the coefficient of gravitation is not the same in the three cases. The observations would certainly be much better represented. Why do we reject this interpretation? Not because it is absurd, but because it is needlessly complicated. We shall only accept it when we are forced to, and that is not yet.

We can determine Jupiter's mass by analyzing the movements of its moons, the gravitational effects on the major planets, or those on the minor planets. If we average the results from these three methods, we get three numbers that are very close but still different. One way to explain this difference might be to assume that the gravitational constant varies in these cases. This would definitely make the observations fit better. So why do we dismiss this explanation? Not because it’s ridiculous, but because it complicates things unnecessarily. We'll only accept it if we absolutely have to, and we're not there yet.

To sum up, ordinarily every law is held to be simple till the contrary is proved.

To sum up, generally every law is considered straightforward until proven otherwise.

This custom is imposed upon physicists by the causes that I have just explained. But how shall we justify it in the presence of discoveries that show us every day new details that are richer and more complex? How shall we even reconcile it with the belief in the unity of nature? For if everything depends on everything, relationships where so many diverse factors enter can no longer be simple.

This practice is enforced on physicists by the reasons I've just outlined. But how can we justify it in light of discoveries that reveal new details daily that are richer and more complex? How can we even align it with the belief in the unity of nature? Because if everything relies on everything else, relationships involving so many different factors can no longer be straightforward.

If we study the history of science, we see happen two inverse phenomena, so to speak. Sometimes simplicity hides under complex appearances; sometimes it is the simplicity which is apparent, and which disguises extremely complicated realities.

If we look at the history of science, we notice two opposite phenomena, so to speak. Sometimes, simplicity is concealed beneath complex appearances; other times, it’s the apparent simplicity that hides very complicated realities.

What is more complicated than the confused movements of[Pg 132] the planets? What simpler than Newton's law? Here nature, making sport, as Fresnel said, of analytical difficulties, employs only simple means, and by combining them produces I know not what inextricable tangle. Here it is the hidden simplicity which must be discovered.

What’s more complicated than the chaotic movements of[Pg 132] the planets? What could be simpler than Newton's law? Here, nature, as Fresnel mentioned, plays around with analytical challenges, using only straightforward methods, and by mixing them together creates a tangled mess that I can't quite figure out. It's the hidden simplicity that needs to be uncovered.

Examples of the opposite abound. In the kinetic theory of gases, one deals with molecules moving with great velocities, whose paths, altered by incessant collisions, have the most capricious forms and traverse space in every direction. The observable result is Mariotte's simple law. Every individual fact was complicated. The law of great numbers has reestablished simplicity in the average. Here the simplicity is merely apparent, and only the coarseness of our senses prevents our perceiving the complexity.

Examples of the opposite are everywhere. In the kinetic theory of gases, we deal with molecules that move at high speeds, whose paths, constantly changed by endless collisions, take on the most unpredictable shapes and travel through space in all directions. The observable outcome is Mariotte's straightforward law. Every single fact is complicated. The law of large numbers has restored simplicity in the average. Here, the simplicity is only apparent, and it's just the limitations of our senses that keep us from seeing the complexity.

Many phenomena obey a law of proportionality. But why? Because in these phenomena there is something very small. The simple law observed, then, is only a result of the general analytical rule that the infinitely small increment of a function is proportional to the increment of the variable. As in reality our increments are not infinitely small, but very small, the law of proportionality is only approximate, and the simplicity is only apparent. What I have just said applies to the rule of the superposition of small motions, the use of which is so fruitful, and which is the basis of optics.

Many phenomena follow a law of proportionality. But why? Because in these phenomena, there’s something extremely small. The simple law observed is just a result of the general analytical rule that the infinitely small change of a function is proportional to the change of the variable. In reality, our changes aren’t infinitely small, but very small, so the law of proportionality is only approximate, and the simplicity is just an illusion. What I just mentioned applies to the rule of superposition of small movements, which is very useful and forms the basis of optics.

And Newton's law itself? Its simplicity, so long undetected, is perhaps only apparent. Who knows whether it is not due to some complicated mechanism, to the impact of some subtile matter animated by irregular movements, and whether it has not become simple only through the action of averages and of great numbers? In any case, it is difficult not to suppose that the true law contains complementary terms, which would become sensible at small distances. If in astronomy they are negligible as modifying Newton's law, and if the law thus regains its simplicity, it would be only because of the immensity of celestial distances.

And Newton's law itself? Its simplicity, which went unnoticed for so long, might only seem simple. Who knows if it's actually due to some complex mechanism, influenced by some subtle matter moving in erratic ways, and whether it only appears simple because of averages and large numbers? In any case, it's hard not to think that the true law includes additional factors that would be noticeable at short distances. If those factors are insignificant in astronomy when it comes to changing Newton's law, and if the law appears simple again, it’s only because of the vastness of cosmic distances.

No doubt, if our means of investigation should become more and more penetrating, we should discover the simple under the complex, then the complex under the simple, then again the simple under the complex, and so on, without our being able to foresee what will be the last term.[Pg 133]

No doubt, if our methods of investigation keep getting deeper, we'll find the simple within the complex, then the complex within the simple, and then back to the simple within the complex, and so on, without being able to predict what the final outcome will be.[Pg 133]

We must stop somewhere, and that science may be possible we must stop when we have found simplicity. This is the only ground on which we can rear the edifice of our generalizations. But this simplicity being only apparent, will the ground be firm enough? This is what must be investigated.

We need to set a limit, and for science to be achievable, we have to stop when we discover simplicity. This is the only foundation on which we can build our theories. However, since this simplicity is just an illusion, will the foundation be solid enough? This is what we need to explore.

For that purpose, let us see what part is played in our generalizations by the belief in simplicity. We have verified a simple law in a good many particular cases; we refuse to admit that this agreement, so often repeated, is simply the result of chance, and conclude that the law must be true in the general case.

For that purpose, let's examine the role that the belief in simplicity plays in our generalizations. We've confirmed a simple principle in several specific instances; we won't accept that this recurring agreement is merely a coincidence, and we conclude that the principle must hold true in general.

Kepler notices that a planet's positions, as observed by Tycho, are all on one ellipse. Never for a moment does he have the thought that by a strange play of chance Tycho never observed the heavens except at a moment when the real orbit of the planet happened to cut this ellipse.

Kepler realizes that the positions of a planet, as seen by Tycho, all fall on a single ellipse. He never considers for a moment that it could just be a coincidence that Tycho only observed the sky at times when the actual orbit of the planet happened to intersect this ellipse.

What does it matter then whether the simplicity be real, or whether it covers a complex reality? Whether it is due to the influence of great numbers, which levels down individual differences, or to the greatness or smallness of certain quantities, which allows us to neglect certain terms, in no case is it due to chance. This simplicity, real or apparent, always has a cause. We can always follow, then, the same course of reasoning, and if a simple law has been observed in several particular cases, we can legitimately suppose that it will still be true in analogous cases. To refuse to do this would be to attribute to chance an inadmissible rôle.

What does it matter whether the simplicity is genuine or just hides a complex reality? Whether it comes from the influence of a large number of instances that smooth out individual differences, or from the size of certain quantities that lets us overlook some factors, it never happens by chance. This simplicity, whether real or perceived, always has a reason behind it. We can always follow the same line of reasoning, and if a simple law has been noticed in multiple specific situations, we can reasonably assume it will still hold true in similar situations. To refuse to do so would be to give chance an unacceptable role.

There is, however, a difference. If the simplicity were real and essential, it would resist the increasing precision of our means of measure. If then we believe nature to be essentially simple, we must, from a simplicity that is approximate, infer a simplicity that is rigorous. This is what was done formerly; and this is what we no longer have a right to do.

There is, however, a difference. If the simplicity were genuine and fundamental, it would stand up against the growing accuracy of our measurement tools. So, if we think of nature as fundamentally simple, we must, from an approximate simplicity, conclude a more precise simplicity. This is what was done in the past; and this is what we can no longer assume.

The simplicity of Kepler's laws, for example, is only apparent. That does not prevent their being applicable, very nearly, to all systems analogous to the solar system; but it does prevent their being rigorously exact.

The simplicity of Kepler's laws, for example, is just an illusion. That doesn't stop them from being mostly applicable to all systems similar to the solar system; however, it does mean they aren't strictly accurate.

The Rôle of Hypothesis.—All generalization is a hypothesis. Hypothesis, then, has a necessary rôle that no one has ever[Pg 134] contested. Only, it ought always, as soon as possible and as often as possible, to be subjected to verification. And, of course, if it does not stand this test, it ought to be abandoned without reserve. This is what we generally do, but sometimes with rather an ill humor.

The Role of a Hypothesis.—Every generalization is a hypothesis. Hypothesis, therefore, plays a necessary role that no one has ever[Pg 134] contested. However, it should always, as soon as possible and as often as possible, be put to the test. And, of course, if it doesn’t hold up, it should be discarded without hesitation. This is generally what we do, though sometimes we do so with a bit of frustration.

Well, even this ill humor is not justified. The physicist who has just renounced one of his hypotheses ought, on the contrary, to be full of joy; for he has found an unexpected opportunity for discovery. His hypothesis, I imagine, had not been adopted without consideration; it took account of all the known factors that it seemed could enter into the phenomenon. If the test does not support it, it is because there is something unexpected and extraordinary; and because there is going to be something found that is unknown and new.

Well, even this bad mood isn't justified. The physicist who has just rejected one of his hypotheses should actually be filled with joy; he's discovered an unexpected chance for discovery. His hypothesis, I assume, wasn't accepted lightly; it considered all the known factors that could be involved in the phenomenon. If the test doesn't support it, it's because there is something unexpected and extraordinary at play; and because something unknown and new is about to be uncovered.

Has the discarded hypothesis, then, been barren? Far from that, it may be said it has rendered more service than a true hypothesis. Not only has it been the occasion of the decisive experiment, but, without having made the hypothesis, the experiment would have been made by chance, so that nothing would have been derived from it. One would have seen nothing extraordinary; only one fact the more would have been catalogued without deducing from it the least consequence.

Has the discarded hypothesis been useless? Not at all; in fact, it might be said to have been more helpful than a valid hypothesis. It has not only prompted the key experiment, but without the hypothesis, the experiment would have been conducted by chance, resulting in no valuable insights. We would have observed nothing remarkable; just one more fact would have been recorded without any conclusions drawn from it.

Now on what condition is the use of hypothesis without danger?

Now, under what conditions can we use a hypothesis safely?

The firm determination to submit to experiment is not enough; there are still dangerous hypotheses; first, and above all, those which are tacit and unconscious. Since we make them without knowing it, we are powerless to abandon them. Here again, then, is a service that mathematical physics can render us. By the precision that is characteristic of it, it compels us to formulate all the hypotheses that we should make without it, but unconsciously.

The strong commitment to embrace experimentation isn’t enough; there are still risky assumptions, especially those that are unspoken and unaware. Since we create them without realizing it, we can’t easily let them go. Once again, mathematical physics can help us. Its emphasis on precision forces us to clearly outline all the assumptions we might make without realizing it.

Let us notice besides that it is important not to multiply hypotheses beyond measure, and to make them only one after the other. If we construct a theory based on a number of hypotheses, and if experiment condemns it, which of our premises is it necessary to change? It will be impossible to know. And inversely, if the experiment succeeds, shall we believe that we have[Pg 135] demonstrated all the hypotheses at once? Shall we believe that with one single equation we have determined several unknowns?

Let’s also note that it’s important not to create too many hypotheses at once and to build them one at a time. If we develop a theory based on multiple hypotheses and experiments prove it wrong, how will we know which premise needs to be changed? It’ll be impossible to figure out. Conversely, if the experiment works, should we think that we’ve proven all the hypotheses at once? Should we believe that with just one equation, we’ve solved several unknowns?

We must equally take care to distinguish between the different kinds of hypotheses. There are first those which are perfectly natural and from which one can scarcely escape. It is difficult not to suppose that the influence of bodies very remote is quite negligible, that small movements follow a linear law, that the effect is a continuous function of its cause. I will say as much of the conditions imposed by symmetry. All these hypotheses form, as it were, the common basis of all the theories of mathematical physics. They are the last that ought to be abandoned.

We also need to be careful to distinguish between the different types of hypotheses. First, there are those that are completely natural and hard to avoid. It’s tough not to think that the influence of very distant bodies is pretty much negligible, that small movements follow a straight-line pattern, and that the effect is a continuous function of its cause. The same goes for the conditions set by symmetry. All of these hypotheses essentially make up the common foundation of all theories in mathematical physics. They are the last ones that should be discarded.

There is a second class of hypotheses, that I shall term neutral. In most questions the analyst assumes at the beginning of his calculations either that matter is continuous or, on the contrary, that it is formed of atoms. He might have made the opposite assumption without changing his results. He would only have had more trouble to obtain them; that is all. If, then, experiment confirms his conclusions, will he think that he has demonstrated, for instance, the real existence of atoms?

There is a second category of hypotheses that I'll call neutral. In most cases, the analyst starts their calculations by either assuming that matter is continuous or, conversely, that it consists of atoms. They could have taken the opposite assumption without affecting their results; it would just have required more effort to reach them, that's all. So, if experiments confirm their conclusions, will they believe they have proved, for example, the actual existence of atoms?

In optical theories two vectors are introduced, of which one is regarded as a velocity, the other as a vortex. Here again is a neutral hypothesis, since the same conclusions would have been reached by taking precisely the opposite. The success of the experiment, then, can not prove that the first vector is indeed a velocity; it can only prove one thing, that it is a vector. This is the only hypothesis that has really been introduced in the premises. In order to give it that concrete appearance which the weakness of our minds requires, it has been necessary to consider it either as a velocity or as a vortex, in the same way that it has been necessary to represent it by a letter, either x or y. The result, however, whatever it may be, will not prove that it was right or wrong to regard it as a velocity any more than it will prove that it was right or wrong to call it x and not y.

In optical theories, two vectors are introduced, one considered a velocity and the other a vortex. This is yet another neutral hypothesis since the same conclusions could have been drawn by taking exactly the opposite view. The success of the experiment cannot prove that the first vector is actually a velocity; it can only prove one thing: that it is a vector. This is the only hypothesis that has truly been established in the premises. To give it that concrete feel that our minds often require, it has been necessary to think of it either as a velocity or a vortex, just as it has been necessary to represent it with a letter, either x or y. However, the result, whatever it may be, will not prove whether it was right or wrong to consider it a velocity any more than it will prove whether it was right or wrong to label it x instead of y.

These neutral hypotheses are never dangerous, if only their character is not misunderstood. They may be useful, either as devices for computation, or to aid our understanding by concrete images, to fix our ideas as the saying is. There is, then, no occasion to exclude them.[Pg 136]

These neutral hypotheses are never harmful, as long as their nature is understood correctly. They can be helpful, either as tools for calculation or to improve our understanding through clear examples, helping us to clarify our thoughts, as the saying goes. Therefore, there’s no reason to dismiss them.[Pg 136]

The hypotheses of the third class are the real generalizations. They are the ones that experiment must confirm or invalidate. Whether verified or condemned, they will always be fruitful. But for the reasons that I have set forth, they will only be fruitful if they are not too numerous.

The hypotheses of the third class are the true generalizations. They are the ones that experiments must confirm or disprove. Whether they are supported or rejected, they will always be valuable. However, for the reasons I have mentioned, they will only be valuable if there aren't too many of them.

Origin of Mathematical Physics.—Let us penetrate further, and study more closely the conditions that have permitted the development of mathematical physics. We observe at once that the efforts of scientists have always aimed to resolve the complex phenomenon directly given by experiment into a very large number of elementary phenomena.

Origin of Math Physics.—Let’s dive deeper and take a closer look at the factors that have allowed mathematical physics to develop. It’s clear that scientists have always tried to break down the complex phenomena seen in experiments into many simpler elements.

This is done in three different ways: first, in time. Instead of embracing in its entirety the progressive development of a phenomenon, the aim is simply to connect each instant with the instant immediately preceding it. It is admitted that the actual state of the world depends only on the immediate past, without being directly influenced, so to speak, by the memory of a distant past. Thanks to this postulate, instead of studying directly the whole succession of phenomena, it is possible to confine ourselves to writing its 'differential equation.' For Kepler's laws we substitute Newton's law.

This is done in three different ways: first, in time. Instead of looking at the entire gradual development of a phenomenon, the goal is simply to link each moment with the moment right before it. It's acknowledged that the current state of the world is influenced only by the recent past, without being directly affected, so to speak, by the memories of a faraway past. Thanks to this assumption, instead of studying the entire sequence of phenomena directly, we can focus on writing its 'differential equation.' We replace Kepler's laws with Newton's law.

Next we try to analyze the phenomenon in space. What experiment gives us is a confused mass of facts presented on a stage of considerable extent. We must try to discover the elementary phenomenon, which will be, on the contrary, localized in a very small region of space.

Next, we attempt to analyze the phenomenon in space. What the experiment gives us is a chaotic mix of facts presented on a fairly large stage. We need to try to uncover the fundamental phenomenon, which will, on the other hand, be concentrated in a very small area of space.

Some examples will perhaps make my thought better understood. If we wished to study in all its complexity the distribution of temperature in a cooling solid, we should never succeed. Everything becomes simple if we reflect that one point of the solid can not give up its heat directly to a distant point; it will give up its heat only to the points in the immediate neighborhood, and it is by degrees that the flow of heat can reach other parts of the solid. The elementary phenomenon is the exchange of heat between two contiguous points. It is strictly localized, and is relatively simple, if we admit, as is natural, that it is not influenced by the temperature of molecules whose distance is sensible.[Pg 137]

Some examples might help clarify my point. If we wanted to study the distribution of temperature in a cooling solid in all its complexity, we would never succeed. Everything becomes clearer when we realize that one point in the solid can't directly transfer its heat to a far-off point; it only transfers heat to the points right next to it, and gradually, the heat flow can reach other areas of the solid. The basic phenomenon is the exchange of heat between two neighboring points. It's very localized, and it's fairly straightforward if we accept, as is reasonable, that it's not affected by the temperature of molecules that are significantly far away.[Pg 137]

I bend a rod. It is going to take a very complicated form, the direct study of which would be impossible. But I shall be able, however, to attack it, if I observe that its flexure is a result only of the deformation of the very small elements of the rod, and that the deformation of each of these elements depends only on the forces that are directly applied to it, and not at all on those which may act on the other elements.

I bend a rod. It's going to take a very complex shape, the direct analysis of which would be impossible. However, I can approach it by observing that its bending is simply due to the deformation of the tiny elements of the rod, and that the deformation of each of these elements depends only on the forces directly applied to it, and not at all on the forces acting on the other elements.

In all these examples, which I might easily multiply, we admit that there is no action at a distance, or at least at a great distance. This is a hypothesis. It is not always true, as the law of gravitation shows us. It must, then, be submitted to verification. If it is confirmed, even approximately, it is precious, for it will enable us to make mathematical physics, at least by successive approximations.

In all these examples, which I could easily expand on, we agree that there's no action at a distance, or at least not over great distances. This is a theory. It's not always accurate, as the law of gravitation demonstrates. It should, therefore, be tested. If it's confirmed, even roughly, it’s valuable, as it will allow us to develop mathematical physics, at least through successive approximations.

If it does not stand the test, we must look for something else analogous; for there are still other means of arriving at the elementary phenomenon. If several bodies act simultaneously, it may happen that their actions are independent and are simply added to one another, either as vectors or as scalars. The elementary phenomenon is then the action of an isolated body. Or again, we have to deal with small movements, or more generally with small variations, which obey the well-known law of superposition. The observed movement will then be decomposed into simple movements, for example, sound into its harmonics, white light into its monochromatic components.

If it doesn’t hold up, we need to find something similar; there are still other ways to reach the fundamental phenomenon. When several objects act at the same time, their actions might be independent and simply combine, either as vectors or as scalars. The fundamental phenomenon is then the action of a single object. Alternatively, we may be dealing with small movements, or more generally, small variations that follow the well-known principle of superposition. The observed movement will then be broken down into simpler movements, like sound into its harmonics or white light into its monochromatic components.

When we have discovered in what direction it is advisable to look for the elementary phenomenon, by what means can we reach it?

When we've figured out the best direction to search for the basic phenomenon, how can we access it?

First of all, it will often happen that in order to detect it, or rather to detect the part of it useful to us, it will not be necessary to penetrate the mechanism; the law of great numbers will suffice.

First of all, it often happens that to detect it, or rather to find the part of it that's useful to us, it won't be necessary to dive into the mechanism; the law of large numbers will be enough.

Let us take again the instance of the propagation of heat. Every molecule emits rays toward every neighboring molecule. According to what law, we do not need to know. If we should make any supposition in regard to this, it would be a neutral hypothesis and consequently useless and incapable of verification. And, in fact, by the action of averages and thanks to the[Pg 138] symmetry of the medium, all the differences are leveled down, and whatever hypothesis may be made, the result is always the same.

Let’s reconsider the example of heat transfer. Every molecule sends out rays to every nearby molecule. We don’t need to know the specific law behind this. If we were to make any guesses about it, they would be neutral assumptions and therefore useless and impossible to verify. In reality, because of the action of averages and the symmetry of the medium, all differences are equalized, and no matter what hypothesis we propose, the outcome is always the same.

The same circumstance is presented in the theory of electricity and in that of capillarity. The neighboring molecules attract and repel one another. We do not need to know according to what law; it is enough for us that this attraction is sensible only at small distances, and that the molecules are very numerous, that the medium is symmetrical, and we shall only have to let the law of great numbers act.

The same situation occurs in the theories of electricity and capillarity. Nearby molecules attract and repel each other. We don’t need to understand the specific laws; it’s enough to know that this attraction is noticeable only at short distances, and that there are many molecules, and that the medium is uniform. We just need to let the law of large numbers take effect.

Here again the simplicity of the elementary phenomenon was hidden under the complexity of the resultant observable phenomenon; but, in its turn, this simplicity was only apparent, and concealed a very complex mechanism.

Here again, the straightforwardness of the basic phenomenon was masked by the complexity of the observable phenomenon that resulted; however, this straightforwardness was merely an illusion, hiding a very intricate mechanism.

The best means of arriving at the elementary phenomenon would evidently be experiment. We ought by experimental contrivance to dissociate the complex sheaf that nature offers to our researches, and to study with care the elements as much isolated as possible. For example, natural white light would be decomposed into monochromatic lights by the aid of the prism, and into polarized light by the aid of the polarizer.

The best way to understand the basic phenomenon is clearly through experimentation. We should use experimental tools to separate the complex group that nature presents to our investigations and carefully study the individual elements as much as possible in isolation. For instance, natural white light can be broken down into monochromatic lights with the help of a prism, and into polarized light using a polarizer.

Unfortunately that is neither always possible nor always sufficient, and sometimes the mind must outstrip experiment. I shall cite only one example, which has always struck me forcibly.

Unfortunately, that's not always possible or enough, and sometimes the mind has to go beyond what we can experiment with. I'll mention just one example that has always stuck with me.

If I decompose white light, I shall be able to isolate a small part of the spectrum, but however small it may be, it will retain a certain breadth. Likewise the natural lights, called monochromatic, give us a very narrow line, but not, however, infinitely narrow. It might be supposed that by studying experimentally the properties of these natural lights, by working with finer and finer lines of the spectrum, and by passing at last to the limit, so to speak, we should succeed in learning the properties of a light strictly monochromatic.

If I break down white light, I will be able to isolate a small part of the spectrum, but no matter how small it is, it will still have a certain width. Similarly, the natural lights called monochromatic give us a very narrow line, but not infinitely narrow. One might think that by experimentally studying the properties of these natural lights, working with finer and finer lines of the spectrum, and eventually approaching the limit, we could learn about the properties of a purely monochromatic light.

That would not be accurate. Suppose that two rays emanate from the same source, that we polarize them first in two perpendicular planes, then bring them back to the same plane of polarization, and try to make them interfere. If the light were strictly monochromatic, they would interfere. With our lights, which are nearly monochromatic, there will be no interference, and[Pg 139] that no matter how narrow the line. In order to be otherwise it would have to be several million times as narrow as the finest known lines.

That wouldn't be accurate. Imagine two rays coming from the same source. If we first polarize them in two perpendicular planes and then bring them back to the same plane of polarization, we try to make them interfere. If the light were completely monochromatic, they would interfere. However, with our lights, which are nearly monochromatic, there will be no interference, regardless of how narrow the line is. To interfere, it would need to be several million times narrower than the finest known lines.[Pg 139]

Here, then, the passage to the limit would have deceived us. The mind must outstrip the experiment, and if it has done so with success, it is because it has allowed itself to be guided by the instinct of simplicity.

Here, then, the journey to the limit would have tricked us. The mind must go beyond the experiment, and if it has done so successfully, it’s because it has let itself be led by the instinct of simplicity.

The knowledge of the elementary fact enables us to put the problem in an equation. Nothing remains but to deduce from this by combination the complex fact that can be observed and verified. This is what is called integration, and is the business of the mathematician.

The understanding of the basic fact allows us to express the problem in an equation. All that's left is to deduce the complex fact that can be observed and confirmed through combinations. This process is known as integration, and it is the task of the mathematician.

It may be asked why, in physical sciences, generalization so readily takes the mathematical form. The reason is now easy to see. It is not only because we have numerical laws to express; it is because the observable phenomenon is due to the superposition of a great number of elementary phenomena all alike. Thus quite naturally are introduced differential equations.

It might be questioned why, in the physical sciences, generalizations often appear in mathematical form. The reason is now clear. It's not just because we have numerical laws to express; it's also because the observable phenomenon results from the combination of many similar elementary phenomena all alike. This is how differential equations are introduced quite naturally.

It is not enough that each elementary phenomenon obeys simple laws; all those to be combined must obey the same law. Then only can the intervention of mathematics be of use; mathematics teaches us in fact to combine like with like. Its aim is to learn the result of a combination without needing to go over the combination piece by piece. If we have to repeat several times the same operation, it enables us to avoid this repetition by telling us in advance the result of it by a sort of induction. I have explained this above, in the chapter on mathematical reasoning.

It’s not enough for each basic phenomenon to follow simple laws; all the ones being combined must follow the same law. Only then can mathematics be helpful; it really shows us how to combine similar things. Its goal is to find out the result of a combination without having to analyze each part individually. If we need to perform the same operation multiple times, it allows us to skip that repetition by predicting the outcome through a form of induction. I explained this earlier in the chapter on mathematical reasoning.

But for this, all the operations must be alike. In the opposite case, it would evidently be necessary to resign ourselves to doing them in reality one after another, and mathematics would become useless.

But for this, all the operations must be the same. Otherwise, we would clearly have to accept that we do them one by one in reality, and mathematics would become pointless.

It is then thanks to the approximate homogeneity of the matter studied by physicists that mathematical physics could be born.

It is because of the general uniformity of the matter studied by physicists that mathematical physics was able to emerge.

In the natural sciences, we no longer find these conditions: homogeneity, relative independence of remote parts, simplicity of the elementary fact; and this is why naturalists are obliged to resort to other methods of generalization.

In the natural sciences, we no longer see these conditions: uniformity, relative independence of distant parts, simplicity of the basic fact; and this is why scientists have to turn to different methods of generalization.


CHAPTER X

Theories of Modern Physics

Meaning of Physical Theories.—The laity are struck to see how ephemeral scientific theories are. After some years of prosperity, they see them successively abandoned; they see ruins accumulate upon ruins; they foresee that the theories fashionable to-day will shortly succumb in their turn and hence they conclude that these are absolutely idle. This is what they call the bankruptcy of science.

Meaning of Physical Theories.—People are amazed at how short-lived scientific theories are. After a few years of success, they see them being replaced one after another; they watch as old theories pile up on top of each other, and they anticipate that the popular theories of today will soon fade away too, leading them to believe that these theories are completely pointless. This is what they refer to as the bankruptcy of science.

Their skepticism is superficial; they give no account to themselves of the aim and the rôle of scientific theories; otherwise they would comprehend that the ruins may still be good for something.

Their skepticism is shallow; they don’t consider the purpose and role of scientific theories; otherwise, they would understand that the ruins might still be valuable.

No theory seemed more solid than that of Fresnel which attributed light to motions of the ether. Yet now Maxwell's is preferred. Does this mean the work of Fresnel was in vain? No, because the aim of Fresnel was not to find out whether there is really an ether, whether it is or is not formed of atoms, whether these atoms really move in this or that sense; his object was to foresee optical phenomena.

No theory seemed more solid than Fresnel's, which linked light to movements of the ether. Yet now Maxwell's is favored. Does this mean Fresnel's work was pointless? No, because Fresnel's goal wasn't to determine if there is actually an ether, whether it consists of atoms, or if those atoms move in one direction or another; his aim was to predict optical phenomena.

Now, Fresnel's theory always permits of this, to-day as well as before Maxwell. The differential equations are always true; they can always be integrated by the same procedures and the results of this integration always retain their value.

Now, Fresnel's theory always allows for this, today just as it did before Maxwell. The differential equations are always valid; they can always be solved using the same methods, and the results of this solving always hold their significance.

And let no one say that thus we reduce physical theories to the rôle of mere practical recipes; these equations express relations, and if the equations remain true it is because these relations preserve their reality. They teach us, now as then, that there is such and such a relation between some thing and some other thing; only this something formerly we called motion; we now call it electric current. But these appellations were only images substituted for the real objects which nature will eternally hide from us. The true relations between these real objects are the only reality we can attain to, and the only condition is that[Pg 141] the same relations exist between these objects as between the images by which we are forced to replace them. If these relations are known to us, what matter if we deem it convenient to replace one image by another.

And let no one claim that we reduce physical theories to just practical recipes; these equations show relationships, and if the equations are true, it's because these relationships remain real. They teach us, now as they always have, that there is a specific relationship between one thing and another; what we used to call motion we now refer to as electric current. But these names are just images standing in for the real objects that nature will always keep hidden from us. The actual relationships between these real objects are the only reality we can grasp, and the only requirement is that[Pg 141] the same relationships exist between these objects as between the images we have to use instead. If we understand these relationships, what does it matter if we find it easier to replace one image with another?

That some periodic phenomenon (an electric oscillation, for instance) is really due to the vibration of some atom which, acting like a pendulum, really moves in this or that sense, is neither certain nor interesting. But that between electric oscillation, the motion of the pendulum and all periodic phenomena there exists a close relationship which corresponds to a profound reality; that this relationship, this similitude, or rather this parallelism extends into details; that it is a consequence of more general principles, that of energy and that of least action; this is what we can affirm; this is the truth which will always remain the same under all the costumes in which we may deem it useful to deck it out.

That some periodic phenomenon (like an electric oscillation) is actually caused by the vibration of an atom, which moves like a pendulum in one direction or another, is neither certain nor particularly interesting. However, the fact that there’s a close connection between electric oscillation, the motion of the pendulum, and all periodic phenomena points to a deeper reality. This relationship, this similarity, or perhaps this parallelism, extends into specifics and is a result of more general principles, like energy and the principle of least action. This is what we can confidently state; this is the truth that will always remain unchanged, regardless of the different ways we may choose to present it.

Numerous theories of dispersion have been proposed; the first was imperfect and contained only a small part of truth. Afterwards came that of Helmholtz; then it was modified in various ways, and its author himself imagined another founded on the principles of Maxwell. But, what is remarkable, all the scientists who came after Helmholtz reached the same equations, starting from points of departure in appearance very widely separated. I will venture to say these theories are all true at the same time, not only because they make us foresee the same phenomena, but because they put in evidence a true relation, that of absorption and anomalous dispersion. What is true in the premises of these theories is what is common to all the authors; this is the affirmation of this or that relation between certain things which some call by one name, others by another.

Numerous theories of dispersion have been suggested; the first was incomplete and contained only a small portion of the truth. Then came Helmholtz's theory, which was later modified in various ways, and the author himself conceived another based on Maxwell's principles. What's interesting is that all the scientists who followed Helmholtz arrived at the same equations, starting from seemingly very different points of view. I would argue that these theories can all be considered true simultaneously, not just because they predict the same phenomena, but also because they highlight a genuine relationship, namely that of absorption and anomalous dispersion. What is accurate in the premises of these theories is what all the authors share in common; this is the assertion of a relationship between certain concepts that some name one thing, while others name it something else.

The kinetic theory of gases has given rise to many objections, which we could hardly answer if we pretended to see in it the absolute truth. But all these objections will not preclude its having been useful, and particularly so in revealing to us a relation true and but for it profoundly hidden, that of the gaseous pressure and the osmotic pressure. In this sense, then, it may be said to be true.

The kinetic theory of gases has raised a lot of objections, which we could hardly address if we claimed to view it as the absolute truth. However, all these objections don't stop it from being useful, especially in uncovering a relationship that is real and would otherwise be deeply hidden: the connection between gas pressure and osmotic pressure. In this way, it can be considered true.

When a physicist finds a contradiction between two theories[Pg 142] equally dear to him, he sometimes says: "We will not bother about that, but hold firmly the two ends of the chain, though the intermediate links are hidden from us." This argument of an embarrassed theologian would be ridiculous if it were necessary to attribute to physical theories the sense the laity give them. In case of contradiction, one of them at least must then be regarded as false. It is no longer the same if in them be sought only what should be sought. May be they both express true relations and the contradiction is only in the images wherewith we have clothed the reality.

When a physicist encounters a conflict between two theories[Pg 142] that are both important to him, he sometimes says, "Let's not worry about that, but instead hold on to both ends of the chain, even if the links in the middle are out of sight." This reasoning from a confused theologian would seem ridiculous if we had to assign physical theories the same meaning that the general public does. In cases of conflict, at least one of those theories must be considered false. It’s different if we seek only what should be sought within them. Perhaps both theories express true relationships, and the contradiction lies only in the way we've represented the reality.

To those who find we restrict too much the domain accessible to the scientist, I answer: These questions which we interdict to you and which you regret, are not only insoluble, they are illusory and devoid of meaning.

To those who think we limit the areas available to scientists too much, I respond: The questions we prohibit you from exploring, which you wish you could tackle, are not just unsolvable; they are also illusory and meaningless.

Some philosopher pretends that all physics may be explained by the mutual impacts of atoms. If he merely means there are between physical phenomena the same relations as between the mutual impacts of a great number of balls, well and good, that is verifiable, that is perhaps true. But he means something more; and we think we understand it because we think we know what impact is in itself; why? Simply because we have often seen games of billiards. Shall we think God, contemplating his work, feels the same sensations as we in watching a billiard match? If we do not wish to give this bizarre sense to his assertion, if neither do we wish the restricted sense I have just explained, which is good sense, then it has none.

Some philosopher claims that all of physics can be explained by the interactions between atoms. If he just means that physical phenomena are similar to the interactions between a lot of balls, then that's fine; it's something we can test, and it might be true. But he means something deeper, and we think we understand it because we believe we know what an impact is in itself; why? Simply because we've often watched billiards games. Should we think that God, looking over his creation, feels the same sensations we do when watching a billiards match? If we don’t want to attribute this strange meaning to his statement, and we also don't want to stick to the simplistic interpretation I just mentioned, which makes sense, then it really has no meaning at all.

Hypotheses of this sort have therefore only a metaphorical sense. The scientist should no more interdict them than the poet does metaphors; but he ought to know what they are worth. They may be useful to give a certain satisfaction to the mind, and they will not be injurious provided they are only indifferent hypotheses.

Hypotheses like this only have a metaphorical meaning. A scientist shouldn't ban them any more than a poet bans metaphors; however, he should understand their value. They can be helpful for providing some satisfaction to the mind, and they won't be harmful as long as they are just neutral hypotheses.

These considerations explain to us why certain theories, supposed to be abandoned and finally condemned by experiment, suddenly arise from their ashes and recommence a new life. It is because they expressed true relations; and because they had not ceased to do so when, for one reason or another, we felt it necessary to enunciate the same relations in another language. So they retained a sort of latent life.[Pg 143]

These thoughts help us understand why some theories, which were believed to be discarded and ultimately disproven by experiments, unexpectedly come back to life. It's because they represented genuine relationships, and even when we found it necessary to express those relationships in different terms for various reasons, those original ideas never completely disappeared. They maintained a kind of hidden existence.[Pg 143]

Scarcely fifteen years ago was there anything more ridiculous, more naïvely antiquated, than Coulomb's fluids? And yet here they are reappearing under the name of electrons. Wherein do these permanently electrified molecules differ from Coulomb's electric molecules? It is true that in the electrons the electricity is supported by a little, a very little matter; in other words, they have a mass (and yet this is now contested); but Coulomb did not deny mass to his fluids, or, if he did, it was only with reluctance. It would be rash to affirm that the belief in electrons will not again suffer eclipse; it was none the less curious to note this unexpected resurrection.

Scarcely fifteen years ago, was there anything more ridiculous, more naively outdated, than Coulomb's fluids? And yet here they are coming back under the name of electrons. How do these permanently electrified molecules differ from Coulomb's electric molecules? It's true that in electrons, the electricity is carried by a small, very small amount of matter; in other words, they have mass (and yet this is now being debated); but Coulomb did not deny mass to his fluids, or if he did, it was only reluctantly. It would be rash to say that the belief in electrons won't fade away again; still, it’s interesting to see this unexpected comeback.

But the most striking example is Carnot's principle. Carnot set it up starting from false hypotheses; when it was seen that heat is not indestructible, but may be transformed into work, his ideas were completely abandoned; afterwards Clausius returned to them and made them finally triumph. Carnot's theory, under its primitive form, expressed, aside from true relations, other inexact relations, débris of antiquated ideas; but the presence of these latter did not change the reality of the others. Clausius had only to discard them as one lops off dead branches.

But the most striking example is Carnot's principle. Carnot started it based on false assumptions; when it was revealed that heat isn't indestructible and can be converted into work, his ideas were completely abandoned. Later, Clausius revisited them and ultimately established their validity. Carnot's theory, in its original form, expressed not only true relationships but also some inaccurate ones, remnants of outdated ideas; however, these did not alter the validity of the others. Clausius simply needed to remove them, like trimming off dead branches.

The result was the second fundamental law of thermodynamics. There were always the same relations; though these relations no longer subsisted, at least in appearance, between the same objects. This was enough for the principle to retain its value. And even the reasonings of Carnot have not perished because of that; they were applied to a material tainted with error; but their form (that is to say, the essential) remained correct.

The result was the second fundamental law of thermodynamics. The same relationships always existed; although these relationships no longer seemed to exist between the same objects. This was enough for the principle to maintain its validity. Even Carnot's reasoning hasn't disappeared because of this; it was applied to a flawed material, but its structure (that is, the essential part) remained accurate.

What I have just said illuminates at the same time the rôle of general principles such as the principle of least action, or that of the conservation of energy.

What I've just said highlights the role of general principles like the principle of least action and the conservation of energy.

These principles have a very high value; they were obtained in seeking what there was in common in the enunciation of numerous physical laws; they represent therefore, as it were, the quintessence of innumerable observations.

These principles are extremely valuable; they were derived from looking for what was common in the presentation of many physical laws; they essentially represent the essence of countless observations.

However, from their very generality a consequence results to which I have called attention in Chapter VIII, namely, that they can no longer be verified. As we can not give a general definition of energy, the principle of the conservation of energy[Pg 144] signifies simply that there is something which remains constant. Well, whatever be the new notions that future experiments shall give us about the world, we are sure in advance that there will be something there which will remain constant and which may be called energy.

However, from their very general nature, a consequence emerges that I pointed out in Chapter VIII, which is that they can no longer be verified. Since we can't provide a general definition of energy, the principle of the conservation of energy[Pg 144] simply means that there is something that stays constant. No matter what new ideas future experiments reveal about the world, we can be sure in advance that there will be something that remains constant and can be referred to as energy.

Is this to say that the principle has no meaning and vanishes in a tautology? Not at all; it signifies that the different things to which we give the name of energy are connected by a true kinship; it affirms a real relation between them. But then if this principle has a meaning, it may be false; it may be that we have not the right to extend indefinitely its applications, and yet it is certain beforehand to be verified in the strict acceptation of the term; how then shall we know when it shall have attained all the extension which can legitimately be given it? Just simply when it shall cease to be useful to us, that is, to make us correctly foresee new phenomena. We shall be sure in such a case that the relation affirmed is no longer real; for otherwise it would be fruitful; experiment, without directly contradicting a new extension of the principle, will yet have condemned it.

Is this to say that the principle has no meaning and disappears into a tautology? Not at all; it means that the different things we call energy are connected by a real relationship; it confirms a genuine connection between them. But if this principle has meaning, it could be false; we might not have the right to extend its applications indefinitely, and yet it is certain to be verified in its strict sense; so how will we know when it has reached all the legitimate applications it can have? Simply when it stops being useful to us, meaning, when it no longer helps us accurately predict new phenomena. In that case, we can be sure that the relationship we asserted is no longer real; otherwise, it would still be productive; experiments, while not directly contradicting a new extension of the principle, will have ultimately refuted it.

Physics and Mechanism.—Most theorists have a constant predilection for explanations borrowed from mechanics or dynamics. Some would be satisfied if they could explain all phenomena by motions of molecules attracting each other according to certain laws. Others are more exacting; they would suppress attractions at a distance; their molecules should follow rectilinear paths from which they could be made to deviate only by impacts. Others again, like Hertz, suppress forces also, but suppose their molecules subjected to geometric attachments analogous, for instance, to those of our linkages; they try thus to reduce dynamics to a sort of kinematics.

Physics and Engineering.—Most theorists tend to favor explanations derived from mechanics or dynamics. Some would be content to explain all phenomena through the movements of molecules that attract each other according to specific laws. Others are more demanding; they want to eliminate long-range attractions entirely, insisting that molecules travel in straight lines, only to be changed in direction by collisions. Still others, like Hertz, also dismiss forces but propose that their molecules are subject to geometric connections similar to the linkages we have; they aim to simplify dynamics into a type of kinematics.

In a word, all would bend nature into a certain form outside of which their mind could not feel satisfied. Will nature be sufficiently flexible for that?

In short, everyone would try to shape nature into a specific form that their mind wouldn’t be satisfied without. Will nature be flexible enough for that?

We shall examine this question in Chapter XII, à propos of Maxwell's theory. Whenever the principles of energy and of least action are satisfied, we shall see not only that there is always one possible mechanical explanation, but that there is always an infinity of them. Thanks to a well-known theorem of König's on[Pg 145] linkages, it could be shown that we can, in an infinity of ways, explain everything by attachments after the manner of Hertz, or also by central forces. Without doubt it could be demonstrated just as easily that everything can always be explained by simple impacts.

We will explore this question in Chapter XII, regarding Maxwell's theory. Whenever the principles of energy and least action are met, we will find not only that there is always one possible mechanical explanation, but that there are always countless explanations. Thanks to a well-known theorem by König on[Pg 145] linkages, it can be shown that we can explain everything in countless ways, whether through attachments in the style of Hertz or by using central forces. Without a doubt, it could be just as easily demonstrated that everything can also be explained by simple impacts.

For that, of course, we need not be content with ordinary matter, with that which falls under our senses and whose motions we observe directly. Either we shall suppose that this common matter is formed of atoms whose internal motions elude us, the displacement of the totality alone remaining accessible to our senses. Or else we shall imagine some one of those subtile fluids which under the name of ether or under other names, have at all times played so great a rôle in physical theories.

For that, of course, we can't just rely on ordinary matter, the kind that we can perceive and observe directly. We could assume that this common matter is made up of atoms whose internal movements we can't detect, with only the overall movement being noticeable to our senses. Alternatively, we might picture one of those subtle fluids known as ether or by other names, which have always had a significant role in physical theories.

Often one goes further and regards the ether as the sole primitive matter or even as the only true matter. The more moderate consider common matter as condensed ether, which is nothing startling; but others reduce still further its importance and see in it nothing more than the geometric locus of the ether's singularities. For instance, what we call matter is for Lord Kelvin only the locus of points where the ether is animated by vortex motions; for Riemann, it was the locus of points where ether is constantly destroyed; for other more recent authors, Wiechert or Larmor, it is the locus of points where the ether undergoes a sort of torsion of a very particular nature. If the attempt is made to occupy one of these points of view, I ask myself by what right shall we extend to the ether, under pretext that this is the true matter, mechanical properties observed in ordinary matter, which is only false matter.

Often, people take it a step further and see ether as the only basic substance or even as the only real matter. Those with a more moderate view see regular matter as compressed ether, which isn't surprising; but others downplay its significance even more, viewing it merely as the geometric location of the ether's singularities. For example, what we call matter is, for Lord Kelvin, just the points where the ether is stirred by vortex motions; for Riemann, it was where the ether is continuously destroyed; and for more recent authors like Wiechert or Larmor, it is where the ether experiences a specific type of torsion. If we try to adopt one of these perspectives, I wonder what right we have to project onto ether, claiming it to be the true matter, the mechanical properties observed in regular matter, which is merely illusory matter.

The ancient fluids, caloric, electricity, etc., were abandoned when it was perceived that heat is not indestructible. But they were abandoned for another reason also. In materializing them, their individuality was, so to speak, emphasized, a sort of abyss was opened between them. This had to be filled up on the coming of a more vivid feeling of the unity of nature, and the perception of the intimate relations which bind together all its parts. Not only did the old physicists, in multiplying fluids, create entities unnecessarily, but they broke real ties.

The ancient concepts like caloric and electricity were set aside when it became clear that heat isn’t indestructible. However, there was another reason for their abandonment. By defining these concepts, their uniqueness was highlighted, creating a kind of divide between them. This gap needed to be closed as we started to develop a stronger sense of the unity of nature and recognized the deep connections that link all its components. Not only did the old physicists create unnecessary entities by multiplying these concepts, but they also disrupted real connections.

It is not sufficient for a theory to affirm no false relations, it must not hide true relations.[Pg 146]

A theory must not only avoid claiming false relationships, but it also has to reveal true ones.[Pg 146]

And does our ether really exist? We know the origin of our belief in the ether. If light reaches us from a distant star, during several years it was no longer on the star and not yet on the earth; it must then be somewhere and sustained, so to speak, by some material support.

And does our ether really exist? We know where our belief in the ether comes from. If light travels to us from a distant star, for several years it isn't on the star anymore and hasn't reached the earth yet; it must then be somewhere and, so to speak, supported by some material.

The same idea may be expressed under a more mathematical and more abstract form. What we ascertain are the changes undergone by material molecules; we see, for instance, that our photographic plate feels the consequences of phenomena of which the incandescent mass of the star was the theater several years before. Now, in ordinary mechanics the state of the system studied depends only on its state at an instant immediately anterior; therefore the system satisfies differential equations. On the contrary, if we should not believe in the ether, the state of the material universe would depend not only on the state immediately preceding, but on states much older; the system would satisfy equations of finite differences. It is to escape this derogation of the general laws of mechanics that we have invented the ether.

The same idea can be expressed in a more mathematical and abstract way. What we observe are the changes that material molecules go through; for example, we notice that our photographic plate records the effects of events that happened in the star's glowing mass several years earlier. In ordinary mechanics, the state of the system being studied depends only on its state at an instant just before it; therefore, the system follows differential equations. Conversely, if we were to reject the idea of the ether, the state of the material universe would depend not just on the immediately previous state but also on much older states; the system would then follow equations of finite differences. We invented the ether to avoid this limitation on the general laws of mechanics.

That would still only oblige us to fill up, with the ether, the interplanetary void, but not to make it penetrate the bosom of the material media themselves. Fizeau's experiment goes further. By the interference of rays which have traversed air or water in motion, it seems to show us two different media interpenetrating and yet changing place one with regard to the other.

That would still only require us to fill the interplanetary void with ether, but not to let it enter the very substance of the material media. Fizeau's experiment goes further. By using the interference of rays that have passed through moving air or water, it appears to show us two different media overlapping while still changing position relative to each other.

We seem to touch the ether with the finger.

We seem to touch the atmosphere with our fingertips.

Yet experiments may be conceived which would make us touch it still more nearly. Suppose Newton's principle, of the equality of action and reaction, no longer true if applied to matter alone, and that we have established it. The geometric sum of all the forces applied to all the material molecules would no longer be null. It would be necessary then, if we did not wish to change all mechanics, to introduce the ether, in order that this action which matter appeared to experience should be counterbalanced by the reaction of matter on something.

Yet we can come up with experiments that would allow us to understand it even better. Imagine if Newton's principle about action and reaction didn't apply just to matter anymore and we manage to prove that. The total of all the forces acting on all the material molecules wouldn’t cancel out anymore. So, unless we want to change all of mechanics, we would need to introduce ether to ensure that this action that matter seems to experience is balanced by the reaction of matter on something else.

Or again, suppose we discover that optical and electrical phenomena are influenced by the motion of the earth. We should be led to conclude that these phenomena might reveal to us not[Pg 147] only the relative motions of material bodies, but what would seem to be their absolute motions. Again, an ether would be necessary, that these so-called absolute motions should not be their displacements with regard to a void space, but their displacements with regard to something concrete.

Or imagine we find that optical and electrical phenomena are affected by the movement of the Earth. We would conclude that these phenomena could show us not only the relative motions of physical bodies but also what seem to be their absolute motions. Once again, an ether would be needed so that these so-called absolute motions represent their displacements concerning something concrete, rather than just a void space.

Shall we ever arrive at that? I have not this hope, I shall soon say why, and yet it is not so absurd, since others have had it.

Shall we ever get there? I don't have that hope, and I’ll explain why soon, but it’s not so crazy since others have had it.

For instance, if the theory of Lorentz, of which I shall speak in detail further on in Chapter XIII., were true, Newton's principle would not apply to matter alone, and the difference would not be very far from being accessible to experiment.

For example, if Lorentz's theory, which I will discuss in detail later in Chapter XIII, were true, Newton's principle wouldn't apply to matter alone, and the difference would likely be close enough to be tested experimentally.

On the other hand, many researches have been made on the influence of the earth's motion. The results have always been negative. But these experiments were undertaken because the outcome was not sure in advance, and, indeed, according to the ruling theories, the compensation would be only approximate, and one might expect to see precise methods give positive results.

On the other hand, a lot of research has been done on the impact of the earth's movement. The results have consistently been negative. However, these experiments were conducted because the outcome was uncertain beforehand, and according to prevailing theories, the compensation would only be rough. One would expect that accurate methods would yield positive results.

I believe that such a hope is illusory; it was none the less interesting to show that a success of this sort would open to us, in some sort, a new world.

I think that kind of hope is unrealistic; it was still quite intriguing to show that a success like this would, in a way, open up a new world for us.

And now I must be permitted a digression; I must explain, in fact, why I do not believe, despite Lorentz, that more precise observations can ever put in evidence anything else than the relative displacements of material bodies. Experiments have been made which should have disclosed the terms of the first order; the results have been negative; could that be by chance? No one has assumed that; a general explanation has been sought, and Lorentz has found it; he has shown that the terms of the first order must destroy each other, but not those of the second. Then more precise experiments were made; they also were negative; neither could this be the effect of chance; an explanation was necessary; it was found; they always are found; of hypotheses there is never lack.

And now I need to take a moment to digress; I need to explain why I don’t believe, despite Lorentz, that more precise observations can reveal anything other than the relative movements of material bodies. Experiments have been conducted that should have uncovered the first-order terms; the results were negative. Could that really be a coincidence? No one has thought so; a general explanation has been sought, and Lorentz has provided it. He showed that the first-order terms must cancel each other out, but not the second-order terms. Then, even more precise experiments were conducted; they too were negative. This couldn’t just be chance; an explanation was needed; it was discovered; they always find one; there’s never a shortage of hypotheses.

But this is not enough; who does not feel that this is still to leave to chance too great a rôle? Would not that also be a chance, this singular coincidence which brought it about that a certain circumstance should come just in the nick of time to[Pg 148] destroy the terms of the first order, and that another circumstance, wholly different, but just as opportune, should take upon itself to destroy those of the second order? No, it is necessary to find an explanation the same for the one as for the other, and then everything leads us to think that this explanation will hold good equally well for the terms of higher order, and that the mutual destruction of these terms will be rigorous and absolute.

But this isn't enough; who doesn't feel that this leaves too much to chance? Wouldn't it also be a chance that this unique coincidence happened, causing a specific situation to come at just the right moment to[Pg 148] disrupt the first set of circumstances, and that another entirely different situation, but equally timely, would then step in to disrupt the second set? No, we need to find an explanation that applies equally to both cases, and everything suggests that this explanation will also work for the higher-level circumstances, ensuring that the mutual disruption of these circumstances will be strict and absolute.

Present State of the Science.—In the history of the development of physics we distinguish two inverse tendencies.

Current State of the Science.—In the history of the development of physics, we can identify two opposing trends.

On the one hand, new bonds are continually being discovered between objects which had seemed destined to remain forever unconnected; scattered facts cease to be strangers to one another; they tend to arrange themselves in an imposing synthesis. Science advances toward unity and simplicity.

On one hand, new connections are constantly being found between things that once seemed like they would never connect; unrelated facts stop being strangers to each other; they start to come together in an impressive synthesis. Science moves towards unity and simplicity.

On the other hand, observation reveals to us every day new phenomena; they must long await their place and sometimes, to make one for them, a corner of the edifice must be demolished. In the known phenomena themselves, where our crude senses showed us uniformity, we perceive details from day to day more varied; what we believed simple becomes complex, and science appears to advance toward variety and complexity.

On the other hand, observation shows us new phenomena every day; they often have to wait a long time for their proper place, and sometimes, to make room for them, a part of the structure must be torn down. In the familiar phenomena themselves, where our basic senses showed us uniformity, we notice more and more varied details every day; what we thought was simple turns out to be complex, and science seems to be moving towards variety and complexity.

Of these two inverse tendencies, which seem to triumph turn about, which will win? If it be the first, science is possible; but nothing proves this a priori, and it may well be feared that after having made vain efforts to bend nature in spite of herself to our ideal of unity, submerged by the ever-rising flood of our new riches, we must renounce classifying them, abandon our ideal, and reduce science to the registration of innumerable recipes.

Of these two conflicting tendencies, which seem to take turns being dominant, which one will prevail? If it’s the first, then science is possible; however, there’s no proof of this a priori, and it’s quite possible that after struggling in vain to shape nature to fit our ideal of unity, overwhelmed by the ever-growing tide of our new wealth, we will have to give up categorizing them, abandon our ideal, and limit science to just recording countless formulas.

To this question we can not reply. All we can do is to observe the science of to-day and compare it with that of yesterday. From this examination we may doubtless draw some encouragement.

To this question, we can't answer. All we can do is observe today's science and compare it with yesterday's. From this examination, we can surely find some encouragement.

Half a century ago, hope ran high. The discovery of the conservation of energy and of its transformations had revealed to us the unity of force. Thus it showed that the phenomena of heat could be explained by molecular motions. What was the nature of these motions was not exactly known, but no one[Pg 149] doubted that it soon would be. For light, the task seemed completely accomplished. In what concerns electricity, things were less advanced. Electricity had just annexed magnetism. This was a considerable step toward unity, and a decisive step.

Half a century ago, optimism was high. The discovery of energy conservation and its transformations revealed the unity of force. It demonstrated that heat phenomena could be explained by molecular motion. The exact nature of these motions wasn’t fully understood, but no one[Pg 149] doubted it would be figured out soon. As for light, the task seemed mostly done. However, things were less progressed when it came to electricity. Electricity had just merged with magnetism. This was a significant and decisive step towards unity.

But how should electricity in its turn enter into the general unity, how should it be reduced to the universal mechanism?

But how should electricity fit into the overall unity, and how should it be integrated into the universal mechanism?

Of that no one had any idea. Yet the possibility of this reduction was doubted by none, there was faith. Finally, in what concerns the molecular properties of material bodies, the reduction seemed still easier, but all the detail remained hazy. In a word, the hopes were vast and animated, but vague. To-day, what do we see? First of all, a prime progress, immense progress. The relations of electricity and light are now known; the three realms, of light, of electricity and of magnetism, previously separated, form now but one; and this annexation seems final.

Of that, no one had any idea. But everyone believed in the possibility of this reduction. Ultimately, when it comes to the molecular properties of materials, the reduction seemed even more straightforward, though all the details were still unclear. In short, the hopes were great and lively, but vague. Today, what do we see? First and foremost, incredible progress. The connections between electricity and light are now understood; the three realms of light, electricity, and magnetism, which were once separate, now come together as one, and this unification seems to be permanent.

This conquest, however, has cost us some sacrifices. The optical phenomena subordinate themselves as particular cases under the electrical phenomena; so long as they remained isolated, it was easy to explain them by motions that were supposed to be known in all their details, that was a matter of course; but now an explanation, to be acceptable, must be easily capable of extension to the entire electric domain. Now that is a matter not without difficulties.

This conquest, however, has come with certain sacrifices. The optical phenomena fall under the category of electrical phenomena; as long as they were isolated, it was straightforward to explain them with movements that were believed to be fully understood, which was taken for granted. But now, for an explanation to be acceptable, it must be easily extendable to the entire electric field. This is not without its challenges.

The most satisfactory theory we have is that of Lorentz, which, as we shall see in the last chapter, explains electric currents by the motions of little electrified particles; it is unquestionably the one which best explains the known facts, the one which illuminates the greatest number of true relations, the one of which most traces will be found in the final construction. Nevertheless, it still has a serious defect, which I have indicated above; it is contrary to Newton's law of the equality of action and reaction; or rather, this principle, in the eyes of Lorentz, would not be applicable to matter alone; for it to be true, it would be necessary to take account of the action of the ether on matter and of the reaction of matter on the ether.

The most satisfying theory we have is Lorentz's, which, as we’ll see in the last chapter, explains electric currents through the movement of tiny electrified particles. It undoubtedly does the best job of explaining the known facts, illuminating the greatest number of true relationships, and its traces will be most evident in the final construction. However, it still has a major flaw that I've mentioned earlier; it contradicts Newton's law of equal action and reaction. In Lorentz's view, this principle wouldn’t apply just to matter; for it to be true, we would need to consider the ether’s impact on matter and the matter’s impact on the ether.

Now, from what we know at present, it seems probable that things do not happen in this way.

Now, based on what we know right now, it seems likely that things don't happen like this.

However that may be, thanks to Lorentz, Fizeau's results on[Pg 150] the optics of moving bodies, the laws of normal and anomalous dispersion and of absorption find themselves linked to one another and to the other properties of the ether by bonds which beyond any doubt will never more be broken. See the facility with which the new Zeeman effect has found its place already and has even aided in classifying Faraday's magnetic rotation which had defied Maxwell's efforts; this facility abundantly proves that the theory of Lorentz is not an artificial assemblage destined to fall asunder. It will probably have to be modified, but not destroyed.

However that may be, thanks to Lorentz, Fizeau's results on[Pg 150] the optics of moving bodies, the laws of normal and unusual dispersion and absorption are now connected to one another and to other properties of the ether by links that will undoubtedly never be broken. Notice how easily the new Zeeman effect has already found its place and has even helped classify Faraday's magnetic rotation, which had challenged Maxwell's efforts; this ease clearly shows that Lorentz's theory is not just an artificial collection of ideas meant to fall apart. It will likely need some modifications, but it won't be destroyed.

But Lorentz had no aim beyond that of embracing in one totality all the optics and electrodynamics of moving bodies; he never pretended to give a mechanical explanation of them. Larmor goes further; retaining the theory of Lorentz in essentials, he grafts upon it, so to speak, MacCullagh's ideas on the direction of the motions of the ether.

But Lorentz didn’t aim for anything beyond unifying all the optics and electrodynamics of moving bodies; he never claimed to provide a mechanical explanation for them. Larmor goes a step further; while keeping the core ideas of Lorentz, he adds, so to speak, MacCullagh's thoughts on the direction of the motions of the ether.

According to him, the velocity of the ether would have the same direction and the same magnitude as the magnetic force. However ingenious this attempt may be, the defect of the theory of Lorentz remains and is even aggravated. With Lorentz, we do not know what are the motions of the ether; thanks to this ignorance, we may suppose them such that, compensating those of matter, they reestablish the equality of action and reaction. With Larmor, we know the motions of the ether, and we can ascertain that the compensation does not take place.

According to him, the speed of the ether would be in the same direction and have the same strength as the magnetic force. However clever this attempt might be, the flaw in Lorentz's theory still exists and is even worsened. With Lorentz, we don’t know the movements of the ether; because of this uncertainty, we can imagine them in a way that balances those of matter, restoring the equality of action and reaction. With Larmor, we know the movements of the ether, and we can determine that this balancing does not happen.

If Larmor has failed, as it seems to me he has, does that mean that a mechanical explanation is impossible? Far from it: I have said above that when a phenomenon obeys the two principles of energy and of least action, it admits of an infinity of mechanical explanations; so it is, therefore, with the optical and electrical phenomena.

If Larmor has failed, as it seems to me he has, does that mean a mechanical explanation is impossible? Not at all: I mentioned earlier that when a phenomenon follows the two principles of energy and least action, it allows for countless mechanical explanations; this is also true for optical and electrical phenomena.

But this is not enough: for a mechanical explanation to be good, it must be simple; for choosing it among all which are possible, there should be other reasons besides the necessity of making a choice. Well, we have not as yet a theory satisfying this condition and consequently good for something. Must we lament this? That would be to forget what is the goal sought; this is not mechanism; the true, the sole aim is unity.

But this isn’t enough: for a mechanical explanation to be valid, it needs to be simple; when selecting one among all the possible options, there should be reasons beyond just needing to make a choice. As of now, we don’t have a theory that meets this requirement and is therefore useful. Should we be upset about this? That would mean forgetting the ultimate goal; it’s not just about mechanisms; the true, primary goal is unity.

We must therefore set bounds to our ambition; let us not try[Pg 151] to formulate a mechanical explanation; let us be content with showing that we could always find one if we wished to. In this regard we have been successful; the principle of the conservation of energy has received only confirmations; a second principle has come to join it, that of least action, put under the form which is suitable for physics. It also has always been verified, at least in so far as concerns reversible phenomena which thus obey the equations of Lagrange, that is to say, the most general laws of mechanics.

We should therefore limit our ambitions; let’s not try[Pg 151] to create a mechanical explanation; let’s be satisfied with showing that we could always find one if we wanted to. In this regard, we’ve been successful; the principle of conservation of energy has been consistently confirmed; a second principle has joined it, the principle of least action, formatted appropriately for physics. This too has always been verified, at least regarding reversible phenomena, which comply with the equations of Lagrange, that is, the most general laws of mechanics.

Irreversible phenomena are much more rebellious. Yet these also are being coordinated, and tend to come into unity; the light which has illuminated them has come to us from Carnot's principle. Long did thermodynamics confine itself to the study of the dilatation of bodies and their changes of state. For some time past it has been growing bolder and has considerably extended its domain. We owe to it the theory of the galvanic battery and that of the thermoelectric phenomena; there is not in all physics a corner that it has not explored, and it has attacked chemistry itself.

Irreversible phenomena are much more unpredictable. However, they are also being organized and tend to come together; the knowledge that has illuminated them has come to us from Carnot's principle. For a long time, thermodynamics focused on studying the expansion of materials and their changes in state. Recently, it has grown more ambitious and has significantly expanded its reach. We owe to it the theory of the galvanic battery and thermoelectric phenomena; there isn't a part of physics that it hasn't investigated, and it has even delved into chemistry itself.

Everywhere the same laws reign; everywhere, under the diversity of appearances, is found again Carnot's principle; everywhere also is found that concept so prodigiously abstract of entropy, which is as universal as that of energy and seems like it to cover a reality. Radiant heat seemed destined to escape it; but recently we have seen that submit to the same laws.

Everywhere the same laws apply; everywhere, beneath the different appearances, we find Carnot's principle again; everywhere we also find that incredibly abstract concept of entropy, which is as universal as that of energy and seems to encompass a reality. Radiant heat seemed destined to evade it; however, recently we have seen that it also conforms to the same laws.

In this way fresh analogies are revealed to us, which may often be followed into detail; ohmic resistance resembles the viscosity of liquids; hysteresis would resemble rather the friction of solids. In all cases, friction would appear to be the type which the most various irreversible phenomena copy, and this kinship is real and profound.

In this way, new comparisons are shown to us, which can often be explored in detail; ohmic resistance is similar to the viscosity of liquids; hysteresis is more like the friction of solids. In all instances, friction seems to be the model that the most diverse irreversible phenomena imitate, and this connection is genuine and deep.

Of these phenomena a mechanical explanation, properly so called, has also been sought. They hardly lent themselves to it. To find it, it was necessary to suppose that the irreversibility is only apparent, that the elementary phenomena are reversible and obey the known laws of dynamics. But the elements are extremely numerous and blend more and more, so that to our crude sight all appears to tend toward uniformity, that is, everything seems to[Pg 152] go forward in the same sense without hope of return. The apparent irreversibility is thus only an effect of the law of great numbers. But, only a being with infinitely subtile senses, like Maxwell's imaginary demon, could disentangle this inextricable skein and turn back the course of the universe.

A mechanical explanation for these phenomena has also been sought, but it hardly fits. To find one, we would need to assume that the irreversibility is just an illusion, that the fundamental phenomena are reversible and follow known laws of dynamics. However, there are an incredibly large number of these elements, and they increasingly blend together, so to our unrefined perception, everything seems to head toward uniformity; it all appears to be moving in the same direction with no chance of return. This apparent irreversibility is merely a result of the law of large numbers. Only a being with infinitely sensitive senses, like Maxwell's imaginary demon, could unravel this complex web and reverse the course of the universe.

This conception, which attaches itself to the kinetic theory of gases, has cost great efforts and has not, on the whole, been fruitful; but it may become so. This is not the place to examine whether it does not lead to contradictions and whether it is in conformity with the true nature of things.

This idea, which is based on the kinetic theory of gases, has required a lot of effort and hasn't really been productive overall; however, it might be in the future. This isn't the place to discuss whether it leads to contradictions or if it aligns with the true nature of things.

We signalize, however, M. Gouy's original ideas on the Brownian movement. According to this scientist, this singular motion should escape Carnot's principle. The particles which it puts in swing would be smaller than the links of that so compacted skein; they would therefore be fitted to disentangle them and hence to make the world go backward. We should almost see Maxwell's demon at work.

We highlight M. Gouy's original ideas about Brownian motion. According to this scientist, this unique movement would bypass Carnot's principle. The particles that are set in motion would be smaller than the strands of that tightly packed skein; therefore, they would be able to untangle them and effectively reverse the flow of the world. We could almost envision Maxwell's demon in action.

To summarize, the previously known phenomena are better and better classified, but new phenomena come to claim their place; most of these, like the Zeeman effect, have at once found it.

To sum up, the previously known phenomena are becoming more and more accurately classified, but new phenomena are emerging to take their place; most of these, like the Zeeman effect, have quickly found their spot.

But we have the cathode rays, the X-rays, those of uranium and of radium. Herein is a whole world which no one suspected. How many unexpected guests must be stowed away?

But we have cathode rays, X-rays, and those from uranium and radium. This reveals an entire world that no one anticipated. How many surprising visitors might be hidden away?

No one can yet foresee the place they will occupy. But I do not believe they will destroy the general unity; I think they will rather complete it. On the one hand, in fact, the new radiations seem connected with the phenomena of luminescence; not only do they excite fluorescence, but they sometimes take birth in the same conditions as it.

No one can predict the role they will play yet. However, I don't think they will disrupt the overall unity; instead, I believe they will enhance it. On one hand, the new radiations appear to be linked to luminescence phenomena; they not only trigger fluorescence but can also arise under the same conditions.

Nor are they without kinship with the causes which produce the electric spark under the action of the ultra-violet light.

Nor are they without a connection to the factors that create the electric spark when exposed to ultraviolet light.

Finally, and above all, it is believed that in all these phenomena are found true ions, animated, it is true, by velocities incomparably greater than in the electrolytes.

Finally, and most importantly, it is believed that in all these phenomena there are true ions, which are indeed animated by speeds that are vastly greater than those found in the electrolytes.

That is all very vague, but it will all become more precise.

That’s all pretty unclear, but it will all get clearer soon.

Phosphorescence, the action of light on the spark, these were regions rather isolated and consequently somewhat neglected by investigators. One may now hope that a new path will be[Pg 153] constructed which will facilitate their communications with the rest of science.

Phosphorescence, the effect of light on the spark, were areas that were pretty isolated and therefore a bit overlooked by researchers. Now, there is hope that a new path will be[Pg 153] created that will improve their connections with the rest of science.

Not only do we discover new phenomena, but in those we thought we knew, unforeseen aspects reveal themselves. In the free ether, the laws retain their majestic simplicity; but matter, properly so called, seems more and more complex; all that is said of it is never more than approximate, and at each instant our formulas require new terms.

Not only do we discover new phenomena, but even in those we thought we understood, unexpected aspects come to light. In free space, the laws maintain their impressive simplicity; however, matter, in the true sense, appears increasingly complex; everything said about it is only ever approximate, and at every moment our formulas need new terms.

Nevertheless the frames are not broken; the relations that we have recognized between objects we thought simple still subsist between these same objects when we know their complexity, and it is that alone which is of importance. Our equations become, it is true, more and more complicated, in order to embrace more closely the complexity of nature; but nothing is changed in the relations which permit the deducing of these equations one from another. In a word, the form of these equations has persisted.

Nevertheless, the frameworks are still intact; the connections we've identified between objects we once considered simple still exist between these same objects when we understand their complexity, and that's what really matters. Our equations are indeed becoming increasingly complex to better capture the intricacies of nature; but the relationships that allow us to derive these equations from one another remain unchanged. In short, the structure of these equations has stayed the same.

Take, for example, the laws of reflection: Fresnel had established them by a simple and seductive theory which experiment seemed to confirm. Since then more precise researches have proved that this verification was only approximate; they have shown everywhere traces of elliptic polarization. But, thanks to the help that the first approximation gave us, we found forthwith the cause of these anomalies, which is the presence of a transition layer; and Fresnel's theory has subsisted in its essentials.

Take, for example, the laws of reflection: Fresnel had established them through a straightforward and appealing theory that experiments seemed to back up. Since then, more detailed research has revealed that this verification was only somewhat accurate; it has shown evidence of elliptical polarization everywhere. However, thanks to the insights provided by the initial approximation, we quickly identified the cause of these anomalies, which is the presence of a transition layer; and Fresnel's theory has remained valid in its essentials.

But there is a reflection we can not help making: All these relations would have remained unperceived if one had at first suspected the complexity of the objects they connect. It has long been said: If Tycho had had instruments ten times more precise neither Kepler, nor Newton, nor astronomy would ever have been. It is a misfortune for a science to be born too late, when the means of observation have become too perfect. This is to-day the case with physical chemistry; its founders are embarrassed in their general grasp by third and fourth decimals; happily they are men of a robust faith.

But there's a thought we can't ignore: All these connections would have gone unnoticed if we initially suspected how complicated the objects they link really are. It has often been said: If Tycho had had instruments ten times more accurate, neither Kepler, nor Newton, nor astronomy would have ever existed. It's unfortunate for a science to emerge too late when observational tools have become too advanced. This is the situation today with physical chemistry; its pioneers struggle with their overall understanding due to overly precise decimals; fortunately, they are people of strong faith.

The better one knows the properties of matter the more one sees continuity reign. Since the labors of Andrews and of van der Waals, we get an idea of how the passage is made from the liquid to the gaseous state and that this passage is not abrupt. Similarly,[Pg 154] there is no gap between the liquid and solid states, and in the proceedings of a recent congress is to be seen, alongside of a work on the rigidity of liquids, a memoir on the flow of solids.

The better we understand the properties of matter, the more we recognize that continuity is key. Thanks to the work of Andrews and van der Waals, we have insight into how the transition from liquid to gas happens gradually rather than suddenly. Similarly,[Pg 154] there is no clear divide between liquid and solid states, and at a recent conference, alongside research on the rigidity of liquids, there was a paper discussing the flow of solids.

By this tendency no doubt simplicity loses; some phenomenon was formerly represented by several straight lines, now these straights must be joined by curves more or less complicated. In compensation unity gains notably. Those cut-off categories quieted the mind, but they did not satisfy it.

By this trend, simplicity undoubtedly suffers; something that used to be shown with several straight lines now needs to be connected by curves that are more or less complex. In return, unity gains significantly. Those disconnected categories calmed the mind, but they didn’t truly satisfy it.

Finally the methods of physics have invaded a new domain, that of chemistry; physical chemistry is born. It is still very young, but we already see that it will enable us to connect such phenomena as electrolysis, osmosis and the motions of ions.

Finally, the methods of physics have entered a new area, that of chemistry; physical chemistry has emerged. It's still quite young, but we're already seeing that it will allow us to link phenomena like electrolysis, osmosis, and the movement of ions.

From this rapid exposition, what shall we conclude?

From this quick overview, what should we conclude?

Everything considered, we have approached unity; we have not been as quick as was hoped fifty years ago, we have not always taken the predicted way; but, finally, we have gained ever so much ground.

Taking everything into account, we've come closer to unity; we weren't as fast as we expected fifty years ago, and we haven't always followed the expected path; but, in the end, we've made significant progress.


CHAPTER XI

Probability Calculus

Doubtless it will be astonishing to find here thoughts about the calculus of probabilities. What has it to do with the method of the physical sciences? And yet the questions I shall raise without solving present themselves naturally to the philosopher who is thinking about physics. So far is this the case that in the two preceding chapters I have often been led to use the words 'probability' and 'chance.'

Doubtless it will be astonishing to find here thoughts about the calculus of probabilities. What does it have to do with the method of the physical sciences? And yet the questions I’ll raise without solving come up naturally for a philosopher thinking about physics. This is so true that in the two previous chapters I have often been led to use the terms ‘probability’ and ‘chance.’

'Predicted facts,' as I have said above, 'can only be probable.' "However solidly founded a prediction may seem to us to be, we are never absolutely sure that experiment will not prove it false. But the probability is often so great that practically we may be satisfied with it." And a little further on I have added: "See what a rôle the belief in simplicity plays in our generalizations. We have verified a simple law in a great number of particular cases; we refuse to admit that this coincidence, so often repeated, can be a mere effect of chance...."

'Predicted facts,' as I mentioned earlier, 'can only be probable.' "No matter how solid a prediction seems to us, we can never be completely sure that experiments won't prove it wrong. But the likelihood is often so high that we can practically be satisfied with it." And later on, I added: "Look at the role that belief in simplicity plays in our understanding. We’ve confirmed a simple law in a vast number of specific cases; we refuse to accept that this often-repeated coincidence is just a matter of chance...."

Thus in a multitude of circumstances the physicist is in the same position as the gambler who reckons up his chances. As often as he reasons by induction, he requires more or less consciously the calculus of probabilities, and this is why I am obliged to introduce a parenthesis, and interrupt our study of method in the physical sciences in order to examine a little more closely the value of this calculus, and what confidence it merits.

Thus, in many situations, the physicist is like a gambler assessing his odds. Every time he reasons by induction, he consciously or unconsciously relies on probability calculations. This is why I need to take a moment to pause and interrupt our exploration of methods in the physical sciences to take a closer look at the value of this calculus and the level of trust we should place in it.

The very name calculus of probabilities is a paradox. Probability opposed to certainty is what we do not know, and how can we calculate what we do not know? Yet many eminent savants have occupied themselves with this calculus, and it can not be denied that science has drawn therefrom no small advantage.

The term "calculus of probabilities" is itself a contradiction. Probability, which is the opposite of certainty, is about what we don't know; so how can we calculate what we don't know? Yet many distinguished scholars have focused on this calculus, and it’s undeniable that science has gained significant benefits from it.

How can we explain this apparent contradiction?

How can we explain this seeming contradiction?

Has probability been defined? Can it even be defined? And if it can not, how dare we reason about it? The definition, it will[Pg 156] be said, is very simple: the probability of an event is the ratio of the number of cases favorable to this event to the total number of possible cases.

Has probability been defined? Can it even be defined? And if it can't, how do we dare to reason about it? The definition, it will[Pg 156] be said, is very straightforward: the probability of an event is the ratio of the number of cases that favor this event to the total number of possible cases.

A simple example will show how incomplete this definition is. I throw two dice. What is the probability that one of the two at least turns up a six? Each die can turn up in six different ways; the number of possible cases is 6 × 6 = 36; the number of favorable cases is 11; the probability is 11/36.

A simple example will show how incomplete this definition is. I roll two dice. What’s the probability that at least one of them shows a six? Each die can show six different results; the total number of possible outcomes is 6 × 6 = 36; the number of favorable outcomes is 11; the probability is 11/36.

That is the correct solution. But could I not just as well say: The points which turn up on the two dice can form 6 × 7/2 = 21 different combinations? Among these combinations 6 are favorable; the probability is 6/21.

That’s the right answer. But can I also say: The numbers that show up on the two dice can create 6 × 7/2 = 21 different combinations? Out of these combinations, 6 are favorable; the probability is 6/21.

Now why is the first method of enumerating the possible cases more legitimate than the second? In any case it is not our definition that tells us.

Now why is the first method of listing the possible cases more valid than the second? In any case, it isn't our definition that informs us.

We are therefore obliged to complete this definition by saying: '... to the total number of possible cases provided these cases are equally probable.' So, therefore, we are reduced to defining the probable by the probable.

We are therefore required to finish this definition by stating: '... to the total number of possible cases as long as those cases are equally likely.' So, we end up defining the probable in terms of the probable.

How can we know that two possible cases are equally probable? Will it be by a convention? If we place at the beginning of each problem an explicit convention, well and good. We shall then have nothing to do but apply the rules of arithmetic and of algebra, and we shall complete our calculation without our result leaving room for doubt. But if we wish to make the slightest application of this result, we must prove our convention was legitimate, and we shall find ourselves in the presence of the very difficulty we thought to escape.

How can we be sure that two possible scenarios are equally likely? Is it just a matter of convention? If we lay out a clear convention at the start of each problem, that's great. We can then just apply the rules of arithmetic and algebra, and we'll finish our calculations without any doubt about the outcome. But if we want to use this result in any way, we need to prove that our convention was valid, and we'll end up facing the very challenge we hoped to avoid.

Will it be said that good sense suffices to show us what convention should be adopted? Alas! M. Bertrand has amused himself by discussing the following simple problem: "What is the probability that a chord of a circle may be greater than the side of the inscribed equilateral triangle?" The illustrious geometer successively adopted two conventions which good sense seemed equally to dictate and with one he found 1/2, with the other 1/3.

Will it be said that common sense is enough to show us what convention we should follow? Unfortunately, M. Bertrand entertained himself by discussing this straightforward problem: "What is the probability that a chord of a circle could be longer than the side of the inscribed equilateral triangle?" The renowned geometer tried out two conventions that seemed equally sensible, and with one, he found 1/2, and with the other, he found 1/3.

The conclusion which seems to follow from all this is that the calculus of probabilities is a useless science, and that the obscure[Pg 157] instinct which we may call good sense, and to which we are wont to appeal to legitimatize our conventions, must be distrusted.

The conclusion that seems to follow from all this is that probability calculus is a pointless science, and that the unclear[Pg 157] instinct we might call common sense, which we often rely on to validate our norms, should be questioned.

But neither can we subscribe to this conclusion; we can not do without this obscure instinct. Without it science would be impossible, without it we could neither discover a law nor apply it. Have we the right, for instance, to enunciate Newton's law? Without doubt, numerous observations are in accord with it; but is not this a simple effect of chance? Besides how do we know whether this law, true for so many centuries, will still be true next year? To this objection, you will find nothing to reply, except: 'That is very improbable.'

But we can’t agree with this conclusion; we can’t do without this hidden instinct. Without it, science would be impossible; we wouldn’t be able to discover a law or apply it. Do we have the right, for example, to state Newton's law? No doubt, many observations support it; but isn’t that just a coincidence? Plus, how do we know if this law, which has been true for so many centuries, will still hold true next year? To this objection, you’ll have nothing to say except: 'That’s very unlikely.'

But grant the law. Thanks to it, I believe myself able to calculate the position of Jupiter a year from now. Have I the right to believe this? Who can tell if a gigantic mass of enormous velocity will not between now and that time pass near the solar system, and produce unforeseen perturbations? Here again the only answer is: 'It is very improbable.'

But let's accept the law. Because of it, I think I can figure out where Jupiter will be a year from now. Do I have the right to think this? Who knows if a massive object moving at high speed will come close to our solar system before then and cause unexpected changes? Once again, the only answer is: 'It's very unlikely.'

From this point of view, all the sciences would be only unconscious applications of the calculus of probabilities. To condemn this calculus would be to condemn the whole of science.

From this perspective, all sciences would merely be unconscious uses of probability calculus. To criticize this calculus would be to criticize all of science.

I shall dwell lightly on the scientific problems in which the intervention of the calculus of probabilities is more evident. In the forefront of these is the problem of interpolation, in which, knowing a certain number of values of a function, we seek to divine the intermediate values.

I will briefly touch on the scientific issues where the use of probability calculus is most apparent. At the forefront of these is the interpolation problem, where we try to determine the intermediate values by knowing a certain number of function values.

I shall likewise mention: the celebrated theory of errors of observation, to which I shall return later; the kinetic theory of gases, a well-known hypothesis, wherein each gaseous molecule is supposed to describe an extremely complicated trajectory, but in which, through the effect of great numbers, the mean phenomena, alone observable, obey the simple laws of Mariotte and Gay-Lussac.

I will also mention the famous theory of observational errors, to which I will return later; the kinetic theory of gases, a well-known hypothesis where each gas molecule is thought to follow a very complex path, but where, due to the influence of large numbers, the average phenomena that can actually be observed follow the simple laws of Mariotte and Gay-Lussac.

All these theories are based on the laws of great numbers, and the calculus of probabilities would evidently involve them in its ruin. It is true that they have only a particular interest and that, save as far as interpolation is concerned, these are sacrifices to which we might readily be resigned.

All these theories are based on the laws of large numbers, and the calculus of probabilities would clearly lead to their downfall. It's true that they are only of specific interest and that, except for interpolation, these are sacrifices we could easily accept.

But, as I have said above, it would not be only these partial[Pg 158] sacrifices that would be in question; it would be the legitimacy of the whole of science that would be challenged.

But, as I mentioned earlier, it wouldn't just be these partial[Pg 158] sacrifices that would be at stake; it would be the legitimacy of all of science that would be questioned.

I quite see that it might be said: "We are ignorant, and yet we must act. For action, we have not time to devote ourselves to an inquiry sufficient to dispel our ignorance. Besides, such an inquiry would demand an infinite time. We must therefore decide without knowing; we are obliged to do so, hit or miss, and we must follow rules without quite believing them. What I know is not that such and such a thing is true, but that the best course for me is to act as if it were true." The calculus of probabilities, and consequently science itself, would thenceforth have merely a practical value.

I can see how someone might say, "We're in the dark, but we have to take action. We don't have time to dive deep into figuring things out to eliminate our ignorance. Besides, that kind of exploration would take forever. So we have to make decisions without fully understanding; we have no choice but to go for it, whether we're right or wrong, and we have to follow guidelines even if we don't completely believe in them. What I understand is not that something is definitely true, but that the best approach for me is to act as if it is true." From this perspective, the calculus of probabilities, and science in general, would only hold practical significance.

Unfortunately the difficulty does not thus disappear. A gambler wants to try a coup; he asks my advice. If I give it to him, I shall use the calculus of probabilities, but I shall not guarantee success. This is what I shall call subjective probability. In this case, we might be content with the explanation of which I have just given a sketch. But suppose that an observer is present at the game, that he notes all its coups, and that the game goes on a long time. When he makes a summary of his book, he will find that events have taken place in conformity with the laws of the calculus of probabilities. This is what I shall call objective probability, and it is this phenomenon which has to be explained.

Unfortunately, the difficulty doesn't just go away. A gambler wants to try a coup; he asks for my advice. If I give it to him, I'll use probability calculations, but I won't guarantee success. I’ll refer to this as subjective probability. In this case, we could be satisfied with the explanation I've just outlined. But suppose an observer is present at the game, noting all its coups, and the game continues for a long time. When he summarizes his notes, he'll find that events have occurred according to the principles of probability theory. This I'll refer to as objective probability, and this is the phenomenon that needs to be explained.

There are numerous insurance companies which apply the rules of the calculus of probabilities, and they distribute to their shareholders dividends whose objective reality can not be contested. To invoke our ignorance and the necessity to act does not suffice to explain them.

There are many insurance companies that use probability calculations, and they pay dividends to their shareholders that are undeniable in their reality. Just claiming ignorance and the need to take action isn’t enough to explain them.

Thus absolute skepticism is not admissible. We may distrust, but we can not condemn en bloc. Discussion is necessary.

Thus, complete skepticism isn't acceptable. We can be doubtful, but we can't condemn en bloc. Discussion is essential.

I. Classification of the Problems of Probability.—In order to classify the problems which present themselves à propos of probabilities, we may look at them from many different points of view, and, first, from the point of view of generality. I have said above that probability is the ratio of the number of favorable cases to the number of possible cases. What for want of a better term I call the generality will increase with the number of[Pg 159] possible cases. This number may be finite, as, for instance, if we take a throw of the dice in which the number of possible cases is 36. That is the first degree of generality.

I. Classification of Probability Problems.—To classify the problems that arise regarding probabilities, we can approach them from various perspectives, starting with the perspective of generality. As mentioned earlier, probability is the ratio of the number of favorable outcomes to the total number of possible outcomes. What I refer to as generality increases with the number of [Pg 159] possible outcomes. This number can be finite; for example, when rolling dice, the total number of possible outcomes is 36. This represents the first degree of generality.

But if we ask, for example, what is the probability that a point within a circle is within the inscribed square, there are as many possible cases as there are points in the circle, that is to say, an infinity. This is the second degree of generality. Generality can be pushed further still. We may ask the probability that a function will satisfy a given condition. There are then as many possible cases as one can imagine different functions. This is the third degree of generality, to which we rise, for instance, when we seek to find the most probable law in conformity with a finite number of observations.

But if we ask, for example, what’s the probability that a point inside a circle is also inside the square drawn within it, there are as many possible cases as there are points in the circle, meaning an infinite amount. This is the second level of generality. Generality can go even further. We might ask about the probability that a function meets a specific condition. In this case, there are as many possible cases as there are different functions one can imagine. This is the third level of generality, which we reach, for instance, when we try to identify the most likely rule that fits a limited set of observations.

We may place ourselves at a point of view wholly different. If we were not ignorant, there would be no probability, there would be room for nothing but certainty. But our ignorance can not be absolute, for then there would no longer be any probability at all, since a little light is necessary to attain even this uncertain science. Thus the problems of probability may be classed according to the greater or less depth of this ignorance.

We can adopt a completely different perspective. If we were fully knowledgeable, there would be no chance, only certainty. However, our ignorance can't be total, because then there would be no probability whatsoever, since some understanding is needed to engage with this uncertain field of study. Therefore, we can categorize the issues of probability based on how deep or shallow this ignorance is.

In mathematics even we may set ourselves problems of probability. What is the probability that the fifth decimal of a logarithm taken at random from a table is a '9'? There is no hesitation in answering that this probability is 1/10; here we possess all the data of the problem. We can calculate our logarithm without recourse to the table, but we do not wish to give ourselves the trouble. This is the first degree of ignorance.

In math, we can even create our own probability problems. What's the chance that the fifth decimal of a logarithm picked randomly from a table is a '9'? There's no doubt the answer is 1/10; we have all the information we need for the problem. We could calculate the logarithm ourselves without using the table, but we don't want to put in the effort. This is the first level of ignorance.

In the physical sciences our ignorance becomes greater. The state of a system at a given instant depends on two things: Its initial state, and the law according to which that state varies. If we know both this law and this initial state, we shall have then only a mathematical problem to solve, and we fall back upon the first degree of ignorance.

In the physical sciences, our lack of knowledge increases. The condition of a system at any moment relies on two factors: its initial state and the law governing how that state changes. If we understand both this law and the initial state, we are left with just a mathematical problem to solve, bringing us back to the first level of ignorance.

But it often happens that we know the law, and do not know the initial state. It may be asked, for instance, what is the present distribution of the minor planets? We know that from all time they have obeyed the laws of Kepler, but we do not know what was their initial distribution.[Pg 160]

But it often happens that we know the law, but don’t know the starting point. For example, we might wonder what the current distribution of the minor planets is. We know they have always followed Kepler's laws, but we don’t know what their initial distribution was.[Pg 160]

In the kinetic theory of gases, we assume that the gaseous molecules follow rectilinear trajectories, and obey the laws of impact of elastic bodies. But, as we know nothing of their initial velocities, we know nothing of their present velocities.

In the kinetic theory of gases, we assume that the gas molecules move in straight lines and follow the laws of collisions for elastic bodies. However, since we don't know their initial velocities, we can't determine their current velocities either.

The calculus of probabilities only enables us to predict the mean phenomena which will result from the combination of these velocities. This is the second degree of ignorance.

The calculus of probabilities only allows us to predict the average outcomes that will result from combining these speeds. This is the second level of ignorance.

Finally it is possible that not only the initial conditions but the laws themselves are unknown. We then reach the third degree of ignorance and in general we can no longer affirm anything at all as to the probability of a phenomenon.

Finally, it's possible that not only the initial conditions but also the laws themselves are unknown. We then reach the third level of ignorance, and generally, we can no longer assert anything at all about the probability of a phenomenon.

It often happens that instead of trying to guess an event, by means of a more or less imperfect knowledge of the law, the events may be known and we want to find the law; or that instead of deducing effects from causes, we wish to deduce the causes from the effects. These are the problems called probability of causes, the most interesting from the point of view of their scientific applications.

It often occurs that instead of trying to predict an event using a somewhat flawed understanding of the law, we actually know the events and want to discover the law; or that instead of inferring effects from causes, we want to infer the causes from the effects. These dilemmas are referred to as probability of causes, which are the most fascinating regarding their scientific applications.

I play écarté with a gentleman I know to be perfectly honest. He is about to deal. What is the probability of his turning up the king? It is 1/8. This is a problem of the probability of effects.

I play écarté with a guy I know is completely honest. He's about to deal. What are the chances he’ll draw the king? It’s 1/8. This is a problem involving the probability of outcomes.

I play with a gentleman whom I do not know. He has dealt ten times, and he has turned up the king six times. What is the probability that he is a sharper? This is a problem in the probability of causes.

I’m playing with a guy I don’t know. He’s dealt ten times and has revealed the king six times. What’s the chance that he’s cheating? This is a question about the probability of causes.

It may be said that this is the essential problem of the experimental method. I have observed n values of x and the corresponding values of y. I have found that the ratio of the latter to the former is practically constant. There is the event, what is the cause?

It can be said that this is the core issue of the experimental method. I have recorded n values of x and their corresponding y values. I have discovered that the ratio of the latter to the former is almost constant. There is the situation, but what is the cause?

Is it probable that there is a general law according to which y would be proportional to x, and that the small divergencies are due to errors of observation? This is a type of question that one is ever asking, and which we unconsciously solve whenever we are engaged in scientific work.

Is it likely that there’s a general law stating that y would be proportional to x, and that the small differences are due to observational errors? This is the kind of question we constantly ask, and we unconsciously address it whenever we’re involved in scientific work.

I am now going to pass in review these different categories of[Pg 161] problems, discussing in succession what I have called above subjective and objective probability.

I will now review these different categories of[Pg 161] problems, discussing in order what I've previously referred to as subjective and objective probability.

II. Probability in Mathematics.—The impossibility of squaring the circle has been proved since 1882; but even before that date all geometers considered that impossibility as so 'probable,' that the Academy of Sciences rejected without examination the alas! too numerous memoirs on this subject, that some unhappy madmen sent in every year.

II. Probability in Math.—The impossibility of squaring the circle has been proven since 1882; however, even before that, all geometers regarded that impossibility as so 'likely' that the Academy of Sciences rejected without review the unfortunately too numerous papers on this topic that some misguided individuals submitted every year.

Was the Academy wrong? Evidently not, and it knew well that in acting thus it did not run the least risk of stifling a discovery of moment. The Academy could not have proved that it was right; but it knew quite well that its instinct was not mistaken. If you had asked the Academicians, they would have answered: "We have compared the probability that an unknown savant should have found out what has been vainly sought for so long, with the probability that there is one madman the more on the earth; the second appears to us the greater." These are very good reasons, but there is nothing mathematical about them; they are purely psychological.

Was the Academy wrong? Clearly not, and it understood that by acting this way, it wasn’t risking the suppression of an important discovery. The Academy couldn’t prove it was right, but it was confident that its instincts were correct. If you had asked the Academicians, they would have said, "We’ve weighed the likelihood that an unknown expert has discovered what has been unsuccessfully pursued for so long against the likelihood that there’s one more madman on earth; the latter seems more probable to us." These are solid reasons, but they aren't mathematical; they're purely psychological.

And if you had pressed them further they would have added: "Why do you suppose a particular value of a transcendental function to be an algebraic number; and if π were a root of an algebraic equation, why do you suppose this root to be a period of the function sin 2x, and not the same about the other roots of this same equation?" To sum up, they would have invoked the principle of sufficient reason in its vaguest form.

And if you had pushed them a bit more, they would have said: "Why do you think a specific value of a transcendental function is an algebraic number? And if π were a solution to an algebraic equation, why would you think this solution is a period of the function sin 2x, and not the same for the other solutions of this equation?" In short, they would have referenced the principle of sufficient reason in its most general sense.

But what could they deduce from it? At most a rule of conduct for the employment of their time, more usefully spent at their ordinary work than in reading a lucubration that inspired in them a legitimate distrust. But what I call above objective probability has nothing in common with this first problem.

But what could they figure out from it? At most, it would be a guideline for how to spend their time, which is better spent on their usual work than on reading a piece that made them justifiably suspicious. However, what I referred to earlier as objective probability is completely different from this initial issue.

It is otherwise with the second problem.

It’s different with the second problem.

Consider the first 10,000 logarithms that we find in a table. Among these 10,000 logarithms I take one at random. What is the probability that its third decimal is an even number? You will not hesitate to answer 1/2; and in fact if you pick out in a table the third decimals of these 10,000 numbers, you will find nearly as many even digits as odd.[Pg 162]

Consider the first 10,000 logarithms that we see in a table. Among these 10,000 logarithms, I take one at random. What’s the probability that its third decimal is an even number? You would probably say 1/2; and in fact, if you look at the third decimals of these 10,000 numbers, you'll find almost as many even digits as odd.[Pg 162]

Or if you prefer, let us write 10,000 numbers corresponding to our 10,000 logarithms, each of these numbers being +1 if the third decimal of the corresponding logarithm is even, and −1 if odd. Then take the mean of these 10,000 numbers.

Or if you prefer, let's write 10,000 numbers that match our 10,000 logarithms, where each number is +1 if the third decimal of the corresponding logarithm is even, and -1 if it's odd. Then calculate the average of these 10,000 numbers.

I do not hesitate to say that the mean of these 10,000 numbers is probably 0, and if I were actually to calculate it I should verify that it is extremely small.

I can confidently say that the average of these 10,000 numbers is probably 0, and if I were to actually calculate it, I would find that it is very small.

But even this verification is needless. I might have rigorously proved that this mean is less than 0.003. To prove this result, I should have had to make a rather long calculation for which there is no room here, and for which I confine myself to citing an article I published in the Revue générale des Sciences, April 15, 1899. The only point to which I wish to call attention is the following: in this calculation, I should have needed only to rest my case on two facts, to wit, that the first and second derivatives of the logarithm remain, in the interval considered, between certain limits.

But even this verification is unnecessary. I could have rigorously shown that this average is less than 0.003. To prove this, I would have had to do a fairly long calculation, which there's no space for here, so I will just refer to an article I published in the Revue générale des Sciences on April 15, 1899. The only point I want to highlight is this: in this calculation, I would only need to rely on two facts, namely, that the first and second derivatives of the logarithm stay within certain limits in the interval considered.

Hence this important consequence that the property is true not only of the logarithm, but of any continuous function whatever, since the derivatives of every continuous function are limited.

Hence, this important outcome indicates that this property applies not just to logarithms, but to any continuous function, since the derivatives of all continuous functions are bounded.

If I was certain beforehand of the result, it is first, because I had often observed analogous facts for other continuous functions; and next, because I made in my mind, in a more or less unconscious and imperfect manner, the reasoning which led me to the preceding inequalities, just as a skilled calculator before finishing his multiplication takes into account what it should come to approximately.

If I was sure about the outcome beforehand, it's primarily because I've seen similar cases with other continuous functions; and secondly, because I unconsciously and somewhat imperfectly reasoned my way to the previous inequalities, much like a skilled calculator who has an idea of what the result of their multiplication should roughly be before completing it.

And besides, since what I call my intuition was only an incomplete summary of a piece of true reasoning, it is clear why observation has confirmed my predictions, and why the objective probability has been in agreement with the subjective probability.

And besides, since what I refer to as my intuition was just an incomplete summary of a valid reasoning process, it’s clear why observation has backed up my predictions, and why the objective probability has matched the subjective probability.

As a third example I shall choose the following problem: A number u is taken at random, and n is a given very large integer. What is the probable value of sin nu? This problem has no meaning by itself. To give it one a convention is needed. We shall agree that the probability for the number u to lie between a and a+ is equal to ϕ(a)da; that it is therefore proportional to the infinitely small interval da, and equal to this multiplied by a function ϕ(a) depending only on a. As for this function, I[Pg 163] choose it arbitrarily, but I must assume it to be continuous. The value of sin nu remaining the same when u increases by 2π, I may without loss of generality assume that u lies between 0 and 2π, and I shall thus be led to suppose that ϕ(a) is a periodic function whose period is 2π.

As a third example, let's consider the following problem: A random number u is selected, and n is a very large integer. What’s the likely value of sin nu? This question doesn't have any significance on its own. To give it meaning, we need to establish a convention. We will agree that the probability of the number u being between a and a+ is equal to ϕ(a)da; which means it's proportional to the infinitesimal interval da, and equals that multiplied by the function ϕ(a) that only depends on a. Regarding this function, I[Pg 163] will choose it arbitrarily, but I need to assume it’s continuous. Since the value of sin nu remains the same when u increases by 2π, I can assume without losing any generality that u lies between 0 and 2π, leading me to suppose that ϕ(a) is a periodic function with a period of 2π.

The probable value sought is readily expressed by a simple integral, and it is easy to show that this integral is less than

The probable value being sought can be easily represented by a simple integral, and it's straightforward to demonstrate that this integral is less than

2πMknk,

2πMknk,

Mk being the maximum value of the kth derivative of ϕ(u). We see then that if the kth derivative is finite, our probable value will tend toward 0 when n increases indefinitely, and that more rapidly than 1/nk−1.

Mk is the highest value of the kth derivative of ϕ(u). We can therefore see that if the kth derivative is finite, our expected value will approach 0 as n increases indefinitely, and this will happen more quickly than 1/nk−1.

The probable value of sin nu when n is very large is therefore naught. To define this value I required a convention; but the result remains the same whatever that convention may be. I have imposed upon myself only slight restrictions in assuming that the function ϕ(a) is continuous and periodic, and these hypotheses are so natural that we may ask ourselves how they can be escaped.

The likely value of sin nu when n is really large is basically zero. I needed a convention to define this value; however, the outcome stays the same no matter what that convention is. I’ve only placed a few minor limits on myself by assuming that the function ϕ(a) is continuous and periodic, and these assumptions are so obvious that we can wonder how we could possibly avoid them.

Examination of the three preceding examples, so different in all respects, has already given us a glimpse, on the one hand, of the rôle of what philosophers call the principle of sufficient reason, and, on the other hand, of the importance of the fact that certain properties are common to all continuous functions. The study of probability in the physical sciences will lead us to the same result.

Looking at the three previous examples, which are very different in every way, has already shown us, on one hand, the role of what philosophers refer to as the principle of sufficient reason, and, on the other hand, the significance of the fact that certain properties are shared by all continuous functions. Studying probability in the physical sciences will lead us to the same conclusion.

III. Probability in the Physical Sciences.—We come now to the problems connected with what I have called the second degree of ignorance, those, namely, in which we know the law, but do not know the initial state of the system. I could multiply examples, but will take only one. What is the probable present distribution of the minor planets on the zodiac?

III. Probability in the Physical Sciences.—Now we address the issues related to what I've termed the second degree of ignorance, specifically those situations where we understand the law, but don't know the system's initial state. I could provide many examples, but I'll mention just one. What is the likely current distribution of the minor planets in the zodiac?

We know they obey the laws of Kepler. We may even, without at all changing the nature of the problem, suppose that their orbits are all circular, and situated in the same plane, and that we know this plane. On the other hand, we are in absolute ignorance as to what was their initial distribution. However, we do not[Pg 164] hesitate to affirm that their distribution is now nearly uniform. Why?

We know they follow Kepler's laws. We can even assume, without changing the problem, that their orbits are all circular, lie in the same plane, and that we know this plane. However, we have no idea what their initial distribution was. Still, we confidently state that their current distribution is nearly uniform. Why?

Let b be the longitude of a minor planet in the initial epoch, that is to say, the epoch zero. Let a be its mean motion. Its longitude at the present epoch, that is to say at the epoch t, will be at + b. To say that the present distribution is uniform is to say that the mean value of the sines and cosines of multiples of at + b is zero. Why do we assert this?

Let b be the longitude of a minor planet at the starting point, which we call epoch zero. Let a be its average movement. Its longitude now, at epoch t, will be at + b. When we say the current distribution is uniform, we mean that the average value of the sines and cosines of multiples of at + b is zero. Why do we make this assertion?

Let us represent each minor planet by a point in a plane, to wit, by a point whose coordinates are precisely a and b. All these representative points will be contained in a certain region of the plane, but as they are very numerous this region will appear dotted with points. We know nothing else about the distribution of these points.

Let’s represent each minor planet as a point on a plane, specifically a point with coordinates a and b. All these representative points will be found within a certain area of the plane, but since they are very numerous, this area will look dotted with points. We don’t know anything else about how these points are distributed.

What do we do when we wish to apply the calculus of probabilities to such a question? What is the probability that one or more representative points may be found in a certain portion of the plane? In our ignorance, we are reduced to making an arbitrary hypothesis. To explain the nature of this hypothesis, allow me to use, in lieu of a mathematical formula, a crude but concrete image. Let us suppose that over the surface of our plane has been spread an imaginary substance, whose density is variable, but varies continuously. We shall then agree to say that the probable number of representative points to be found on a portion of the plane is proportional to the quantity of fictitious matter found there. If we have then two regions of the plane of the same extent, the probabilities that a representative point of one of our minor planets is found in one or the other of these regions will be to one another as the mean densities of the fictitious matter in the one and the other region.

What do we do when we want to use probability calculus to tackle this question? What is the likelihood that one or more representative points can be found in a specific area of the plane? Since we're unsure, we have to make an arbitrary assumption. To clarify this assumption, let me use a simple but clear analogy instead of a mathematical formula. Imagine that an imaginary substance with a variable but smoothly changing density has been spread over the surface of our plane. We'll agree that the expected number of representative points in a section of the plane is proportional to the amount of this imaginary substance present there. If we have two equal areas of the plane, the chances of finding a representative point of one of our minor planets in either of those areas will relate to the average densities of the imaginary substance in each area.

Here then are two distributions, one real, in which the representative points are very numerous, very close together, but discrete like the molecules of matter in the atomic hypothesis; the other remote from reality, in which our representative points are replaced by continuous fictitious matter. We know that the latter can not be real, but our ignorance forces us to adopt it.

Here are two distributions: one is real, with many representative points that are very close together but separate, like molecules in the atomic theory; the other is far from reality, where our representative points are replaced by continuous imaginary matter. We know the latter can't be real, but our lack of understanding makes us accept it.

If again we had some idea of the real distribution of the representative points, we could arrange it so that in a region[Pg 165] of some extent the density of this imaginary continuous matter would be nearly proportional to the number of the representative points, or, if you wish, to the number of atoms which are contained in that region. Even that is impossible, and our ignorance is so great that we are forced to choose arbitrarily the function which defines the density of our imaginary matter. Only we shall be forced to a hypothesis from which we can hardly get away, we shall suppose that this function is continuous. That is sufficient, as we shall see, to enable us to reach a conclusion.

If we had a better understanding of how the representative points are actually distributed, we could arrange it so that in a certain area[Pg 165] the density of this imaginary continuous matter would be nearly proportional to the number of representative points, or, if you prefer, to the number of atoms in that area. However, that's still not possible, and our lack of knowledge is so significant that we have to choose the function that defines the density of our imaginary matter arbitrarily. We will have to assume a hypothesis that we can hardly avoid, which is that this function is continuous. This assumption is enough, as we will see, to help us reach a conclusion.

What is at the instant t the probable distribution of the minor planets? Or rather what is the probable value of the sine of the longitude at the instant t, that is to say of sin (at + b)? We made at the outset an arbitrary convention, but if we adopt it, this probable value is entirely defined. Divide the plane into elements of surface. Consider the value of sin (at + b) at the center of each of these elements; multiply this value by the surface of the element, and by the corresponding density of the imaginary matter. Take then the sum for all the elements of the plane. This sum, by definition, will be the probable mean value we seek, which will thus be expressed by a double integral. It may be thought at first that this mean value depends on the choice of the function which defines the density of the imaginary matter, and that, as this function ϕ is arbitrary, we can, according to the arbitrary choice which we make, obtain any mean value. This is not so.

What is the likely distribution of the minor planets at that moment t? More specifically, what is the probable value of the sine of the longitude at that moment t, which means sin (at + b)? We established an arbitrary convention at the beginning, but if we stick to it, this probable value is clearly defined. Divide the plane into surface elements. Look at the value of sin (at + b) at the center of each of these elements; multiply this value by the area of the element and by the corresponding density of the imaginary matter. Then sum this for all the elements of the plane. This total, by definition, will be the average value we’re looking for, expressed as a double integral. At first glance, one might think that this average value depends on the selection of the function that defines the density of the imaginary matter, and because this function ϕ is arbitrary, depending on our choice, we can obtain any average value. This is not the case.

A simple calculation shows that our double integral decreases very rapidly when t increases. Thus I could not quite tell what hypothesis to make as to the probability of this or that initial distribution; but whatever the hypothesis made, the result will be the same, and this gets me out of my difficulty.

A simple calculation shows that our double integral decreases very quickly as t increases. So, I couldn't really decide what hypothesis to make about the probability of this or that initial distribution; but no matter what hypothesis is chosen, the result will be the same, and this solves my problem.

Whatever be the function ϕ, the mean value tends toward zero as t increases, and as the minor planets have certainly accomplished a very great number of revolutions, I may assert that this mean value is very small.

Whatever the function ϕ is, the average value approaches zero as t increases, and since the minor planets have definitely completed a vast number of revolutions, I can say that this average value is very small.

I may choose ϕ as I wish, save always one restriction: this function must be continuous; and, in fact, from the point of view of subjective probability, the choice of a discontinuous function would have been unreasonable. For instance, what reason could[Pg 166] I have for supposing that the initial longitude might be exactly 0°, but that it could not lie between 0° and 1°?

I can choose ϕ however I like, but there's one condition: this function has to be continuous. From the perspective of subjective probability, picking a discontinuous function just wouldn't make sense. For example, why would I think that the initial longitude could be exactly 0°, but not anywhere between 0° and 1°?

But the difficulty reappears if we take the point of view of objective probability, if we pass from our imaginary distribution in which the fictitious matter was supposed continuous to the real distribution in which our representative points form, as it were, discrete atoms.

But the difficulty comes back if we consider the perspective of objective probability, moving from our imaginary distribution, where the assumed matter was continuous, to the real distribution where our representative points essentially form discrete atoms.

The mean value of sin (at + b) will be represented quite simply by

The average value of sin (at + b) can be shown quite easily by

(1/n) Σ sin (at + b),

(1/n) Σ sin (at + b),

n being the number of minor planets. In lieu of a double integral referring to a continuous function, we shall have a sum of discrete terms. And yet no one will seriously doubt that this mean value is practically very small.

n is the number of minor planets. Instead of a double integral relating to a continuous function, we'll have a sum of discrete terms. Still, no one will really question that this mean value is actually very small.

Our representative points being very close together, our discrete sum will in general differ very little from an integral.

Our representative points are very close together, so our discrete sum will generally be quite similar to an integral.

An integral is the limit toward which a sum of terms tends when the number of these terms is indefinitely increased. If the terms are very numerous, the sum will differ very little from its limit, that is to say from the integral, and what I said of this latter will still be true of the sum itself.

An integral is the limit that a sum of terms approaches as the number of terms increases indefinitely. If there are a lot of terms, the sum will be very close to its limit, which is the integral, and everything I said about the integral will also apply to the sum itself.

Nevertheless, there are exceptions. If, for instance, for all the minor planets,

Nevertheless, there are exceptions. If, for example, for all the minor planets,

b = π/2 − at,

b = π/2 − at,

the longitude for all the planets at the time t would be π/2, and the mean value would evidently be equal to unity. For this to be the case, it would be necessary that at the epoch 0, the minor planets must have all been lying on a spiral of peculiar form, with its spires very close together. Every one will admit that such an initial distribution is extremely improbable (and, even supposing it realized, the distribution would not be uniform at the present time, for example, on January 1, 1913, but it would become so a few years later).

the longitude for all the planets at time t would be π/2, and the average value would clearly be equal to one. For this to happen, it would be necessary that at the starting point, the minor planets had all been arranged on a uniquely shaped spiral, with its arms very close together. Everyone would agree that such an initial arrangement is highly unlikely (and, even if it did happen, the distribution would not be uniform at the present time, for instance, on January 1, 1913, but it would become so a few years later).

Why then do we think this initial distribution improbable? This must be explained, because if we had no reason for rejecting[Pg 167] as improbable this absurd hypothesis everything would break down, and we could no longer make any affirmation about the probability of this or that present distribution.

Why do we consider this initial distribution unlikely? This needs to be clarified, because if we have no reason to view[Pg 167] it as improbable, this ridiculous hypothesis would unravel everything, and we wouldn't be able to make any statements about the likelihood of any current distribution.

Once more we shall invoke the principle of sufficient reason to which we must always recur. We might admit that at the beginning the planets were distributed almost in a straight line. We might admit that they were irregularly distributed. But it seems to us that there is no sufficient reason for the unknown cause that gave them birth to have acted along a curve so regular and yet so complicated, which would appear to have been expressly chosen so that the present distribution would not be uniform.

Once again, we will refer to the principle of sufficient reason, which we must always revisit. We might acknowledge that at the start, the planets were arranged almost in a straight line. We might also accept that they were distributed irregularly. However, it seems to us that there isn’t a sufficient reason for the unknown cause that created them to have acted in such a regular yet complicated curve, which seems to have been deliberately chosen so that the current distribution wouldn’t be uniform.

IV. Rouge et Noir.—The questions raised by games of chance, such as roulette, are, fundamentally, entirely analogous to those we have just treated. For example, a wheel is partitioned into a great number of equal subdivisions, alternately red and black. A needle is whirled with force, and after having made a great number of revolutions, it stops before one of these subdivisions. The probability that this division is red is evidently 1/2. The needle describes an angle θ, including several complete revolutions. I do not know what is the probability that the needle may be whirled with a force such that this angle should lie between θ and θ + dθ; but I can make a convention. I can suppose that this probability is ϕ(θ)dθ. As for the function ϕ(θ), I can choose it in an entirely arbitrary manner. There is nothing that can guide me in my choice, but I am naturally led to suppose this function continuous.

IV. Red and Black.—The issues raised by games of chance, like roulette, are basically the same as the ones we've just discussed. For instance, a wheel is divided into many equal sections, alternating between red and black. A needle is spun with force, and after a lot of revolutions, it stops at one of these sections. The chance of this section being red is clearly 1/2. The needle creates an angle θ, making several complete revolutions. I don’t know what the probability is that the needle could be spun with enough force for this angle to be between θ and θ + dθ; however, I can establish a convention. I’ll assume that this probability is ϕ(θ)dθ. As for the function ϕ(θ), I can define it in any way I choose. There’s nothing that can direct my choice, but I’m naturally inclined to think of this function as continuous.

Let ε be the length (measured on the circumference of radius 1) of each red and black subdivision. We have to calculate the integral of ϕ(θ)dθ, extending it, on the one hand, to all the red divisions and, on the other hand, to all the black divisions, and to compare the results.

Let ε be the length (measured along the circumference of radius 1) of each red and black section. We need to calculate the integral of ϕ(θ)dθ, extending it to all the red sections on one side and to all the black sections on the other, and then compare the results.

Consider an interval 2ε, comprising a red division and a black division which follows it. Let M and m be the greatest and least values of the function ϕ(θ) in this interval. The integral extended to the red divisions will be smaller than ΣMε; the integral extended to the black divisions will be greater than Σmε; the difference will therefore be less than Σ(M − m)ε. But, if the function θ is supposed continuous; if, besides, the interval ε is very[Pg 168] small with respect to the total angle described by the needle, the difference M − m will be very small. The difference of the two integrals will therefore be very small, and the probability will be very nearly 1/2.

Consider an interval of 2ε, which consists of a red section followed by a black section. Let M and m be the highest and lowest values of the function ϕ(θ) in this interval. The integral over the red sections will be less than ΣMε; the integral over the black sections will be greater than Σmε; thus, the difference will be less than Σ(M − m)ε. However, if we assume that the function θ is continuous, and if the interval ε is very[Pg 168] small compared to the overall angle covered by the needle, the difference M − m will also be very small. Consequently, the difference between the two integrals will be very small, and the probability will be very close to 1/2.

We see that without knowing anything of the function θ, I must act as if the probability were 1/2. We understand, on the other hand, why, if, placing myself at the objective point of view, I observe a certain number of coups, observation will give me about as many black coups as red.

We see that without knowing anything about the function θ, I have to act as if the probability is 1/2. On the other hand, we understand why, if I take an objective viewpoint and observe a certain number of flips, I will end up seeing about as many black flips as red.

All players know this objective law; but it leads them into a remarkable error, which has been often exposed, but into which they always fall again. When the red has won, for instance, six times running, they bet on the black, thinking they are playing a safe game; because, say they, it is very rare that red wins seven times running.

All players are aware of this fundamental principle; however, it leads them into a significant mistake, which has been pointed out many times, yet they keep falling into it. For example, when red has won six times in a row, they start betting on black, believing they’re making a safe choice; because, as they say, it’s very unlikely that red will win seven times in a row.

In reality their probability of winning remains 1/2. Observation shows, it is true, that series of seven consecutive reds are very rare, but series of six reds followed by a black are just as rare.

In reality, their chances of winning stay at 1/2. It's true that observing series of seven consecutive reds is quite rare, but series of six reds followed by a black are just as uncommon.

They have noticed the rarity of the series of seven reds; if they have not remarked the rarity of six reds and a black, it is only because such series strike the attention less.

They have noticed how rare it is to find seven reds in a row; if they haven't observed how rare it is to have six reds and one black, it's only because that combination catches their attention less.

V. The Probability of Causes.—We now come to the problems of the probability of causes, the most important from the point of view of scientific applications. Two stars, for instance, are very close together on the celestial sphere. Is this apparent contiguity a mere effect of chance? Are these stars, although on almost the same visual ray, situated at very different distances from the earth, and consequently very far from one another? Or, perhaps, does the apparent correspond to a real contiguity? This is a problem on the probability of causes.

V. The Likelihood of Causes.—Now we’ll discuss the problems of the probability of causes, which are crucial for scientific applications. For example, two stars may appear very close together in the night sky. Is this seeming proximity just a coincidence? Are these stars, although they appear on nearly the same visual line, actually at very different distances from the Earth, and therefore very far apart? Or maybe their apparent closeness reflects a true proximity? This is a question about the probability of causes.

I recall first that at the outset of all problems of the probability of effects that have hitherto occupied us, we have always had to make a convention, more or less justified. And if in most cases the result was, in a certain measure, independent of this convention, this was only because of certain hypotheses which permitted us to reject a priori discontinuous functions, for example, or certain absurd conventions.

I remember that at the beginning of all the probability issues we've been dealing with, we always had to establish a convention, somewhat justified. And even though in most cases the outcome was somewhat independent of this convention, it was only because of certain assumptions that allowed us to dismiss a priori discontinuous functions, for instance, or some unreasonable conventions.

We shall find something analogous when we deal with the[Pg 169] probability of causes. An effect may be produced by the cause A or by the cause B. The effect has just been observed. We ask the probability that it is due to the cause A. This is an a posteriori probability of cause. But I could not calculate it, if a convention more or less justified did not tell me in advance what is the a priori probability for the cause A to come into play; I mean the probability of this event for some one who had not observed the effect.

We will find something similar when we look at the[Pg 169] probability of causes. An effect can be caused by either cause A or cause B. The effect has just been observed. We want to know the probability that it was caused by A. This is a post hoc probability of the cause. However, I wouldn't be able to calculate it without a more or less justified convention that provides me in advance with the prior probability of cause A being involved; I mean the probability of this event for someone who hadn’t observed the effect.

The better to explain myself I go back to the example of the game of écarté mentioned above. My adversary deals for the first time and he turns up a king. What is the probability that he is a sharper? The formulas ordinarily taught give 8/9, a result evidently rather surprising. If we look at it closer, we see that the calculation is made as if, before sitting down at the table, I had considered that there was one chance in two that my adversary was not honest. An absurd hypothesis, because in that case I should have certainly not played with him, and this explains the absurdity of the conclusion.

To explain myself better, let’s go back to the example of the game of écarté I mentioned earlier. My opponent deals for the first time and reveals a king. What are the odds that he’s cheating? The formulas typically taught give a probability of 8/9, which is quite surprising. If we examine it more closely, we see that the calculation assumes that, before sitting down at the table, I believed there was a 50/50 chance that my opponent wasn’t honest. That’s a ridiculous assumption, because in that case, I definitely wouldn’t have played with him, which shows why the conclusion is so absurd.

The convention about the a priori probability was unjustified, and that is why the calculation of the a posteriori probability led me to an inadmissible result. We see the importance of this preliminary convention. I shall even add that if none were made, the problem of the a posteriori probability would have no meaning. It must always be made either explicitly or tacitly.

The idea of a priori probability was not justified, which is why calculating the a posteriori probability led me to an unacceptable result. This shows how important this initial convention is. I would even say that without it, the problem of a posteriori probability would be meaningless. It must always be established, either explicitly or implicitly.

Pass to an example of a more scientific character. I wish to determine an experimental law. This law, when I know it, can be represented by a curve. I make a certain number of isolated observations; each of these will be represented by a point. When I have obtained these different points, I draw a curve between them, striving to pass as near to them as possible and yet preserve for my curve a regular form, without angular points, or inflections too accentuated, or brusque variation of the radius of curvature. This curve will represent for me the probable law, and I assume not only that it will tell me the values of the function intermediate between those which have been observed, but also that it will give me the observed values themselves more exactly than direct observation. This is why I make it pass near the points, and not through the points themselves.[Pg 170]

Let's look at a more scientific example. I want to establish an experimental law. Once I understand this law, I can represent it with a curve. I make several isolated observations; each of these will be shown as a point. After gathering these points, I draw a curve between them, trying to stay as close as possible yet keeping the curve smooth, avoiding sharp angles, excessive bends, or sudden changes in curvature. This curve will represent the likely law, and I expect it will not only indicate the intermediate values of the function between the observed ones but also provide those observed values more accurately than direct measurement. That's why I aim for the curve to be close to the points rather than passing through them.[Pg 170]

Here is a problem in the probability of causes. The effects are the measurements I have recorded; they depend on a combination of two causes: the true law of the phenomenon and the errors of observation. Knowing the effects, we have to seek the probability that the phenomenon obeys this law or that, and that the observations have been affected by this or that error. The most probable law then corresponds to the curve traced, and the most probable error of an observation is represented by the distance of the corresponding point from this curve.

Here is a problem in the probability of causes. The effects are the measurements I’ve recorded; they rely on a combination of two causes: the actual law of the phenomenon and the errors in observation. Given the effects, we need to determine the likelihood that the phenomenon follows this law or that one, and that the observations have been influenced by this or that error. The most likely law corresponds to the curve drawn, and the most likely error in an observation is indicated by the distance of the related point from this curve.

But the problem would have no meaning if, before any observation, I had not fashioned an a priori idea of the probability of this or that law, and of the chances of error to which I am exposed.

But the problem wouldn't make sense if, before any observation, I hadn't created an a priori idea of the likelihood of this or that law and the chances of error I might face.

If my instruments are good (and that I knew before making the observations), I shall not permit my curve to depart much from the points which represent the rough measurements. If they are bad, I may go a little further away from them in order to obtain a less sinuous curve; I shall sacrifice more to regularity.

If my instruments are reliable (which I knew before taking the measurements), I won’t let my curve stray far from the points that reflect the rough measurements. If they aren't reliable, I might move a bit further away to create a smoother curve; I'll prioritize consistency over precision.

Why then is it that I seek to trace a curve without sinuosities? It is because I consider a priori a law represented by a continuous function (or by a function whose derivatives of high order are small), as more probable than a law not satisfying these conditions. Without this belief, the problem of which we speak would have no meaning; interpolation would be impossible; no law could be deduced from a finite number of observations; science would not exist.

Why is it that I try to outline a curve without bends? It’s because I assume that a law described by a continuous function (or a function with small higher-order derivatives) is more likely than a law that doesn't meet these criteria. Without this assumption, the problem we're discussing wouldn't make sense; interpolation would be impossible; no law could be derived from a limited number of observations; science wouldn’t exist.

Fifty years ago physicists considered, other things being equal, a simple law as more probable than a complicated law. They even invoked this principle in favor of Mariotte's law as against the experiments of Regnault. To-day they have repudiated this belief; and yet, how many times are they compelled to act as though they still held it! However that may be, what remains of this tendency is the belief in continuity, and we have just seen that if this belief were to disappear in its turn, experimental science would become impossible.

Fifty years ago, physicists thought that, all else being equal, a simple law was more likely than a complicated one. They even used this idea to support Mariotte's law over Regnault's experiments. Today, they have rejected this belief; yet, how often do they still behave as if they believe it? Regardless, what persists from this tendency is the belief in continuity, and we've just seen that if this belief were to vanish as well, experimental science would become impossible.

VI. The Theory of Errors.—We are thus led to speak of the theory of errors, which is directly connected with the problem of the probability of causes. Here again we find effects, to wit, a certain number of discordant observations, and we seek to[Pg 171] divine the causes, which are, on the one hand, the real value of the quantity to be measured; on the other hand, the error made in each isolated observation. It is necessary to calculate what is a posteriori the probable magnitude of each error, and consequently the probable value of the quantity to be measured.

VI. The Error Theory.—This leads us to discuss the theory of errors, which is closely related to the issue of the probability of causes. Here, we find effects, specifically a number of conflicting observations, and we aim to [Pg 171] determine the causes. These causes include, on one hand, the actual value of the quantity being measured, and on the other hand, the error made in each individual observation. It’s important to calculate, retrospectively, the likely size of each error, and consequently, the probable value of the quantity being measured.

But as I have just explained, we should not know how to undertake this calculation if we did not admit a priori, that is to say, before all observation, a law of probability of errors. Is there a law of errors?

But as I just explained, we wouldn't know how to do this calculation if we didn't accept a priori, that is, before any observation, a law of probability of errors. Is there a law of errors?

The law of errors admitted by all calculators is Gauss's law, which is represented by a certain transcendental curve known under the name of 'the bell.'

The law of errors recognized by all calculators is Gauss's law, which is represented by a specific transcendental curve called 'the bell.'

But first it is proper to recall the classic distinction between systematic and accidental errors. If we measure a length with too long a meter, we shall always find too small a number, and it will be of no use to measure several times; this is a systematic error. If we measure with an accurate meter, we may, however, make a mistake; but we go wrong, now too much, now too little, and when we take the mean of a great number of measurements, the error will tend to grow small. These are accidental errors.

But first, it's important to remember the classic distinction between systematic and random errors. If we measure a length with a meter that is too long, we will always get a number that is too small, and measuring multiple times won’t change that; this is a systematic error. However, if we measure with an accurate meter, we might still make mistakes, but those mistakes will vary—sometimes too high, sometimes too low. When we average a large number of measurements, the error tends to get smaller. These are random errors.

It is evident from the first that systematic errors can not satisfy Gauss's law; but do the accidental errors satisfy it? A great number of demonstrations have been attempted; almost all are crude paralogisms. Nevertheless, we may demonstrate Gauss's law by starting from the following hypotheses: the error committed is the result of a great number of partial and independent errors; each of the partial errors is very little and besides, obeys any law of probability, provided that the probability of a positive error is the same as that of an equal negative error. It is evident that these conditions will be often but not always fulfilled, and we may reserve the name of accidental for errors which satisfy them.

It is clear from the outset that systematic errors cannot satisfy Gauss's law; but what about random errors? Many attempts have been made to demonstrate this; almost all of them are flawed arguments. However, we can prove Gauss's law by starting with the following assumptions: the error made is the result of many partial and independent errors; each of these partial errors is small and follows some probability law, provided that the chance of a positive error is equal to that of a corresponding negative error. It's clear that these conditions will often, but not always, be met, and we can call errors that meet them "random."

We see that the method of least squares is not legitimate in every case; in general the physicists are more distrustful of it than the astronomers. This is, no doubt, because the latter, besides the systematic errors to which they and the physicists are subject alike, have to control with an extremely important source of error which is wholly accidental; I mean atmospheric[Pg 172] undulations. So it is very curious to hear a physicist discuss with an astronomer about a method of observation. The physicist, persuaded that one good measurement is worth more than many bad ones, is before all concerned with eliminating by dint of precautions the least systematic errors, and the astronomer says to him: 'But thus you can observe only a small number of stars; the accidental errors will not disappear.'

We see that the least squares method isn't valid in every situation; in general, physicists are more skeptical of it than astronomers. This is likely because astronomers, in addition to the systematic errors both they and physicists face, have to deal with a significant source of error that is completely random: atmospheric[Pg 172] undulations. So, it's quite interesting to hear a physicist discuss observation methods with an astronomer. The physicist, convinced that one accurate measurement is more valuable than many inaccurate ones, mainly focuses on reducing systematic errors through various precautions, while the astronomer responds, "But that way, you can only observe a limited number of stars; the random errors won’t just go away."

What should we conclude? Must we continue to use the method of least squares? We must distinguish. We have eliminated all the systematic errors we could suspect; we know well there are still others, but we can not detect them; yet it is necessary to make up our mind and adopt a definitive value which will be regarded as the probable value; and for that it is evident the best thing to do is to apply Gauss's method. We have only applied a practical rule referring to subjective probability. There is nothing more to be said.

What should we conclude? Should we keep using the method of least squares? We need to make a distinction. We’ve removed all the systematic errors we could identify; we know there are still others, but we can't detect them. However, we need to decide and choose a final value that will be considered the likely value; for that, it's clear that the best approach is to use Gauss's method. We've only applied a practical rule related to subjective probability. There’s nothing more to say.

But we wish to go farther and affirm that not only is the probable value so much, but that the probable error in the result is so much. This is absolutely illegitimate; it would be true only if we were sure that all the systematic errors were eliminated, and of that we know absolutely nothing. We have two series of observations; by applying the rule of least squares, we find that the probable error in the first series is twice as small as in the second. The second series may, however, be better than the first, because the first perhaps is affected by a large systematic error. All we can say is that the first series is probably better than the second, since its accidental error is smaller, and we have no reason to affirm that the systematic error is greater for one of the series than for the other, our ignorance on this point being absolute.

But we want to go further and insist that not only is the probable value this much, but that the probable error in the result is also this much. This is completely illegitimate; it would only be true if we were certain that all the systematic errors have been eliminated, and we have no certainty about that at all. We have two sets of observations; by using the rule of least squares, we find that the probable error in the first set is half as much as in the second. However, the second set might actually be better than the first, because the first might be influenced by a significant systematic error. All we can say is that the first set is probably better than the second, since its random error is smaller, and we have no reason to claim that the systematic error is greater for one set than for the other, as our ignorance on this issue is complete.

VII. Conclusions.—In the lines which precede, I have set many problems without solving any of them. Yet I do not regret having written them, because they will perhaps invite the reader to reflect on these delicate questions.

VII. Conclusions.—In the previous lines, I've raised a lot of questions without providing any answers. However, I don't regret writing them, because they might encourage the reader to think about these complex issues.

However that may be, there are certain points which seem well established. To undertake any calculation of probability, and even for that calculation to have any meaning, it is necessary[Pg 173] to admit, as point of departure, a hypothesis or convention which has always something arbitrary about it. In the choice of this convention, we can be guided only by the principle of sufficient reason. Unfortunately this principle is very vague and very elastic, and in the cursory examination we have just made, we have seen it take many different forms. The form under which we have met it most often is the belief in continuity, a belief which it would be difficult to justify by apodeictic reasoning, but without which all science would be impossible. Finally the problems to which the calculus of probabilities may be applied with profit are those in which the result is independent of the hypothesis made at the outset, provided only that this hypothesis satisfies the condition of continuity.

However that may be, there are certain points that seem well established. To carry out any probability calculation, and for that calculation to have any meaning, it's necessary[Pg 173] to start with a hypothesis or convention that always has some arbitrary element. In choosing this convention, we can only be guided by the principle of sufficient reason. Unfortunately, this principle is quite vague and flexible, and in the quick overview we've just done, we've seen it take many different forms. The form we’ve encountered most often is the belief in continuity, a belief that's tough to justify with absolute reasoning but without which all science would be impossible. Finally, the problems where probability calculations can be effectively applied are those in which the result is independent of the initial hypothesis, as long as this hypothesis meets the condition of continuity.


CHAPTER XII

Optics and Electricity

Fresnel's Theory.—The best example[5] that can be chosen of physics in the making is the theory of light and its relations to the theory of electricity. Thanks to Fresnel, optics is the best developed part of physics; the so-called wave-theory forms a whole truly satisfying to the mind. We must not, however, ask of it what it can not give us.

Fresnel's Theory.—The best example[5] that can be chosen of physics in the making is the theory of light and its connections to the theory of electricity. Thanks to Fresnel, optics is the most developed area of physics; the wave theory provides a comprehensive understanding that is truly satisfying. However, we shouldn't expect it to offer more than it can deliver.

The object of mathematical theories is not to reveal to us the true nature of things; this would be an unreasonable pretension. Their sole aim is to coordinate the physical laws which experiment reveals to us, but which, without the help of mathematics, we should not be able even to state.

The goal of mathematical theories isn't to show us the true nature of things; that would be an unrealistic expectation. Their only purpose is to organize the physical laws that experiments show us, but without the assistance of mathematics, we wouldn't even be able to articulate them.

It matters little whether the ether really exists; that is the affair of metaphysicians. The essential thing for us is that everything happens as if it existed, and that this hypothesis is convenient for the explanation of phenomena. After all, have we any other reason to believe in the existence of material objects? That, too, is only a convenient hypothesis; only this will never cease to be so, whereas, no doubt, some day the ether will be thrown aside as useless. But even at that day, the laws of optics and the equations which translate them analytically will remain true, at least as a first approximation. It will always be useful, then, to study a doctrine that unites all these equations.

It doesn’t really matter if ether actually exists; that’s up to philosophers. What’s crucial for us is that everything acts as if it does, and this idea is useful for explaining phenomena. After all, do we have any solid reason to believe in the existence of physical objects? That’s just another convenient idea, and while it might always be that way, ether will likely be dismissed as irrelevant one day. Even then, the laws of optics and the equations that represent them will still hold true, at least as a rough approximation. So, it will always be beneficial to study a theory that brings together all these equations.

The undulatory theory rests on a molecular hypothesis. For those who think they have thus discovered the cause under the law, this is an advantage. For the others it is a reason for distrust. But this distrust seems to me as little justified as the illusion of the former.

The wave theory is based on a molecular hypothesis. For those who believe they’ve found the cause behind the law, this is a benefit. For others, it raises suspicion. However, I find this suspicion just as unfounded as the illusion of the former group.

These hypotheses play only a secondary part. They might be sacrificed. They usually are not, because then the explanation would lose in clearness; but that is the only reason.[Pg 175]

These hypotheses are only a secondary factor. They could be discarded. They typically aren’t, because doing so would make the explanation less clear; but that’s the only reason.[Pg 175]

In fact, if we looked closer we should see that only two things are borrowed from the molecular hypotheses: the principle of the conservation of energy and the linear form of the equations, which is the general law of small movements, as of all small variations.

In fact, if we looked closer, we would see that only two things are borrowed from the molecular hypotheses: the principle of energy conservation and the linear form of the equations, which is the general law of small movements, as well as all small variations.

This explains why most of Fresnel's conclusions remain unchanged when we adopt the electromagnetic theory of light.

This explains why most of Fresnel's conclusions stay the same when we take on the electromagnetic theory of light.

Maxwell's Theory.—Maxwell, we know, connected by a close bond two parts of physics until then entirely foreign to one another, optics and electricity. By blending thus in a vaster whole, in a higher harmony, the optics of Fresnel has not ceased to be alive. Its various parts subsist, and their mutual relations are still the same. Only the language we used to express them has changed; and, on the other hand, Maxwell has revealed to us other relations, before unsuspected, between the different parts of optics and the domain of electricity.

Maxwell's Theory.—Maxwell, as we know, linked two areas of physics that were completely separate at the time, optics and electricity. By combining them into a larger whole, a higher harmony, the optics of Fresnel remains relevant. Its different components still exist, and their relationships haven't changed. The only thing that has shifted is the language we use to describe them; additionally, Maxwell uncovered previously unknown connections between various aspects of optics and the field of electricity.

When a French reader first opens Maxwell's book, a feeling of uneasiness and often even of mistrust mingles at first with his admiration. Only after a prolonged acquaintance and at the cost of many efforts does this feeling disappear. There are even some eminent minds that never lose it.

When a French reader first picks up Maxwell's book, there's a sense of unease and often even mistrust that mixes with admiration. It takes a while to get used to it, and it requires a lot of effort for that feeling to fade away. Some brilliant thinkers might never shake it off.

Why are the English scientist's ideas with such difficulty acclimatized among us? It is, no doubt, because the education received by the majority of enlightened Frenchmen predisposes them to appreciate precision and logic above every other quality.

Why are the English scientist's ideas so hard to accept among us? It's probably because the education most educated French people receive makes them value precision and logic more than anything else.

The old theories of mathematical physics gave us in this respect complete satisfaction. All our masters, from Laplace to Cauchy, have proceeded in the same way. Starting from clearly stated hypotheses, they deduced all their consequences with mathematical rigor, and then compared them with experiment. It seemed their aim to give every branch of physics the same precision as celestial mechanics.

The old theories of mathematical physics satisfied us completely in this regard. All our greats, from Laplace to Cauchy, followed the same approach. They began with clearly defined hypotheses, then deduced all their consequences with mathematical precision, afterward comparing them with experiments. It seemed their goal was to provide every area of physics with the same level of accuracy as celestial mechanics.

A mind accustomed to admire such models is hard to suit with a theory. Not only will it not tolerate the least appearance of contradiction, but it will demand that the various parts be logically connected with one another, and that the number of distinct hypotheses be reduced to minimum.

A mind used to admiring such models finds it difficult to accept a theory. Not only will it not tolerate any hint of contradiction, but it will also require that the different parts be logically connected to each other and that the number of separate hypotheses be kept to a minimum.

This is not all; it will have still other demands, which seem to[Pg 176] me less reasonable. Behind the matter which our senses can reach, and which experiment tells us of, it will desire to see another, and in its eyes the only real, matter, which will have only purely geometric properties, and whose atoms will be nothing but mathematical points, subject to the laws of dynamics alone. And yet these atoms, invisible and without color, it will seek by an unconscious contradiction to represent to itself and consequently to identify as closely as possible with common matter.

This isn't the whole story; it will have even more demands that seem to me less reasonable. Beyond what our senses can perceive and what experiments reveal, it will strive to uncover another layer, which it considers the only true reality. This layer will have purely geometric properties, and its atoms will be nothing more than mathematical points, governed solely by the laws of dynamics. Yet, these atoms, invisible and colorless, it will paradoxically try to visualize and, as a result, associate as closely as possible with ordinary matter.

Then only will it be fully satisfied and imagine that it has penetrated the secret of the universe. If this satisfaction is deceitful, it is none the less difficult to renounce.

Then only will it be fully satisfied and believe that it has uncovered the secret of the universe. If this satisfaction is misleading, it is still difficult to let go.

Thus, on opening Maxwell, a Frenchman expects to find a theoretical whole as logical and precise as the physical optics based on the hypothesis of the ether; he thus prepares for himself a disappointment which I should like to spare the reader by informing him immediately of what he must look for in Maxwell, and what he can not find there.

Thus, when a Frenchman opens Maxwell, he expects to encounter a theoretical framework that is as logical and precise as the physical optics based on the ether hypothesis. He is setting himself up for disappointment, which I’d like to prevent for the reader by informing him right away about what to look for in Maxwell and what he won’t find there.

Maxwell does not give a mechanical explanation of electricity and magnetism; he confines himself to demonstrating that such an explanation is possible.

Maxwell doesn’t provide a mechanical explanation for electricity and magnetism; he focuses on showing that such an explanation can exist.

He shows also that optical phenomena are only a special case of electromagnetic phenomena. From every theory of electricity, one can therefore deduce immediately a theory of light.

He also demonstrates that optical phenomena are just a specific type of electromagnetic phenomena. From any theory of electricity, one can immediately derive a theory of light.

The converse unfortunately is not true; from a complete explanation of light, it is not always easy to derive a complete explanation of electric phenomena. This is not easy, in particular, if we wish to start from Fresnel's theory. Doubtless it would not be impossible; but nevertheless we must ask whether we are not going to be forced to renounce admirable results that we thought definitely acquired. That seems a step backward; and many good minds are not willing to submit to it.

The opposite, unfortunately, isn't true; from a full explanation of light, it's not always straightforward to get a complete understanding of electric phenomena. This is especially difficult if we want to begin with Fresnel's theory. It might not be impossible, but we have to consider whether we're going to have to give up amazing results that we thought were already established. That feels like a step backward, and many smart people are reluctant to accept it.

When the reader shall have consented to limit his hopes, he will still encounter other difficulties. The English scientist does not try to construct a single edifice, final and well ordered; he seems rather to erect a great number of provisional and independent constructions, between which communication is difficult and sometimes impossible.[Pg 177]

When the reader has agreed to lower their expectations, they will still face other challenges. The English scientist doesn’t aim to create one definitive, orderly structure; instead, they seem to build a lot of temporary and separate frameworks, making communication between them tricky and at times impossible.[Pg 177]

Take as example the chapter in which he explains electrostatic attractions by pressures and tensions in the dielectric medium. This chapter might be omitted without making thereby the rest of the book less clear or complete; and, on the other hand, it contains a theory complete in itself which one could understand without having read a single line that precedes or follows. But it is not only independent of the rest of the work; it is difficult to reconcile with the fundamental ideas of the book. Maxwell does not even attempt this reconciliation; he merely says: "I have not been able to make the next step, namely, to account by mechanical considerations for these stresses in the dielectric."

Take, for example, the chapter where he explains electrostatic attractions through pressures and tensions in the dielectric medium. This chapter could be left out without making the rest of the book any less clear or complete; on the other hand, it presents a self-contained theory that one could grasp without having read any lines before or after it. However, it not only stands apart from the rest of the work; it also clashes with the fundamental ideas of the book. Maxwell doesn't even try to reconcile this; he simply states: "I have not been able to make the next step, namely, to account by mechanical considerations for these stresses in the dielectric."

This example will suffice to make my thought understood; I could cite many others. Thus who would suspect, in reading the pages devoted to magnetic rotary polarization, that there is an identity between optical and magnetic phenomena?

This example is enough to clarify my point; I could mention many more. So, who would guess, while reading the sections on magnetic rotary polarization, that there is a connection between optical and magnetic phenomena?

One must not then flatter himself that he can avoid all contradiction; to that it is necessary to be resigned. In fact, two contradictory theories, provided one does not mingle them, and if one does not seek in them the basis of things, may both be useful instruments of research; and perhaps the reading of Maxwell would be less suggestive if he had not opened up to us so many new and divergent paths.

One shouldn’t fool themselves into thinking they can avoid all contradiction; it’s necessary to accept that. In fact, two contradictory theories, as long as you don’t mix them together and don’t look for the fundamental truth in them, can both be useful tools for research. And maybe reading Maxwell would be less thought-provoking if he hadn’t introduced so many new and different approaches.

The fundamental idea, however, is thus a little obscured. So far is this the case that in the majority of popularized versions it is the only point completely left aside.

The basic idea, however, is somewhat unclear. It's so unclear that in most simplified versions, it's the only point that gets completely ignored.

I feel, then, that the better to make its importance stand out, I ought to explain in what this fundamental idea consists. But for that a short digression is necessary.

I think, then, that to highlight its importance better, I should explain what this fundamental idea is. But for that, a brief digression is needed.

The Mechanical Explanation of Physical Phenomena.—There is in every physical phenomenon a certain number of parameters which experiment reaches directly and allows us to measure. I shall call these the parameters q.

The Mechanical Explanation of Physical Events.—In every physical phenomenon, there are a certain number of parameters that experiments can directly reach and allow us to measure. I will refer to these parameters as q.

Observation then teaches us the laws of the variations of these parameters; and these laws can generally be put in the form of differential equations, which connect the parameters q with the time.

Observation teaches us the rules for how these parameters change, and these rules can usually be expressed as differential equations that link the parameters q with time.

What is it necessary to do to give a mechanical interpretation of such a phenomenon?[Pg 178]

What do we need to do to provide a mechanical explanation for this phenomenon?[Pg 178]

One will try to explain it either by the motions of ordinary matter, or by those of one or more hypothetical fluids.

One will try to explain it either by the movements of regular matter or by those of one or more theoretical fluids.

These fluids will be considered as formed of a very great number of isolated molecules m.

These fluids will be regarded as made up of a vast number of individual molecules m.

When shall we say, then, that we have a complete mechanical explanation of the phenomenon? It will be, on the one hand, when we know the differential equations satisfied by the coordinates of these hypothetical molecules m, equations which, moreover, must conform to the principles of dynamics; and, on the other hand, when we know the relations that define the coordinates of the molecules m as functions of the parameters q accessible to experiment.

When can we say that we have a complete mechanical explanation of the phenomenon? It will be, on one hand, when we understand the differential equations that describe the coordinates of these hypothetical molecules m, equations that must also comply with the principles of dynamics; and, on the other hand, when we understand the relationships that define the coordinates of the molecules m as functions of the parameters q that are accessible to experimentation.

These equations, as I have said, must conform to the principles of dynamics, and, in particular, to the principle of the conservation of energy and the principle of least action.

These equations, as I mentioned, must comply with the principles of dynamics, especially the energy conservation principle and the principle of least action.

The first of these two principles teaches us that the total energy is constant and that this energy is divided into two parts:

The first of these two principles teaches us that the total energy remains constant and that this energy is split into two parts:

1º The kinetic energy, or vis viva, which depends on the masses of the hypothetical molecules m, and their velocities, and which I shall call T.

1º The kinetic energy, or vis viva, which depends on the masses of the hypothetical molecules m, and their speeds, and which I will refer to as T.

2º The potential energy, which depends only on the coordinates of these molecules and which I shall call U. It is the sum of the two energies T and U which is constant.

2º The potential energy, which relies solely on the positions of these molecules and which I’ll refer to as U. It is the sum of the two energies T and U that remains constant.

What now does the principle of least action tell us? It tells us that to pass from the initial position occupied at the instant t0 to the final position occupied at the instant t1, the system must take such a path that, in the interval of time that elapses between the two instants t0 and t1, the average value of 'the action' (that is to say, of the difference between the two energies T and U) shall be as small as possible.

What does the principle of least action tell us now? It tells us that to move from the initial position at the moment t0 to the final position at the moment t1, the system must take a path that ensures that during the time between t0 and t1, the average value of "the action" (which means the difference between the two energies T and U) is as small as possible.

If the two functions T and U are known, this principle suffices to determine the equations of motion.

If the two functions T and U are known, this principle is enough to figure out the equations of motion.

Among all the possible ways of passing from one position to another, there is evidently one for which the average value of the action is less than for any other. There is, moreover, only one; and it results from this that the principle of least action suffices to determine the path followed and consequently the equations of motion.[Pg 179]

Among all the possible ways to move from one position to another, there is clearly one where the average value of the action is lower than for any other option. Furthermore, there is only one such path, and this leads to the conclusion that the principle of least action is enough to determine the path taken and, as a result, the equations of motion.[Pg 179]

Thus we obtain what are called the equations of Lagrange.

Thus we get what are known as the equations of Lagrange.

In these equations, the independent variables are the coordinates of the hypothetical molecules m; but I now suppose that one takes as variables the parameters q directly accessible to experiment.

In these equations, the independent variables are the coordinates of the hypothetical molecules m; but I now assume that we take the parameters q that are directly accessible to experiments as the variables.

The two parts of the energy must then be expressed as functions of the parameters q and of their derivatives. They will evidently appear under this form to the experimenter. The latter will naturally try to define the potential and the kinetic energy by the aid of quantities that he can directly observe.[6]

The two parts of the energy must then be expressed as functions of the parameters q and their derivatives. They will clearly appear in this form to the experimenter. The experimenter will understandably try to define the potential and kinetic energy using quantities that they can directly observe.[6]

That granted, the system will always go from one position to another by a path such that the average action shall be a minimum.

That said, the system will always move from one position to another in a way that minimizes the average action.

It matters little that T and U are now expressed by the aid of the parameters q and their derivatives; it matters little that it is also by means of these parameters that we define the initial and final positions; the principle of least action remains always true.

It doesn’t really matter that T and U are now represented using the parameters q and their derivatives; it doesn’t matter that we also use these parameters to define the starting and ending positions; the principle of least action is always valid.

Now here again, of all the paths that lead from one position to another, there is one for which the average action is a minimum, and there is only one. The principle of least action suffices, then, to determine the differential equations which define the variations of the parameters q.

Now, once again, out of all the routes that connect one position to another, there’s one where the typical action is at its lowest, and there’s only one. Therefore, the principle of least action is enough to establish the differential equations that define the changes in the parameters q.

The equations thus obtained are another form of the equations of Lagrange.

The equations we've derived are another version of Lagrange's equations.

To form these equations we need to know neither the relations that connect the parameters q with the coordinates of the hypothetical molecules, nor the masses of these molecules, nor the expression of U as a function of the coordinates of these molecules.

To create these equations, we don’t need to know the relationships that link the parameters q with the coordinates of the hypothetical molecules, the masses of these molecules, or how to express U as a function of the coordinates of these molecules.

All we need to know is the expression of U as a function of the parameters, and that of T as a function of the parameters q and their derivatives, that is, the expressions of the kinetic and of the potential energy as functions of the experimental data.

All we need to know is how to express U as a function of the parameters, and how to express T as a function of the parameters q and their derivatives. This means we need the expressions for kinetic and potential energy as functions of the experimental data.

Then we shall have one of two things: either for a suitable[Pg 180] choice of the functions T and U, the equations of Lagrange, constructed as we have just said, will be identical with the differential equations deduced from experiments; or else there will exist no functions T and U, for which this agreement takes place. In the latter case it is clear that no mechanical explanation is possible.

Then we will have one of two outcomes: either for a suitable[Pg 180] selection of the functions T and U, the equations of Lagrange, as we just discussed, will match the differential equations derived from experiments; or there will be no functions T and U that allow for this match. In the latter case, it's clear that no mechanical explanation can be provided.

The necessary condition for a mechanical explanation to be possible is therefore that we can choose the functions T and U in such a way as to satisfy the principle of least action, which involves that of the conservation of energy.

The necessary condition for a mechanical explanation to be possible is that we can choose the functions T and U in a way that meets the principle of least action, which is related to the conservation of energy.

This condition, moreover, is sufficient. Suppose, in fact, that we have found a function U of the parameters q, which represents one of the parts of the energy; that another part of the energy, which we shall represent by T, is a function of the parameters q and their derivatives, and that it is a homogeneous polynomial of the second degree with respect to these derivatives; and finally that the equations of Lagrange, formed by means of these two functions, T and U, conform to the data of the experiment.

This condition is also sufficient. Let's say we've found a function U based on the parameters q, which represents part of the energy; another portion of the energy, represented by T, is a function of the parameters q and their derivatives, and it's a homogeneous polynomial of the second degree with respect to these derivatives; finally, the equations of Lagrange, formed using these two functions, T and U, align with the experimental data.

What is necessary in order to deduce from this a mechanical explanation? It is necessary that U can be regarded as the potential energy of a system and T as the vis viva of the same system.

What do we need to derive a mechanical explanation from this? We need to see U as the potential energy of a system and T as the vis viva of that same system.

There is no difficulty as to U, but can T be regarded as the vis viva of a material system?

There’s no issue with U, but can T be considered the vis viva of a material system?

It is easy to show that this is always possible, and even in an infinity of ways. I will confine myself to referring for more details to the preface of my work, 'Électricité et optique.'

It’s easy to demonstrate that this is always possible, and even in countless ways. I’ll just refer you to the preface of my work, 'Électricité et optique,' for more details.

Thus if the principle of least action can not be satisfied, no mechanical explanation is possible; if it can be satisfied, there is not only one, but an infinity, whence it follows that as soon as there is one there is an infinity of others.

Thus, if the principle of least action cannot be satisfied, no mechanical explanation is possible; if it can be satisfied, there is not just one, but an infinite number, which means that as soon as there is one, there is an infinite number of others.

One more observation.

One last observation.

Among the quantities that experiment gives us directly, we shall regard some as functions of the coordinates of our hypothetical molecules; these are our parameters q. We shall look upon the others as dependent not only on the coordinates, but on the velocities, or, what comes to the same thing, on the derivatives[Pg 181] of the parameters q, or as combinations of these parameters and their derivatives.

Among the quantities that experiments provide us directly, we'll consider some as functions of the coordinates of our hypothetical molecules; these are our parameters q. We'll see the others as depending not only on the coordinates, but also on the velocities, or, in other words, on the derivatives[Pg 181] of the parameters q, or as combinations of these parameters and their derivatives.

And then a question presents itself: among all these quantities measured experimentally, which shall we choose to represent the parameters q? Which shall we prefer to regard as the derivatives of these parameters? This choice remains arbitrary to a very large extent; but, for a mechanical explanation to be possible, it suffices if we can make the choice in such a way as to accord with the principle of least action.

And then a question arises: out of all these quantities measured in experiments, which ones should we use to represent the parameters q? Which should we consider as the derivatives of these parameters? This decision is largely arbitrary; however, for a mechanical explanation to be feasible, it’s enough if we can make the choice in a way that aligns with the principle of least action.

And then Maxwell asked himself whether he could make this choice and that of the two energies T and U, in such a way that the electrical phenomena would satisfy this principle. Experiment shows us that the energy of an electromagnetic field is decomposed into two parts, the electrostatic energy and the electrodynamic energy. Maxwell observed that if we regard the first as representing the potential energy U, the second as representing the kinetic energy T; if, moreover, the electrostatic charges of the conductors are considered as parameters q and the intensities of the currents as the derivatives of other parameters q; under these conditions, I say, Maxwell observed that the electric phenomena satisfy the principle of least action. Thenceforth he was certain of the possibility of a mechanical explanation.

And then Maxwell wondered if he could make this choice between the two energies T and U in a way that the electrical phenomena would comply with this principle. Experiments show us that the energy of an electromagnetic field breaks down into two parts: the electrostatic energy and the electrodynamic energy. Maxwell noted that if we consider the first as representing potential energy U and the second as representing kinetic energy T; additionally, if we treat the electrostatic charges of the conductors as parameters q and the current intensities as the derivatives of other parameters q; under these conditions, I mean, Maxwell observed that the electric phenomena satisfy the principle of least action. From that point on, he was confident in the possibility of a mechanical explanation.

If he had explained this idea at the beginning of his book instead of relegating it to an obscure part of the second volume, it would not have escaped the majority of readers.

If he had explained this idea at the start of his book instead of burying it in a less noticeable part of the second volume, it wouldn't have gone over the heads of most readers.

If, then, a phenomenon admits of a complete mechanical explanation, it will admit of an infinity of others, that will render an account equally well of all the particulars revealed by experiment.

If a phenomenon can be fully explained mechanically, it can be explained in countless other ways that will also adequately account for all the details shown by experiments.

And this is confirmed by the history of every branch of physics; in optics, for instance, Fresnel believed vibration to be perpendicular to the plane of polarization; Neumann regarded it as parallel to this plane. An 'experimentum crucis' has long been sought which would enable us to decide between these two theories, but it has not been found.

And this is supported by the history of every area of physics; in optics, for example, Fresnel thought vibration was perpendicular to the plane of polarization, while Neumann saw it as parallel to that plane. A crucial experiment has long been sought to help decide between these two theories, but it hasn't been found.

In the same way, without leaving the domain of electricity, we may ascertain that the theory of two fluids and that of the[Pg 182] single fluid both account in a fashion equally satisfactory for all the observed laws of electrostatics.

In the same way, without leaving the field of electricity, we can see that the theories of two fluids and a single fluid both effectively explain all the observed laws of electrostatics.

All these facts are easily explicable, thanks to the properties of the equations of Lagrange which I have just recalled.

All these facts can be easily explained, thanks to the properties of the Lagrange equations that I just mentioned.

It is easy now to comprehend what is Maxwell's fundamental idea.

It is now easy to understand Maxwell's fundamental idea.

To demonstrate the possibility of a mechanical explanation of electricity, we need not preoccupy ourselves with finding this explanation itself; it suffices us to know the expression of the two functions T and U, which are the two parts of energy, to form with these two functions the equations of Lagrange and then to compare these equations with the experimental laws.

To show that electricity can be explained mechanically, we don’t need to worry about finding that explanation itself; it's enough for us to know the expressions of the two functions T and U, which are the two components of energy. We can then use these functions to create Lagrange's equations and compare them with the experimental laws.

Among all these possible explanations, how make a choice for which the aid of experiment fails us? A day will come perhaps when physicists will not interest themselves in these questions, inaccessible to positive methods, and will abandon them to the metaphysicians. This day has not yet arrived; man does not resign himself so easily to be forever ignorant of the foundation of things.

Among all these possible explanations, how do we choose when experiments can't help us? Maybe one day physicists won’t care about these questions that can’t be addressed by positive methods and will leave them to the metaphysicians. That day hasn’t come yet; humanity doesn’t easily accept being forever ignorant of the foundations of things.

Our choice can therefore be further guided only by considerations where the part of personal appreciation is very great; there are, however, solutions that all the world will reject because of their whimsicality, and others that all the world will prefer because of their simplicity.

Our choice can therefore be further guided only by factors where personal preference plays a significant role; however, there are solutions that everyone will dismiss because they're too fanciful, and others that everyone will prefer because they're straightforward.

In what concerns electricity and magnetism, Maxwell abstains from making any choice. It is not that he systematically disdains all that is unattainable by positive methods; the time he has devoted to the kinetic theory of gases sufficiently proves that. I will add that if, in his great work, he develops no complete explanation, he had previously attempted to give one in an article in the Philosophical Magazine. The strangeness and the complexity of the hypotheses he had been obliged to make had led him afterwards to give this up.

In terms of electricity and magnetism, Maxwell doesn’t pick a side. It’s not that he completely ignores anything that can’t be explained with definite methods; the time he spent on the kinetic theory of gases clearly shows otherwise. I should mention that even though he doesn’t provide a full explanation in his major work, he had previously tried to offer one in an article in the Philosophical Magazine. The oddity and complexity of the assumptions he had to make ultimately caused him to abandon that attempt.

The same spirit is found throughout the whole work. What is essential, that is to say what must remain common to all theories, is made prominent; all that would only be suitable to a particular theory is nearly always passed over in silence. Thus the reader finds himself in the presence of a form almost devoid[Pg 183] of matter, which he is at first tempted to take for a fugitive shadow not to be grasped. But the efforts to which he is thus condemned force him to think and he ends by comprehending what was often rather artificial in the theoretic constructs he had previously only wondered at.

The same vibe runs through the entire work. What's essential, meaning what needs to be common across all theories, is highlighted; anything that would only apply to a specific theory is mostly ignored. As a result, the reader encounters a form that's nearly empty[Pg 183], which at first might seem like an elusive shadow that can't be grasped. But the challenges he's faced with compel him to think, and he ultimately understands what was often pretty artificial in the theoretical ideas he had previously just marveled at.


CHAPTER XIII

Electromagnetism

The history of electrodynamics is particularly instructive from our point of view.

The history of electrodynamics is especially informative from our perspective.

Ampère entitled his immortal work, 'Théorie des phénomènes électrodynamiques, uniquement fondée sur l'expérience.' He therefore imagined that he had made no hypothesis, but he had made them, as we shall soon see; only he made them without being conscious of it.

Ampère titled his timeless work, 'Theory of Electrodynamic Phenomena, exclusively based on experience.' He believed he had made no hypothesis, but he actually had, as we will soon see; he just did so without being aware of it.

His successors, on the other hand, perceived them, since their attention was attracted by the weak points in Ampère's solution. They made new hypotheses, of which this time they were fully conscious; but how many times it was necessary to change them before arriving at the classic system of to-day which is perhaps not yet final; this we shall see.

His successors, however, noticed them because their focus was drawn to the flaws in Ampère's solution. They created new hypotheses, fully aware of what they were doing this time; yet, they had to change them numerous times before reaching today’s classic system, which may still not be final. We will see.

I. Ampere's Theory.—When Ampère studied experimentally the mutual actions of currents, he operated and he only could operate with closed currents.

I. Ampere's Theory.—When Ampère conducted experiments on the interactions of currents, he worked exclusively with closed currents.

It was not that he denied the possibility of open currents. If two conductors are charged with positive and negative electricity and brought into communication by a wire, a current is established going from one to the other, which continues until the two potentials are equal. According to the ideas of Ampère's time this was an open current; the current was known to go from the first conductor to the second, it was not seen to return from the second to the first.

It wasn't that he ruled out the possibility of open currents. If two conductors are charged with positive and negative electricity and connected by a wire, a current flows from one to the other, continuing until both potentials equal out. Back in Ampère's day, this was considered an open current; it was understood that the current moved from the first conductor to the second but didn't return from the second back to the first.

So Ampère considered as open currents of this nature, for example, the currents of discharge of condensers; but he could not make them the objects of his experiments because their duration is too short.

So Ampère viewed open currents of this kind, like the discharge currents of capacitors; however, he couldn't make them the focus of his experiments because their duration is too brief.

Another sort of open current may also be imagined. I suppose two conductors, A and B, connected by a wire AMB. Small conducting masses in motion first come in contact with the[Pg 185] conductor B, take from it an electric charge, leave contact with B and move along the path BNA, and, transporting with them their charge, come into contact with A and give to it their charge, which returns then to B along the wire AMB.

Another kind of open current can also be imagined. Let's say we have two conductors, A and B, connected by a wire AMB. Small conducting particles in motion first make contact with conductor B, take an electric charge from it, move away from B along the path BNA, and while carrying their charge, they come into contact with A and transfer their charge to it. This charge then returns to B through the wire AMB.

Now there we have in a sense a closed circuit, since the electricity describes the closed circuit BNAMB; but the two parts of this current are very different. In the wire AMB, the electricity is displaced through a fixed conductor, like a voltaic current, overcoming an ohmic resistance and developing heat; we say that it is displaced by conduction. In the part BNA, the electricity is carried by a moving conductor; it is said to be displaced by convection.

Now we have, in a way, a closed circuit, since the electricity follows the closed circuit BNAMB; however, the two parts of this current are quite different. In the wire AMB, the electricity moves through a fixed conductor, similar to a voltaic current, overcoming resistance and producing heat; we say that it is moved by conduction. In the part BNA, the electricity is transported by a moving conductor; it's said to be moved by convection.

If then the current of convection is considered as altogether analogous to the current of conduction, the circuit BNAMB is closed; if, on the contrary, the convection current is not 'a true current' and, for example, does not act on the magnet, there remains only the conduction current AMB, which is open.

If we think of the convection current as being just like the conduction current, then the circuit BNAMB is closed; however, if the convection current isn't a 'true current' and, for instance, doesn't affect the magnet, then only the conduction current AMB remains, which is open.

For example, if we connect by a wire the two poles of a Holtz machine, the charged rotating disc transfers the electricity by convection from one pole to the other, and it returns to the first pole by conduction through the wire.

For example, if we connect the two poles of a Holtz machine with a wire, the charged rotating disc transfers electricity from one pole to the other through convection, and it comes back to the first pole by conduction through the wire.

But currents of this sort are very difficult to produce with appreciable intensity. With the means at Ampère's disposal, we may say that this was impossible.

But creating currents like this is really hard to do with any significant intensity. Given the resources that Ampère had, we can say that it was impossible.

To sum up, Ampère could conceive of the existence of two kinds of open currents, but he could operate on neither because they were not strong enough or because their duration was too short.

To sum up, Ampère could imagine the existence of two types of open currents, but he couldn’t work with either because they were either not strong enough or their duration was too brief.

Experiment therefore could only show him the action of a closed current on a closed current, or, more accurately, the action of a closed current on a portion of a current, because a current can be made to describe a closed circuit composed of a moving part and a fixed part. It is possible then to study the displacements of the moving part under the action of another closed current.

Experiment, therefore, could only demonstrate the effect of a closed current on another closed current, or more precisely, the effect of a closed current on a segment of a current, since a current can be arranged to follow a closed loop that consists of both a moving part and a stationary part. Thus, it's possible to examine the movements of the moving part under the influence of a different closed current.

On the other hand, Ampère had no means of studying the action of an open current, either on a closed current or another open current.[Pg 186]

On the other hand, Ampère had no way of examining the effect of an open current on either a closed current or another open current.[Pg 186]

1. The Case of Closed Currents.—In the case of the mutual action of two closed currents, experiment revealed to Ampère remarkably simple laws.

1. The Case of Closed Currents.—When two closed currents interact, experiments showed Ampère some surprisingly straightforward laws.

I recall rapidly here those which will be useful to us in the sequel:

I quickly remember those that will be useful to us later:

If the intensity of the currents is kept constant, and if the two circuits, after having undergone any deformations and displacements whatsoever, return finally to their initial positions, the total work of the electrodynamic actions will be null.

If the intensity of the currents remains constant, and if the two circuits, after experiencing any deformations and displacements, finally return to their original positions, the total work done by the electrodynamic actions will be zero.

In other words, there is an electrodynamic potential of the two circuits, proportional to the product of the intensities, and depending on the form and relative position of the circuits; the work of the electrodynamic actions is equal to the variation of this potential.

In other words, there is an electrodynamic potential of the two circuits, proportional to the product of the intensities, and depending on the shape and relative position of the circuits; the work of the electrodynamic actions is equal to the change in this potential.

2º The action of a closed solenoid is null.

2º The action of a closed solenoid is nonexistent.

3º The action of a circuit C on another voltaic circuit depends only on the 'magnetic field' developed by this circuit. At each point in space we can in fact define in magnitude and direction a certain force called magnetic force, which enjoys the following properties:

3º The action of a circuit C on another circuit depends only on the 'magnetic field' generated by this circuit. At each point in space, we can actually define a certain force called magnetic force in terms of its magnitude and direction, which has the following properties:

(a) The force exercised by C on a magnetic pole is applied to that pole and is equal to the magnetic force multiplied by the magnetic mass of that pole;

(a) The force exerted by C on a magnetic pole is directed at that pole and is equal to the magnetic force multiplied by the magnetic mass of that pole;

(b) A very short magnetic needle tends to take the direction of the magnetic force, and the couple to which it tends to reduce is proportional to the magnetic force, the magnetic moment of the needle and the sine of the dip of the needle;

(b) A very short magnetic needle tends to align with the direction of the magnetic force, and the torque it aims to minimize is proportional to the magnetic force, the magnetic moment of the needle, and the sine of the needle's dip;

(c) If the circuit C is displaced, the work of the electrodynamic action exercised by C on will be equal to the increment of the 'flow of magnetic force' which passes through the circuit.

(c) If the circuit C is moved, the work done by the electrodynamic action from C on will equal the increase in the 'flow of magnetic force' that passes through the circuit.

2. Action of a Closed Current on a Portion of Current.—Ampère not having been able to produce an open current, properly so called, had only one way of studying the action of a closed current on a portion of current.

2. Action of a Closed Current on a Portion of Current.—Ampère, unable to create a true open current, had only one method to investigate the effects of a closed current on a segment of current.

This was by operating on a circuit C composed of two parts, the one fixed, the other movable. The movable part was, for instance, a movable wire αβ whose extremities α and β could[Pg 187] slide along a fixed wire. In one of the positions of the movable wire, the end α rested on the A of the fixed wire and the extremity β on the point B of the fixed wire. The current circulated from α to β, that is to say, from A to B along the movable wire, and then it returned from B to A along the fixed wire. This current was therefore closed.

This was done by operating on a circuit C made up of two parts, one fixed and the other movable. The movable part was, for example, a movable wire αβ whose ends α and β could[Pg 187] slide along a fixed wire. In one position of the movable wire, end α rested on the A of the fixed wire and end β on point B of the fixed wire. The current flowed from α to β, meaning from A to B along the movable wire, and then returned from B to A along the fixed wire. This current was therefore closed.

In a second position, the movable wire having slipped, the extremity α rested on another point of the fixed wire, and the extremity β on another point of the fixed wire. The current circulated then from α to β, that is to say from to along the movable wire, and it afterwards returned from to B, then from B to A, then finally from A to , always following the fixed wire. The current was therefore also closed.

In a second position, the movable wire slipped, with the end α resting on another point of the fixed wire, and the end β on another point of the fixed wire. The current then flowed from α to β, meaning from to along the movable wire, and then returned from to B, then from B to A, and finally from A to , always following the fixed wire. The current was therefore also closed.

If a like current is subjected to the action of a closed current C, the movable part will be displaced just as if it were acted upon by a force. Ampère assumes that the apparent force to which this movable part AB seems thus subjected, representing the action of the C on the portion αβ of the current, is the same as if αβ were traversed by an open current, stopping at α and β, in place of being traversed by a closed current which after arriving at β returns to α through the fixed part of the circuit.

If a current is exposed to the influence of a closed current C, the movable part will shift as if it were being acted upon by a force. Ampère suggests that the apparent force affecting this movable part AB, which represents the action of C on the segment αβ of the current, is the same as if αβ were passing through an open current that stops at α and β, instead of being passed through by a closed current that, after reaching β, returns to α through the fixed part of the circuit.

This hypothesis seems natural enough, and Ampère made it unconsciously; nevertheless it is not necessary, since we shall see further on that Helmholtz rejected it. However that may be, it permitted Ampère, though he had never been able to produce an open current, to enunciate the laws of the action of a closed current on an open current, or even on an element of current.

This hypothesis seems straightforward, and Ampère arrived at it without realizing it; however, it isn't essential, as we will later see that Helmholtz dismissed it. Regardless, it allowed Ampère, even though he had never managed to create an open current, to state the laws regarding how a closed current affects an open current or even an element of current.

The laws are simple:

The rules are simple:

1º The force which acts on an element of current is applied to this element; it is normal to the element and to the magnetic force, and proportional to the component of this magnetic force which is normal to the element.

1º The force acting on a current element is applied directly to that element; it is perpendicular to both the element and the magnetic force, and is proportional to the component of the magnetic force that is perpendicular to the element.

2º The action of a closed solenoid on an element of current is null.

2º The effect of a closed solenoid on a current element is zero.

But the electrodynamic potential has disappeared, that is to say that, when a closed current and an open current, whose intensities have been maintained constant, return to their initial positions, the total work is not null.[Pg 188]

But the electrodynamic potential is gone, meaning that when a closed current and an open current, both kept at constant intensities, return to their starting positions, the total work is not zero.[Pg 188]

3. Continuous Rotations.—Among electrodynamic experiments, the most remarkable are those in which continuous rotations are produced and which are sometimes called unipolar induction experiments. A magnet may turn about its axis; a current passes first through a fixed wire, enters the magnet by the pole N, for example, passes through half the magnet, emerges by a sliding contact and reenters the fixed wire.

3. Continuous Rotations.—Among electrodynamic experiments, the most notable are those that produce continuous rotations, often referred to as unipolar induction experiments. A magnet can rotate around its axis; a current flows first through a stationary wire, enters the magnet at the pole N, travels through half of the magnet, comes out through a sliding contact, and goes back into the stationary wire.

The magnet then begins to rotate continuously without being able ever to attain equilibrium; this is Faraday's experiment.

The magnet then starts to spin continuously without ever being able to reach equilibrium; this is Faraday's experiment.

How is it possible? If it were a question of two circuits of invariable form, the one C fixed, the other movable about an axis, this latter could never take on continuous rotation; in fact there is an electrodynamic potential; there must therefore be necessarily a position of equilibrium when this potential is a maximum.

How is this possible? If we were dealing with two circuits of fixed shapes, one C being stationary and the other able to rotate around an axis, the latter could never achieve continuous rotation. In reality, there is an electrodynamic potential, so there must be a point of equilibrium when this potential reaches its maximum.

Continuous rotations are therefore possible only when the circuit is composed of two parts: one fixed, the other movable about an axis, as is the case in Faraday's experiment. Here again it is convenient to draw a distinction. The passage from the fixed to the movable part, or inversely, may take place either by simple contact (the same point of the movable part remaining constantly in contact with the same point of the fixed part), or by a sliding contact (the same point of the movable part coming successively in contact with diverse points of the fixed part).

Continuous rotations are only possible when the circuit consists of two parts: one fixed and the other movable around an axis, like in Faraday's experiment. It's useful to make a distinction here. The transition from the fixed part to the movable part, or vice versa, can happen either by simple contact (where the same point on the movable part stays constantly in touch with the same point on the fixed part) or by sliding contact (where the same point on the movable part successively touches various points on the fixed part).

It is only in the second case that there can be continuous rotation. This is what then happens: The system tends to take a position of equilibrium; but, when at the point of reaching that position, the sliding contact puts the movable part in communication with a new point of the fixed part; it changes the connections, it changes therefore the conditions of equilibrium, so that the position of equilibrium fleeing, so to say, before the system which seeks to attain it, rotation may take place indefinitely.

It’s only in the second scenario that continuous rotation can happen. Here’s what occurs: The system tries to reach a state of balance; however, as it approaches that state, the sliding contact connects the movable part to a different point on the fixed part. This alters the connections, and thus the conditions for balance shift, causing the state of balance to elude the system trying to achieve it, allowing rotation to continue indefinitely.

Ampère assumes that the action of the circuit on the movable part of is the same as if the fixed part of did not exist, and therefore as if the current passing through the movable part were open.[Pg 189]

Ampère assumes that the circuit's effect on the movable part of is the same as if the fixed part of were not there, and thus, it's as if the current flowing through the movable part were disconnected.[Pg 189]

He concludes therefore that the action of a closed on an open current, or inversely that of an open current on a closed current, may give rise to a continuous rotation.

He concludes that the action of a closed circuit on an open circuit, or conversely, that of an open circuit on a closed circuit, can lead to a continuous rotation.

But this conclusion depends on the hypothesis I have enunciated and which, as I said above, is not admitted by Helmholtz.

But this conclusion relies on the hypothesis I've stated, which, as I mentioned earlier, Helmholtz does not accept.

4. Mutual Action of Two Open Currents.—In what concerns the mutual actions of two open currents, and in particular that of two elements of current, all experiment breaks down. Ampère has recourse to hypothesis. He supposes:

4. Mutual Action of Two Open Currents.—When it comes to the mutual actions of two open currents, especially regarding two elements of current, all experiments fail. Ampère turns to hypotheses. He assumes:

1º That the mutual action of two elements reduces to a force acting along their join;

1º That the shared interaction of two elements boils down to a force acting along their connection;

2º That the action of two closed currents is the resultant of the mutual actions of their diverse elements, which are besides the same as if these elements were isolated.

2º That the action of two closed currents is the result of the mutual interactions of their different elements, which are also the same as if these elements were isolated.

What is remarkable is that here again Ampère makes these hypotheses unconsciously.

What’s incredible is that once again Ampère makes these hypotheses without even realizing it.

However that may be, these two hypotheses, together with the experiments on closed currents, suffice to determine completely the law of the mutual action of two elements. But then most of the simple laws we have met in the case of closed currents are no longer true.

However that may be, these two hypotheses, along with the experiments on closed currents, are enough to fully determine the law of the mutual action between two elements. However, many of the simple laws we've encountered regarding closed currents no longer hold true.

In the first place, there is no electrodynamic potential; nor was there any, as we have seen, in the case of a closed current acting on an open current.

In the first place, there is no electrodynamic potential; nor was there any, as we have seen, in the case of a closed current acting on an open current.

Next there is, properly speaking, no magnetic force.

Next, there isn't actually any magnetic force.

And, in fact, we have given above three different definitions of this force:

And actually, we have provided three different definitions of this force above:

1º By the action on a magnetic pole;

1º By the action on a magnetic pole;

2º By the director couple which orientates the magnetic needle;

2º By the director couple that guides the magnetic needle;

3º By the action on an element of current.

3º By the action on a current element.

But in the case which now occupies us, not only these three definitions are no longer in harmony, but each has lost its meaning, and in fact:

But in the situation we’re discussing now, not only are these three definitions out of sync, but each one has also lost its meaning, and in fact:

1º A magnetic pole is no longer acted upon simply by a single force applied to this pole. We have seen in fact that the force due to the action of an element of current on a pole is not applied to the pole, but to the element; it may moreover be replaced by a force applied to the pole and by a couple;[Pg 190]

1º A magnetic pole isn't just affected by a single force on it anymore. We've seen that the force from a current element acting on a pole doesn't actually act on the pole itself, but on the element; it can also be replaced by a force acting on the pole along with a couple.[Pg 190]

2º The couple which acts on the magnetic needle is no longer a simple director couple, for its moment with respect to the axis of the needle is not null. It breaks up into a director couple, properly so called, and a supplementary couple which tends to produce the continuous rotation of which we have above spoken;

2º The couple that acts on the magnetic needle is no longer just a simple directing couple, because its moment relative to the axis of the needle is not zero. It splits into a directing couple, properly speaking, and a supplementary couple that tends to create the continuous rotation we mentioned earlier;

3º Finally the force acting on an element of current is not normal to this element.

3º Finally, the force acting on a segment of current is not perpendicular to this segment.

In other words, the unity of the magnetic force has disappeared.

In other words, the unity of the magnetic force is gone.

Let us see in what this unity consists. Two systems which exercise the same action on a magnetic pole will exert also the same action on an indefinitely small magnetic needle, or on an element of current placed at the same point of space as this pole.

Let’s explore what this unity is all about. Two systems that apply the same force on a magnetic pole will also apply the same force on a very small magnetic needle or on a current element located at the same point in space as that pole.

Well, this is true if these two systems contain only closed currents; this would no longer be true if these two systems contained open currents.

Well, this is true if these two systems only have closed currents; it wouldn't be true anymore if these two systems had open currents.

It suffices to remark, for instance, that, if a magnetic pole is placed at A and an element at B, the direction of the element being along the prolongation of the sect AB, this element which will exercise no action on this pole will, on the other hand, exercise an action either on a magnetic needle placed at the point A, or on an element of current placed at the point A.

It’s enough to note, for example, that if a magnetic pole is positioned at A and an element at B, with the element aligned along the extension of the line AB, this element won’t affect the pole but will affect either a magnetic needle located at point A or a current element at point A.

5. Induction.—We know that the discovery of electrodynamic induction soon followed the immortal work of Ampère.

5. Induction.—We know that the discovery of electrodynamic induction quickly came after the groundbreaking work of Ampère.

As long as it is only a question of closed currents there is no difficulty, and Helmholtz has even remarked that the principle of the conservation of energy is sufficient for deducing the laws of induction from the electrodynamic laws of Ampère. But always on one condition, as Bertrand has well shown; that we make besides a certain number of hypotheses.

As long as it's just about closed currents, there’s no issue, and Helmholtz even noted that the principle of energy conservation is enough to derive the laws of induction from Ampère's electrodynamic laws. But there's always one condition, as Bertrand has clearly demonstrated; we also need to make a certain number of hypotheses.

The same principle again permits this deduction in the case of open currents, although of course we can not submit the result to the test of experiment, since we can not produce such currents.

The same principle still allows us to make this deduction in the case of open currents, although we obviously can't test the result through experimentation, since we can't create such currents.

If we try to apply this mode of analysis to Ampère's theory of open currents, we reach results calculated to surprise us.

If we apply this way of分析 to Ampère's theory of open currents, we get results that are likely to surprise us.

In the first place, induction can not be deduced from the variation of the magnetic field by the formula well known to savants and practicians, and, in fact, as we have said, properly speaking there is no longer a magnetic field.[Pg 191]

In the beginning, induction can’t be derived from the changes in the magnetic field using the formula familiar to experts and practitioners. In reality, as we've mentioned, there is effectively no longer a magnetic field.[Pg 191]

But, further, if a circuit C is subjected to the induction of a variable voltaic system S, if this system S be displaced and deformed in any way whatever, so that the intensity of the currents of this system varies according to any law whatever, but that after these variations the system finally returns to its initial situation, it seems natural to suppose that the mean electromotive force induced in the circuit C is null.

But, additionally, if a circuit C is exposed to the induction of a variable voltage system S, and if this system S is changed or distorted in any way, causing the strength of its currents to vary according to any rule, but ultimately the system returns to its original position, it seems reasonable to assume that the average electromotive force generated in the circuit C is zero.

This is true if the circuit C is closed and if the system S contains only closed currents. This would no longer be true, if one accepts the theory of Ampère, if there were open currents. So that not only induction will no longer be the variation of the flow of magnetic force, in any of the usual senses of the word, but it can not be represented by the variation of anything whatever.

This is true if the circuit C is closed and if the system S only has closed currents. This would no longer be the case if we accept Ampère's theory, where there are open currents. As a result, not only will induction not represent the change in magnetic force flow in any of the usual ways, but it also cannot be represented by the change in anything at all.

II. Theory of Helmholtz.—I have dwelt upon the consequences of Ampère's theory, and of his method of explaining open currents.

II. Helmholtz's Theory.—I've focused on the outcomes of Ampère's theory and his approach to explaining open currents.

It is difficult to overlook the paradoxical and artificial character of the propositions to which we are thus led. One can not help thinking 'that can not be so.'

It’s hard to ignore the contradictory and artificial nature of the ideas we’re presented with. You can’t help but think, “That can’t be right.”

We understand therefore why Helmholtz was led to seek something else.

We understand why Helmholtz was driven to look for something more.

Helmholtz rejects Ampère's fundamental hypothesis, to wit, that the mutual action of two elements of current reduces to a force along their join. He assumes that an element of current is not subjected to a single force, but to a force and a couple. It is just this which gave rise to the celebrated polemic between Bertrand and Helmholtz.

Helmholtz disagrees with Ampère's main idea, which is that the mutual interaction between two current elements results in a force along their connection. He believes that a current element is influenced not just by one force, but by both a force and a couple. This difference in views is what sparked the famous debate between Bertrand and Helmholtz.

Helmholtz replaces Ampère's hypothesis by the following: two elements always admit of an electrodynamic potential depending solely on their position and orientation; and the work of the forces that they exercise, one on the other, is equal to the variation of this potential. Thus Helmholtz can no more do without hypothesis than Ampère; but at least he does not make one without explicitly announcing it.

Helmholtz replaces Ampère's hypothesis with this: two elements always have an electrodynamic potential that depends only on their position and orientation; and the work done by the forces they exert on each other is equal to the change in this potential. So, Helmholtz relies on a hypothesis just like Ampère does; however, at least he clearly states it when he presents it.

In the case of closed currents, which are alone accessible to experiment, the two theories agree.

In the case of closed currents, which can only be tested through experiments, the two theories align.

In all other cases they differ.

In all other cases, they are different.

In the first place, contrary to what Ampère supposed, the force[Pg 192] which seems to act on the movable portion of a closed current is not the same as would act upon this movable portion if it were isolated and constituted an open current.

In the first place, unlike what Ampère thought, the force[Pg 192] that seems to act on the moving part of a closed circuit is not the same as the force that would act on this moving part if it were separate and formed an open circuit.

Let us return to the circuit , of which we spoke above, and which was formed of a movable wire αβ sliding on a fixed wire. In the only experiment that can be made, the movable portion αβ is not isolated, but is part of a closed circuit. When it passes from AB to A´B´, the total electrodynamic potential varies for two reasons:

Let’s go back to the circuit , which we mentioned earlier, made up of a movable wire αβ sliding on a fixed wire. In the one experiment we can conduct, the movable part αβ isn’t isolated; it’s part of a closed circuit. As it moves from AB to A´B´, the overall electrodynamic potential changes for two reasons:

1º It undergoes a first increase because the potential of A´B´ with respect to the circuit C is not the same as that of AB;

1º It goes through a first increase because the potential of A´B´ in relation to the circuit C is not the same as that of AB;

2º It takes a second increment because it must be increased by the potentials of the elements AA´, BB´ with respect to C.

2º It takes a second increase because it has to be raised by the potentials of the elements AA´, BB´ in relation to C.

It is this double increment which represents the work of the force to which the portion AB seems subjected.

It is this double increase that represents the work done by the force that the section AB appears to be under.

If, on the contrary, αβ were isolated, the potential would undergo only the first increase, and this first increment alone would measure the work of the force which acts on AB.

If, on the other hand, αβ were isolated, the potential would only see the first increase, and this initial increase alone would represent the work done by the force acting on AB.

In the second place, there could be no continuous rotation without sliding contact, and, in fact, that, as we have seen à propos of closed currents, is an immediate consequence of the existence of an electrodynamic potential.

In addition, there can't be continuous rotation without sliding contact, and, as we've seen à propos of closed currents, this is a direct result of the presence of an electrodynamic potential.

In Faraday's experiment, if the magnet is fixed and if the part of the current exterior to the magnet runs along a movable wire, that movable part may undergo a continuous rotation. But this does not mean to say that if the contacts of the wire with the magnet were suppressed, and an open current were to run along the wire, the wire would still take a movement of continuous rotation.

In Faraday's experiment, if the magnet is stationary and the part of the current outside the magnet flows through a movable wire, that movable part can rotate continuously. However, this doesn’t imply that if the contacts of the wire with the magnet were removed, and an open current flowed through the wire, the wire would still exhibit continuous rotational movement.

I have just said in fact that an isolated element is not acted upon in the same way as a movable element making part of a closed circuit.

I just mentioned that an isolated element isn't influenced in the same way as a movable element that is part of a closed circuit.

Another difference: The action of a closed solenoid on a closed current is null according to experiment and according to the two theories. Its action on an open current would be null according to Ampère; it would not be null according to Helmholtz. From this follows an important consequence. We have given above three definitions of magnetic force. The third has[Pg 193] no meaning here since an element of current is no longer acted upon by a single force. No more has the first any meaning. What, in fact, is a magnetic pole? It is the extremity of an indefinite linear magnet. This magnet may be replaced by an indefinite solenoid. For the definition of magnetic force to have any meaning, it would be necessary that the action exercised by an open current on an indefinite solenoid should depend only on the position of the extremity of this solenoid, that is to say, that the action on a closed solenoid should be null. Now we have just seen that such is not the case.

Another difference: The effect of a closed solenoid on a closed current is zero based on experiments and both theories. Its effect on an open current would be zero according to Ampère, but not according to Helmholtz. This leads to an important consequence. We previously provided three definitions of magnetic force. The third has[Pg 193] no relevance here since a current element is no longer influenced by a single force. The first also holds no meaning. So, what exactly is a magnetic pole? It’s the end of an indefinite linear magnet. This magnet can be replaced by an indefinite solenoid. For the definition of magnetic force to be valid, it would need for the effect of an open current on an indefinite solenoid to depend only on the position of the end of this solenoid, meaning that the effect on a closed solenoid should be zero. However, we have just shown that this is not the case.

On the other hand, nothing prevents our adopting the second definition, which is founded on the measurement of the director couple which tends to orientate the magnetic needle.

On the other hand, nothing stops us from adopting the second definition, which is based on measuring the director couple that tends to orient the magnetic needle.

But if it is adopted, neither the effects of induction nor the electrodynamic effects will depend solely on the distribution of the lines of force in this magnetic field.

But if it's accepted, neither the effects of induction nor the electrodynamic effects will depend only on how the lines of force are distributed in this magnetic field.

III. Difficulties Raised by These Theories.—The theory of Helmholtz is in advance of that of Ampère; it is necessary, however, that all the difficulties should be smoothed away. In the one as in the other, the phrase 'magnetic field' has no meaning, or, if we give it one, by a more or less artificial convention, the ordinary laws so familiar to all electricians no longer apply; thus the electromotive force induced in a wire is no longer measured by the number of lines of force met by this wire.

III. Challenges Presented by These Theories.—Helmholtz's theory is more advanced than Ampère's; however, all the challenges need to be addressed. In both theories, the term 'magnetic field' is ambiguous, or if we assign it a meaning through somewhat artificial conventions, the usual laws that all electricians know no longer hold true; for instance, the electromotive force induced in a wire can no longer be measured by the number of lines of force intersecting that wire.

And our repugnance does not come alone from the difficulty of renouncing inveterate habits of language and of thought. There is something more. If we do not believe in action at a distance, electrodynamic phenomena must be explained by a modification of the medium. It is precisely this modification that we call 'magnetic field.' And then the electrodynamic effects must depend only on this field.

And our dislike doesn't just stem from the challenge of giving up long-standing habits of language and thought. There's more to it. If we don't accept action at a distance, then we have to explain electrodynamic phenomena through a change in the medium. It's this change that we refer to as the 'magnetic field.' Consequently, the electrodynamic effects must rely solely on this field.

All these difficulties arise from the hypothesis of open currents.

All these difficulties come from the idea of open currents.

IV. Maxwell's Theory.—Such were the difficulties raised by the dominant theories when Maxwell appeared, who with a stroke of the pen made them all vanish. To his mind, in fact, all currents are closed currents. Maxwell assumes that if in a dielectric the electric field happens to vary, this dielectric becomes the seat of a particular phenomenon, acting on the[Pg 194] galvanometer like a current, and which he calls current of displacement.

IV. Maxwell's Theory.—These were the challenges posed by the prevailing theories when Maxwell arrived, and with one stroke of his pen, he made them all disappear. To him, all currents are actually closed currents. Maxwell suggests that if the electric field in a dielectric changes, this dielectric experiences a specific phenomenon that acts on the [Pg 194] galvanometer like a current, which he refers to as the current of displacement.

If then two conductors bearing contrary charges are put in communication by a wire, in this wire during the discharge there is an open current of conduction; but there are produced at the same time in the surrounding dielectric, currents of displacement which close this current of conduction.

If two conductors with opposite charges are connected by a wire, there will be an open current of conduction in the wire during the discharge. At the same time, displacement currents are generated in the surrounding dielectric, which help close this conduction current.

We know that Maxwell's theory leads to the explanation of optical phenomena, which would be due to extremely rapid electrical oscillations.

We know that Maxwell's theory explains optical phenomena, which are caused by extremely fast electrical oscillations.

At that epoch such a conception was only a bold hypothesis, which could be supported by no experiment.

At that time, such an idea was just a daring theory that had no experimental support.

At the end of twenty years, Maxwell's ideas received the confirmation of experiment. Hertz succeeded in producing systems of electric oscillations which reproduce all the properties of light, and only differ from it by the length of their wave; that is to say as violet differs from red. In some measure he made the synthesis of light.

At the end of twenty years, Maxwell's ideas were confirmed by experiments. Hertz managed to create systems of electric oscillations that replicate all the properties of light, differing from it only in the wavelength; that is to say, as violet differs from red. In a way, he created the synthesis of light.

It might be said that Hertz has not demonstrated directly Maxwell's fundamental idea, the action of the current of displacement on the galvanometer. This is true in a sense. What he has shown in sum is that electromagnetic induction is not propagated instantaneously as was supposed; but with the speed of light.

It could be argued that Hertz hasn’t directly proven Maxwell's key concept, which is the effect of the displacement current on the galvanometer. This is somewhat accurate. What he has essentially demonstrated is that electromagnetic induction doesn’t occur instantly as was previously thought; instead, it travels at the speed of light.

But to suppose there is no current of displacement, and induction is propagated with the speed of light; or to suppose that the currents of displacement produce effects of induction, and that the induction is propagated instantaneously, comes to the same thing.

But if we think there’s no flow of displacement, and that induction travels at the speed of light; or if we think that the displacement currents create induction effects and that induction spreads instantly, it amounts to the same thing.

This can not be seen at the first glance, but it is proved by an analysis of which I must not think of giving even a summary here.

This isn’t obvious at first glance, but it’s shown through an analysis that I won’t even try to summarize here.

V. Rowland's Experiment.—But as I have said above, there are two kinds of open conduction currents. There are first the currents of discharge of a condenser or of any conductor whatever.

V. Rowland's Experiment.—But as I mentioned earlier, there are two types of open conduction currents. First, there are the discharge currents from a capacitor or any conductor.

There are also the cases in which electric discharges describe[Pg 195] a closed contour, being displaced by conduction in one part of the circuit and by convection in the other part.

There are also situations where electric discharges form[Pg 195] a closed loop, moving through conduction in one section of the circuit and through convection in another section.

For open currents of the first sort, the question might be considered as solved; they were closed by the currents of displacement.

For open currents of the first kind, the question could be seen as resolved; they were blocked by displacement currents.

For open currents of the second sort, the solution appeared still more simple. It seemed that if the current were closed, it could only be by the current of convection itself. For that it sufficed to assume that a 'convection current,' that is to say a charged conductor in motion, could act on the galvanometer.

For open currents of the second type, the solution seemed even simpler. It appeared that if the current was closed, it could only be by the convection current itself. To support this, it was enough to assume that a "convection current," which means a charged conductor in motion, could affect the galvanometer.

But experimental confirmation was lacking. It appeared difficult in fact to obtain a sufficient intensity even by augmenting as much as possible the charge and the velocity of the conductors. It was Rowland, an extremely skillful experimenter, who first triumphed over these difficulties. A disc received a strong electrostatic charge and a very great speed of rotation. An astatic magnetic system placed beside the disc underwent deviations.

But there was no experimental proof. It seemed really hard to achieve a strong enough intensity, even by increasing the charge and speed of the conductors as much as possible. Rowland, a highly skilled experimenter, was the first to overcome these challenges. A disc was given a powerful electrostatic charge and spun at a very high speed. An astatic magnetic system positioned next to the disc experienced deviations.

The experiment was made twice by Rowland, once in Berlin, once in Baltimore. It was afterwards repeated by Himstedt. These physicists even announced that they had succeeded in making quantitative measurements.

The experiment was conducted twice by Rowland, once in Berlin and once in Baltimore. It was later repeated by Himstedt. These physicists even claimed they had succeeded in making quantitative measurements.

In fact, for twenty years Rowland's law was admitted without objection by all physicists. Besides everything seemed to confirm it. The spark certainly does produce a magnetic effect. Now does it not seem probable that the discharge by spark is due to particles taken from one of the electrodes and transferred to the other electrode with their charge? Is not the very spectrum of the spark, in which we recognize the lines of the metal of the electrode, a proof of it? The spark would then be a veritable current of convection.

In fact, for twenty years, Rowland's law was accepted without question by all physicists. Everything seemed to back it up. The spark definitely produces a magnetic effect. Now, doesn’t it seem likely that the discharge from the spark is caused by particles taken from one of the electrodes and moved to the other electrode along with their charge? Isn’t the very spectrum of the spark, where we can identify the lines of the metal from the electrode, evidence of this? The spark would then be a true convection current.

On the other hand, it is also admitted that in an electrolyte the electricity is carried by the ions in motion. The current in an electrolyte would therefore be also a current of convection; now, it acts on the magnetic needle.

On the other hand, it's also acknowledged that in an electrolyte, electricity is carried by moving ions. The current in an electrolyte is therefore also a convection current; and it affects the magnetic needle.

The same for cathode rays. Crookes attributed these rays to a very subtile matter charged with electricity and moving with a very great velocity. He regarded them, in other words, as currents of convection. Now these cathode rays are[Pg 196] deviated by the magnet. In virtue of the principle of action and reaction, they should in turn deviate the magnetic needle. It is true that Hertz believed he had demonstrated that the cathode rays do not carry electricity, and that they do not act on the magnetic needle. But Hertz was mistaken. First of all, Perrin succeeded in collecting the electricity carried by these rays, electricity of which Hertz denied the existence; the German scientist appears to have been deceived by effects due to the action of X-rays, which were not yet discovered. Afterwards, and quite recently, the action of the cathode rays on the magnetic needle has been put in evidence.

The same goes for cathode rays. Crookes believed these rays were made up of a very fine matter charged with electricity and moving at a high speed. He considered them, in other words, as convection currents. Now these cathode rays are[Pg 196] deflected by a magnet. According to the principle of action and reaction, they should also deflect the magnetic needle. It’s true that Hertz thought he had proven that cathode rays don’t carry electricity and that they don’t affect the magnetic needle. But Hertz was wrong. First of all, Perrin was able to collect the electricity carried by these rays, electricity that Hertz denied existed; the German scientist seems to have been misled by effects caused by X-rays, which hadn’t been discovered yet. More recently, the influence of cathode rays on the magnetic needle has been demonstrated.

Thus all these phenomena regarded as currents of convection, sparks, electrolytic currents, cathode rays, act in the same manner on the galvanometer and in conformity with Rowland's law.

Thus all these phenomena, considered as convection currents, sparks, electrolytic currents, and cathode rays, behave in the same way on the galvanometer and in accordance with Rowland's law.

VI. Theory of Lorentz.—We soon went farther. According to the theory of Lorentz, currents of conduction themselves would be true currents of convection. Electricity would remain inseparably connected with certain material particles called electrons. The circulation of these electrons through bodies would produce voltaic currents. And what would distinguish conductors from insulators would be that the one could be traversed by these electrons while the others would arrest their movements.

VI. Lorentz Theory.—We quickly moved on. According to Lorentz's theory, conduction currents would actually be convection currents. Electricity would always be linked to specific material particles called electrons. The flow of these electrons through materials would create voltaic currents. The key difference between conductors and insulators would be that conductors allow these electrons to move through them, while insulators would stop their movement.

The theory of Lorentz is very attractive. It gives a very simple explanation of certain phenomena which the earlier theories, even Maxwell's in its primitive form, could not explain in a satisfactory way; for example, the aberration of light, the partial carrying away of luminous waves, magnetic polarization and the Zeeman effect.

The Lorentz theory is quite appealing. It provides a straightforward explanation for certain phenomena that earlier theories, even Maxwell's in its basic form, couldn't adequately explain; for instance, the aberration of light, the partial carrying away of light waves, magnetic polarization, and the Zeeman effect.

Some objections still remained. The phenomena of an electric system seemed to depend on the absolute velocity of translation of the center of gravity of this system, which is contrary to the idea we have of the relativity of space. Supported by M. Crémieu, M. Lippmann has presented this objection in a striking form. Imagine two charged conductors with the same velocity of translation; they are relatively at rest. However, each of them being equivalent to a current of convection, they ought to attract one another, and by measuring this attraction we could measure their absolute velocity.[Pg 197]

Some objections still remained. The behavior of an electric system seemed to depend on the absolute speed of the center of gravity of this system, which goes against our understanding of the relativity of space. Supported by M. Crémieu, M. Lippmann has presented this objection in a compelling way. Imagine two charged conductors moving at the same speed; they are relatively at rest. However, since each of them can be thought of as a current of convection, they should attract each other, and by measuring this attraction, we could determine their absolute speed.[Pg 197]

"No!" replied the partisans of Lorentz. "What we could measure in that way is not their absolute velocity, but their relative velocity with respect to the ether, so that the principle of relativity is safe."

"No!" replied the supporters of Lorentz. "What we could measure that way isn't their absolute speed, but their relative speed compared to the ether, so the principle of relativity remains intact."

Whatever there may be in these latter objections, the edifice of electrodynamics, at least in its broad lines, seemed definitively constructed. Everything was presented under the most satisfactory aspect. The theories of Ampère and of Helmholtz, made for open currents which no longer existed, seemed to have no longer anything but a purely historic interest, and the inextricable complications to which these theories led were almost forgotten.

Whatever may be in these later objections, the structure of electrodynamics, at least in its overall framework, appeared to be firmly established. Everything seemed to be presented in the most satisfying way. The theories of Ampère and Helmholtz, designed for open currents that no longer existed, seemed to hold only historical interest now, and the complex issues these theories created were almost forgotten.

This quiescence has been recently disturbed by the experiments of M. Crémieu, which for a moment seemed to contradict the result previously obtained by Rowland.

This calm has recently been disrupted by M. Crémieu's experiments, which momentarily appeared to contradict the results previously achieved by Rowland.

But fresh researches have not confirmed them, and the theory of Lorentz has victoriously stood the test.

But recent studies haven't confirmed them, and Lorentz's theory has successfully passed the test.

The history of these variations will be none the less instructive; it will teach us to what pitfalls the scientist is exposed, and how he may hope to escape them.

The history of these variations will still be educational; it will show us the traps scientists can fall into and how they might avoid them.


 

THE VALUE OF SCIENCE

 


TRANSLATOR'S INTRODUCTION

1. Does the Scientist create Science?—Professor Rados of Budapest in his report to the Hungarian Academy of Science on the award to Poincaré of the Bolyai prize of ten thousand crowns, speaking of him as unquestionably the most powerful investigator in the domain of mathematics and mathematical physics, characterized him as the intuitive genius drawing the inspiration for his wide-reaching researches from the exhaustless fountain of geometric and physical intuition, yet working this inspiration out in detail with marvelous logical keenness. With his brilliant creative genius was combined the capacity for sharp and successful generalization, pushing far out the boundaries of thought in the most widely different domains, so that his works must be ranked with the greatest mathematical achievements of all time. "Finally," says Rados, "permit me to make especial mention of his intensely interesting book, 'The Value of Science,' in which he in a way has laid down the scientist's creed." Now what is this creed?

1. Does the Scientist create Science?—Professor Rados from Budapest, in his report to the Hungarian Academy of Science about Poincaré receiving the Bolyai prize of ten thousand crowns, described him as undeniably the most powerful researcher in mathematics and mathematical physics. He portrayed Poincaré as an intuitive genius who draws inspiration for his extensive research from an endless source of geometric and physical intuition, while also elaborating on this inspiration with incredible logical precision. Alongside his brilliant creative talent, he had the ability to generalize sharply and effectively, expanding the boundaries of thought across various fields, making his work among the greatest mathematical achievements of all time. "Finally," Rados states, "I want to specifically highlight his incredibly engaging book, 'The Value of Science,' where he has, in a way, outlined the scientist's creed." So, what is this creed?

Sense may act as stimulus, as suggestive, yet not to awaken a dormant depiction, or to educe the conception of an archetypal form, but rather to strike the hour for creation, to summon to work a sculptor capable of smoothing a Venus of Milo out of the formless clay. Knowledge is not a gift of bare experience, nor even made solely out of experience. The creative activity of mind is in mathematics particularly clear. The axioms of geometry are conventions, disguised definitions or unprovable hypotheses precreated by auto-active animal and human minds. Bertrand Russell says of projective geometry: "It takes nothing from experience, and has, like arithmetic, a creature of the pure intellect for its object. It deals with an object whose properties are logically deduced from its definition, not empirically discovered from data." Then does the scientist create science? This is a question Poincaré here dissects with a master hand.

Sense can act as a trigger or a suggestion, but not to awaken a hidden image or to bring forth the idea of an ideal form. Instead, it's meant to signal the moment for creation, to call forth a sculptor who can shape a Venus of Milo out of formless clay. Knowledge isn't just a product of raw experience, nor is it solely built from it. The creative process of the mind is especially evident in mathematics. The axioms of geometry are conventions, hidden definitions, or unprovable hypotheses created by active animal and human minds. Bertrand Russell describes projective geometry as: "It takes nothing from experience, and has, like arithmetic, a creation of pure intellect for its object. It deals with an object whose properties are logically deduced from its definition, not empirically discovered from data." So, does the scientist create science? This is a question Poincaré skillfully explores here.

The physiologic-psychologic investigation of the space problem [Pg 202]must give the meaning of the words geometric fact, geometric reality. Poincaré here subjects to the most successful analysis ever made the tridimensionality of our space.

The physiological and psychological study of the space problem [Pg 202]must clarify the meaning of the terms geometric fact and geometric reality. Poincaré provides the most effective analysis ever conducted on the three-dimensionality of our space.

2. The Mind Dispelling Optical Illusions.—Actual perception of spatial properties is accompanied by movements corresponding to its character. In the case of optical illusions, with the so-called false perceptions eye-movements are closely related. But though the perceived object and its environment remain constant, the sufficiently powerful mind can, as we say, dispel these illusions, the perception itself being creatively changed. Photo-graphs taken at intervals during the presence of these optical illusions, during the change, perhaps gradual and unconscious, in the perception, and after these illusions have, as the phrase is, finally disappeared, show quite clearly that changes in eye-movements corresponding to those internally created in perception itself successively occur. What is called accuracy of movement is created by what is called correctness of perception. The higher creation in the perception is the determining cause of an improvement, a precision in the motion. Thus we see correct perception in the individual helping to make that cerebral organization and accurate motor adjustment on which its possibility and permanence seem in so far to depend. So-called correct perception is connected with a long-continued process of perceptual education motived and initiated from within. How this may take place is here illustrated at length by our author.

2. The Mind Dispelling Optical Illusions.—Our actual perception of spatial properties is linked to corresponding movements. When it comes to optical illusions, these so-called false perceptions are closely related to eye movements. However, even though the object we perceive and its surroundings stay the same, a sufficiently powerful mind can, as we say, dispel these illusions, creatively altering the perception itself. Photographs taken at intervals during the presence of these optical illusions, as well as during the gradual and often unconscious changes in perception, and after these illusions have finally disappeared, clearly show that changes in eye movements correspond to those internal shifts in perception. What we call accuracy of movement is created by what we call correctness of perception. The higher level of perception is the key reason for the improvement and precision in movement. Thus, we see that correct perception in an individual aids in developing the brain’s organization and the precise motor adjustments necessary for its viability and consistency. This so-called correct perception is tied to a long-term process of perceptual education that is motivated and initiated from within. Our author illustrates how this may occur in detail.

3. Euclid not Necessary.—Geometry is a construction of the intellect, in application not certain but convenient. As Schiller says, when we see these facts as clearly as the development of metageometry has compelled us to see them, we must surely confess that the Kantian account of space is hopelessly and demonstrably antiquated. As Royce says in 'Kant's Doctrine of the Basis of Mathematics,' "That very use of intuition which Kant regarded as geometrically ideal, the modern geometer regards as scientifically defective, because surreptitious. No mathematical exactness without explicit proof from assumed principles—such is the motto of the modern geometer. But suppose the reasoning of Euclid purified of this comparatively surreptitious [Pg 203]appeal to intuition. Suppose that the principles of geometry are made quite explicit at the outset of the treatise, as Pieri and Hilbert or Professor Halsted or Dr. Veblen makes his principles explicit in his recent treatment of geometry. Then, indeed, geometry becomes for the modern mathematician a purely rational science. But very few students of the logic of mathematics at the present time can see any warrant in the analysis of geometrical truth for regarding just the Euclidean system of principles as possessing any discoverable necessity." Yet the environmental and perhaps hereditary premiums on Euclid still make even the scientist think Euclid most convenient.

3. Euclid Not Necessary.—Geometry is a product of the mind, useful but not always certain. As Schiller points out, once we recognize these truths as clearly as advancements in metageometry have forced us to, we have to admit that Kant's view of space is outdated and clearly incorrect. As Royce states in 'Kant's Doctrine of the Basis of Mathematics,' "What Kant considered an ideal use of intuition, the modern geometer views as scientifically flawed because it's hidden. There can be no mathematical accuracy without clear proof based on assumed principles—that's the mantra of the modern geometer. However, if we imagine Euclid's reasoning stripped of this somewhat hidden reliance on intuition, and if the principles of geometry are laid out clearly at the start of the work, as Pieri, Hilbert, Professor Halsted, or Dr. Veblen do in their recent approaches to geometry, then geometry truly becomes a rational science for the modern mathematician. Yet, very few students of mathematical logic today find any justification in the analysis of geometric truth for considering just the Euclidean system of principles to have any necessary foundation." Still, the traditional and possibly inherited preference for Euclid makes even scientists see Euclid as the most practical option.

4. Without Hypotheses, no Science.—Nobody ever observed an equidistantial, but also nobody ever observed a straight line. Emerson's Uriel

4. Without Hypotheses, no Science.—No one has ever seen an equidistantial, but no one has ever seen a straight line either. Emerson's Uriel

"Gave his sentiment divine
Against the being of a line.
Line in Nature is not found."

"Gave his feelings a divine touch
Against the existence of a line.
A line is not found in Nature."

Clearly not, being an eject from man's mind. What is called 'a knowledge of facts' is usually merely a subjective realization that the old hypotheses are still sufficiently elastic to serve in some domain; that is, with a sufficiency of conscious or unconscious omissions and doctorings and fudgings more or less wilful. In the present book we see the very foundation rocks of science, the conservation of energy and the indestructibility of matter, beating against the bars of their cages, seemingly anxious to take wing away into the empyrean, to chase the once divine parallel postulate broken loose from Euclid and Kant.

Clearly not, it's an ejection from the human mind. What’s referred to as 'knowledge of facts' is usually just a personal realization that the old theories are still flexible enough to work in some areas; that is, with enough conscious or unconscious omissions and adjustments, more or less intentional. In this book, we see the very foundational principles of science, the conservation of energy and the indestructibility of matter, pressing against the limits of their confines, seemingly eager to soar into the sky, to pursue the once divine parallel postulate that has broken free from Euclid and Kant.

5. What Outcome?—What now is the definite, the permanent outcome? What new islets raise their fronded palms in air within thought's musical domain? Over what age-gray barriers rise the fragrant floods of this new spring-tide, redolent of the wolf-haunted forest of Transylvania, of far Erdély's plunging river, Maros the bitter, or broad mother Volga at Kazan? What victory heralded the great rocket for which young Lobachevski, the widow's son, was cast into prison? What severing of age-old mental fetters symbolized young Bolyai's cutting-off with his[Pg 204] Damascus blade the spikes driven into his door-post, and strewing over the sod the thirteen Austrian cavalry officers? This book by the greatest mathematician of our time gives weightiest and most charming answer.

5. What Outcome?—What is the clear, permanent outcome now? What new islands raise their leafy palms in the air within the realm of thought? Over what ancient barriers do the fragrant floods of this new spring tide rise, reminiscent of the wolf-infested forests of Transylvania, of far Erdély's rushing river, the bitter Maros, or the wide mother Volga at Kazan? What victory announced the great event for which young Lobachevski, the widow's son, was imprisoned? What breaking of long-standing mental chains did young Bolyai's decisive action represent when he used his Damascus blade to cut the spikes driven into his doorpost and scattered the thirteen Austrian cavalry officers over the ground? This book by the greatest mathematician of our time provides the most significant and enchanting answer.

George Bruce Halsted.

George Bruce Halsted.


INTRODUCTION

The search for truth should be the goal of our activities; it is the sole end worthy of them. Doubtless we should first bend our efforts to assuage human suffering, but why? Not to suffer is a negative ideal more surely attained by the annihilation of the world. If we wish more and more to free man from material cares, it is that he may be able to employ the liberty obtained in the study and contemplation of truth.

The search for truth should be the goal of what we do; it's the only worthy purpose. Of course, we should first focus our efforts on easing human suffering, but why? Not suffering is a negative ideal that can definitely be achieved by destroying the world. If we want to relieve people of material concerns, it’s so they can use that freedom to study and reflect on the truth.

But sometimes truth frightens us. And in fact we know that it is sometimes deceptive, that it is a phantom never showing itself for a moment except to ceaselessly flee, that it must be pursued further and ever further without ever being attained. Yet to work one must stop, as some Greek, Aristotle or another, has said. We also know how cruel the truth often is, and we wonder whether illusion is not more consoling, yea, even more bracing, for illusion it is which gives confidence. When it shall have vanished, will hope remain and shall we have the courage to achieve? Thus would not the horse harnessed to his treadmill refuse to go, were his eyes not bandaged? And then to seek truth it is necessary to be independent, wholly independent. If, on the contrary, we wish to act, to be strong, we should be united. This is why many of us fear truth; we consider it a cause of weakness. Yet truth should not be feared, for it alone is beautiful.

But sometimes the truth scares us. We know it can be misleading, a ghost that only shows itself briefly before it runs away, something we have to chase endlessly without ever catching. But to get anything done, we have to take a break, as some Greek philosopher, Aristotle or someone else, once said. We also understand how harsh the truth can be, and we wonder if illusion isn't more comforting, maybe even more energizing, because it's illusion that gives us confidence. Once it disappears, will there be hope left and will we have the guts to succeed? Wouldn't a horse on a treadmill refuse to move if its eyes weren’t covered? To seek the truth, you need to be independent, completely independent. But if we want to act and be strong, we should stick together. That's why many of us are scared of the truth; we see it as a sign of weakness. Yet, truth shouldn’t be feared, because it's the only thing that’s truly beautiful.

When I speak here of truth, assuredly I refer first to scientific truth; but I also mean moral truth, of which what we call justice is only one aspect. It may seem that I am misusing words, that I combine thus under the same name two things having nothing in common; that scientific truth, which is demonstrated, can in no way be likened to moral truth, which is felt. And yet I can not separate them, and whosoever loves the one can not help loving the other. To find the one, as well as to find the other, it is necessary to free the soul completely from prejudice and from passion; it is necessary to attain absolute sincerity. These two sorts of[Pg 206] truth when discovered give the same joy; each when perceived beams with the same splendor, so that we must see it or close our eyes. Lastly, both attract us and flee from us; they are never fixed: when we think to have reached them, we find that we have still to advance, and he who pursues them is condemned never to know repose. It must be added that those who fear the one will also fear the other; for they are the ones who in everything are concerned above all with consequences. In a word, I liken the two truths, because the same reasons make us love them and because the same reasons make us fear them.

When I talk about truth here, I’m clearly referring first to scientific truth; but I also mean moral truth, of which what we call justice is just one aspect. It might seem like I’m mixing things up, putting under the same label two things that have nothing to do with each other; that scientific truth, which is proven, can’t be compared to moral truth, which is felt. Still, I can’t separate them, and whoever loves one can’t help but love the other. To discover either one, you need to completely free your mind from bias and emotion; you need to achieve total honesty. These two types of[Pg 206] truth, when found, bring the same joy; each shines with the same brilliance, making it impossible to miss or ignore. Lastly, both draw us in and elude us; they’re never static: when we think we’ve grasped them, we realize we still have more to discover, and those who chase them are destined to never find peace. It should also be noted that those who fear one will fear the other; they are the ones most focused on the consequences in everything. In short, I compare the two truths because the same reasons lead us to love them and the same reasons lead us to fear them.

If we ought not to fear moral truth, still less should we dread scientific truth. In the first place it can not conflict with ethics. Ethics and science have their own domains, which touch but do not interpenetrate. The one shows us to what goal we should aspire, the other, given the goal, teaches us how to attain it. So they can never conflict since they can never meet. There can no more be immoral science than there can be scientific morals.

If we shouldn't fear moral truth, we definitely shouldn't fear scientific truth. First of all, it can't go against ethics. Ethics and science each have their own areas that touch on each other but don't mix. Ethics shows us the goals we should aim for, while science, once we have those goals, teaches us how to achieve them. So they can never conflict since they don't overlap. There can't be immoral science any more than there can be scientific morals.

But if science is feared, it is above all because it can not give us happiness. Of course it can not. We may even ask whether the beast does not suffer less than man. But can we regret that earthly paradise where man brute-like was really immortal in knowing not that he must die? When we have tasted the apple, no suffering can make us forget its savor. We always come back to it. Could it be otherwise? As well ask if one who has seen and is blind will not long for the light. Man, then, can not be happy through science, but to-day he can much less be happy without it.

But if people are afraid of science, it's mainly because it can't give us happiness. Of course, it can't. We might even wonder if animals suffer less than humans. But can we truly miss that earthly paradise where humans, like beasts, were really immortal because they didn't know they had to die? Once we've tasted the apple, no amount of suffering can make us forget how it tasted. We always return to it. Could it be any other way? It's like asking if someone who has seen the light and then goes blind won't long for it. So, humans can't find happiness through science, but today, they definitely can't be happy without it.

But if truth be the sole aim worth pursuing, may we hope to attain it? It may well be doubted. Readers of my little book 'Science and Hypothesis' already know what I think about the question. The truth we are permitted to glimpse is not altogether what most men call by that name. Does this mean that our most legitimate, most imperative aspiration is at the same time the most vain? Or can we, despite all, approach truth on some side? This it is which must be investigated.

But if truth is the only goal worth chasing, can we hope to achieve it? That is certainly debatable. Readers of my little book 'Science and Hypothesis' already know my thoughts on this issue. The truth we get to see is not quite what most people refer to as such. Does this mean that our most valid and essential desire is also the most futile? Or can we, despite everything, get closer to the truth in some way? This is what needs to be explored.

In the first place, what instrument have we at our disposal for this conquest? Is not human intelligence, more specifically the[Pg 207] intelligence of the scientist, susceptible of infinite variation? Volumes could be written without exhausting this subject; I, in a few brief pages, have only touched it lightly. That the geometer's mind is not like the physicist's or the naturalist's, all the world would agree; but mathematicians themselves do not resemble each other; some recognize only implacable logic, others appeal to intuition and see in it the only source of discovery. And this would be a reason for distrust. To minds so unlike can the mathematical theorems themselves appear in the same light? Truth which is not the same for all, is it truth? But looking at things more closely, we see how these very different workers collaborate in a common task which could not be achieved without their cooperation. And that already reassures us.

First of all, what resources do we have available for this challenge? Isn't human intelligence, specifically the intelligence of scientists, capable of endless variations? Entire volumes could be written on this topic without covering it all; I've only scratched the surface in these few pages. Everyone would agree that a geometer's mind is different from a physicist's or a naturalist's. However, even mathematicians don't all think alike; some rely solely on strict logic, while others turn to intuition, viewing it as the main source of discovery. This might raise some doubts. Can such vastly different minds really see mathematical theorems in the same way? Is truth that isn’t universal truly truth? Yet, upon closer inspection, we see how these diverse thinkers work together toward a common goal that could not be accomplished without their collaboration. That alone brings us some comfort.

Next must be examined the frames in which nature seems enclosed and which are called time and space. In 'Science and Hypothesis' I have already shown how relative their value is; it is not nature which imposes them upon us, it is we who impose them upon nature because we find them convenient. But I have spoken of scarcely more than space, and particularly quantitative space, so to say, that is of the mathematical relations whose aggregate constitutes geometry. I should have shown that it is the same with time as with space and still the same with 'qualitative space'; in particular, I should have investigated why we attribute three dimensions to space. I may be pardoned then for taking up again these important questions.

Next, we need to look at the frameworks in which nature seems to exist, known as time and space. In 'Science and Hypothesis,' I already pointed out how relative their significance is; it’s not nature that forces these concepts upon us, but rather we impose them on nature because we find them useful. However, I’ve mostly discussed space, particularly quantitative space, which refers to the mathematical relationships that form geometry. I should have demonstrated that time is similar to space and that 'qualitative space' follows the same principle; specifically, I should have explored why we attribute three dimensions to space. So, I hope you’ll excuse me for revisiting these important questions.

Is mathematical analysis, then, whose principal object is the study of these empty frames, only a vain play of the mind? It can give to the physicist only a convenient language; is this not a mediocre service, which, strictly speaking, could be done without; and even is it not to be feared that this artificial language may be a veil interposed between reality and the eye of the physicist? Far from it; without this language most of the intimate analogies of things would have remained forever unknown to us; and we should forever have been ignorant of the internal harmony of the world, which is, we shall see, the only true objective reality.

Is mathematical analysis, whose main focus is the examination of these empty frameworks, just a pointless exercise for the mind? It provides the physicist with a convenient language; isn't this a mediocre function that could technically be done without? And shouldn't we worry that this constructed language might act as a barrier between reality and the physicist's perception? Not at all; without this language, many of the deep connections between things would have remained unknown to us, and we would have been unaware of the internal harmony of the world, which, as we will see, is the only true objective reality.

The best expression of this harmony is law. Law is one of the[Pg 208] most recent conquests of the human mind; there still are people who live in the presence of a perpetual miracle and are not astonished at it. On the contrary, we it is who should be astonished at nature's regularity. Men demand of their gods to prove their existence by miracles; but the eternal marvel is that there are not miracles without cease. The world is divine because it is a harmony. If it were ruled by caprice, what could prove to us it was not ruled by chance?

The best expression of this harmony is law. Law is one of the[Pg 208] most recent achievements of the human mind; there are still people who live with a constant miracle around them and aren't amazed by it. On the contrary, we should be the ones who are amazed by nature's consistency. People expect their gods to prove their existence through miracles; but the true wonder is that there aren't miracles happening all the time. The world is divine because it is a harmony. If it were governed by whim, how could we be sure it wasn't just random?

This conquest of law we owe to astronomy, and just this makes the grandeur of the science rather than the material grandeur of the objects it considers. It was altogether natural, then, that celestial mechanics should be the first model of mathematical physics; but since then this science has developed; it is still developing, even rapidly developing. And it is already necessary to modify in certain points the scheme from which I drew two chapters of 'Science and Hypothesis.' In an address at the St. Louis exposition, I sought to survey the road traveled; the result of this investigation the reader shall see farther on.

This achievement in law is thanks to astronomy, and this is what makes the greatness of the science, more so than the physical grandeur of the objects it studies. It was completely natural for celestial mechanics to be the first model of mathematical physics; however, this science has evolved since then and continues to grow, even quickly. It's already necessary to adjust some aspects of the framework I used in the two chapters of 'Science and Hypothesis.' In a talk at the St. Louis exposition, I aimed to review the progress made; the reader will see the results of this exploration later on.

The progress of science has seemed to imperil the best established principles, those even which were regarded as fundamental. Yet nothing shows they will not be saved; and if this comes about only imperfectly, they will still subsist even though they are modified. The advance of science is not comparable to the changes of a city, where old edifices are pitilessly torn down to give place to new, but to the continuous evolution of zoologic types which develop ceaselessly and end by becoming unrecognizable to the common sight, but where an expert eye finds always traces of the prior work of the centuries past. One must not think then that the old-fashioned theories have been sterile and vain.

The progress of science has seemed to threaten even the most well-established principles, those that were once considered fundamental. Yet there’s no evidence that they won’t be preserved; and even if this happens imperfectly, they will still exist, albeit in a modified form. The advancement of science is not like the changes in a city, where old buildings are ruthlessly demolished to make way for new ones, but rather like the continuous evolution of animal species, which develop relentlessly and eventually become unrecognizable to the average observer, while an expert can always find traces of the earlier forms from centuries ago. Therefore, one should not think that outdated theories have been fruitless or pointless.

Were we to stop there, we should find in these pages some reasons for confidence in the value of science, but many more for distrusting it; an impression of doubt would remain; it is needful now to set things to rights.

If we were to stop here, we would find some reasons to have confidence in the value of science, but even more reasons to be skeptical about it; a sense of doubt would linger; it's necessary now to rectify this situation.

Some people have exaggerated the rôle of convention in science; they have even gone so far as to say that law, that scientific fact itself, was created by the scientist. This is going much too far in the direction of nominalism. No, scientific laws are not[Pg 209] artificial creations; we have no reason to regard them as accidental, though it be impossible to prove they are not.

Some people have overstated the role of convention in science; they’ve even claimed that scientific laws and facts were created by scientists. This is taking it too far in the direction of nominalism. No, scientific laws are not[Pg 209] artificial creations; we have no reason to view them as random, even though it’s impossible to prove that they aren’t.

Does the harmony the human intelligence thinks it discovers in nature exist outside of this intelligence? No, beyond doubt a reality completely independent of the mind which conceives it, sees or feels it, is an impossibility. A world as exterior as that, even if it existed, would for us be forever inaccessible. But what we call objective reality is, in the last analysis, what is common to many thinking beings, and could be common to all; this common part, we shall see, can only be the harmony expressed by mathematical laws. It is this harmony then which is the sole objective reality, the only truth we can attain; and when I add that the universal harmony of the world is the source of all beauty, it will be understood what price we should attach to the slow and difficult progress which little by little enables us to know it better.

Does the harmony that human intelligence believes it finds in nature exist outside of that intelligence? No, without a doubt, a reality completely independent of the mind that conceives it, sees it, or feels it is impossible. A world as external as that, even if it existed, would be forever out of our reach. However, what we refer to as objective reality is, at its core, what is shared by many thinking beings and could be shared by all; this common aspect, as we shall see, can only be the harmony expressed by mathematical laws. It is this harmony that is the sole objective reality, the only truth we can achieve; and when I add that the universal harmony of the world is the source of all beauty, it will be clear what value we should place on the slow and challenging journey that gradually allows us to understand it better.


PART I

THE MATHEMATICAL SCIENCES


CHAPTER I

Math: Intuition and Logic

I

It is impossible to study the works of the great mathematicians, or even those of the lesser, without noticing and distinguishing two opposite tendencies, or rather two entirely different kinds of minds. The one sort are above all preoccupied with logic; to read their works, one is tempted to believe they have advanced only step by step, after the manner of a Vauban who pushes on his trenches against the place besieged, leaving nothing to chance. The other sort are guided by intuition and at the first stroke make quick but sometimes precarious conquests, like bold cavalrymen of the advance guard.

It’s impossible to study the works of great mathematicians, or even those of lesser-known ones, without noticing and distinguishing between two opposite tendencies, or rather two entirely different types of minds. One type is primarily focused on logic; when you read their works, you might think they’ve moved only step by step, like a military engineer meticulously advancing their trenches against a besieged fortress, leaving nothing to chance. The other type relies on intuition and makes quick but sometimes risky breakthroughs, similar to daring cavalrymen leading the charge.

The method is not imposed by the matter treated. Though one often says of the first that they are analysts and calls the others geometers, that does not prevent the one sort from remaining analysts even when they work at geometry, while the others are still geometers even when they occupy themselves with pure analysis. It is the very nature of their mind which makes them logicians or intuitionalists, and they can not lay it aside when they approach a new subject.

The method isn't dictated by the subject at hand. Even though people often refer to the first group as analysts and the second as geometers, this doesn't stop the analysts from staying analysts when they tackle geometry, or the geometers from being geometers when they engage in pure analysis. It's their mental nature that defines them as logicians or intuitionalists, and they can't put that aside when they dive into a new topic.

Nor is it education which has developed in them one of the two tendencies and stifled the other. The mathematician is born, not made, and it seems he is born a geometer or an analyst. I should like to cite examples and there are surely plenty; but to accentuate the contrast I shall begin with an extreme example, taking the liberty of seeking it in two living mathematicians.[Pg 211]

Nor is it education that has fostered one of the two tendencies and suppressed the other. A mathematician is born, not created, and it appears that they are either born a geometer or an analyst. I’d like to provide examples, and there are certainly many to choose from; however, to highlight the contrast, I'll start with an extreme example, taking the liberty of selecting it from two living mathematicians.[Pg 211]

M. Méray wants to prove that a binomial equation always has a root, or, in ordinary words, that an angle may always be subdivided. If there is any truth that we think we know by direct intuition, it is this. Who could doubt that an angle may always be divided into any number of equal parts? M. Méray does not look at it that way; in his eyes this proposition is not at all evident and to prove it he needs several pages.

M. Méray wants to show that a binomial equation always has a solution, or, in plain language, that an angle can always be divided. If there's one truth we feel we know intuitively, it's this. Who could doubt that an angle can always be split into any number of equal parts? M. Méray sees it differently; for him, this statement is not obvious at all, and he takes several pages to prove it.

On the other hand, look at Professor Klein: he is studying one of the most abstract questions of the theory of functions: to determine whether on a given Riemann surface there always exists a function admitting of given singularities. What does the celebrated German geometer do? He replaces his Riemann surface by a metallic surface whose electric conductivity varies according to certain laws. He connects two of its points with the two poles of a battery. The current, says he, must pass, and the distribution of this current on the surface will define a function whose singularities will be precisely those called for by the enunciation.

On the other hand, take a look at Professor Klein: he is exploring one of the most abstract questions in function theory: whether there is always a function on a given Riemann surface that allows for specific singularities. What does this famous German geometer do? He substitutes his Riemann surface with a metallic surface whose electrical conductivity changes according to certain laws. He connects two points of it to the two poles of a battery. The current, he says, must flow, and the way this current moves across the surface will define a function whose singularities will exactly match those specified in the statement.

Doubtless Professor Klein well knows he has given here only a sketch; nevertheless he has not hesitated to publish it; and he would probably believe he finds in it, if not a rigorous demonstration, at least a kind of moral certainty. A logician would have rejected with horror such a conception, or rather he would not have had to reject it, because in his mind it would never have originated.

Doubtless Professor Klein knows he has only provided a rough outline here; still, he hasn’t hesitated to share it, and he probably thinks he has found, if not a strict proof, at least a form of moral certainty in it. A logician would have rejected such an idea in horror, or rather, it would never have occurred to him in the first place.

Again, permit me to compare two men, the honor of French science, who have recently been taken from us, but who both entered long ago into immortality. I speak of M. Bertrand and M. Hermite. They were scholars of the same school at the same time; they had the same education, were under the same influences; and yet what a difference! Not only does it blaze forth in their writings; it is in their teaching, in their way of speaking, in their very look. In the memory of all their pupils these two faces are stamped in deathless lines; for all who have had the pleasure of following their teaching, this remembrance is still fresh; it is easy for us to evoke it.

Once again, let me compare two great men, the pride of French science, who we recently lost but who have both achieved timelessness. I'm talking about M. Bertrand and M. Hermite. They were students of the same school and time; they received the same education and were shaped by the same influences; and yet, what a contrast! Not only does it shine through in their writings; it’s also evident in their teaching, the way they spoke, and even in their appearance. In the memories of all their students, these two faces are etched in unforgettable detail; for all who had the joy of learning from them, those memories are still vivid and easy for us to recall.

While speaking, M. Bertrand is always in motion; now he seems in combat with some outside enemy, now he outlines with a gesture of the hand the figures he studies. Plainly he sees and he[Pg 212] is eager to paint, this is why he calls gesture to his aid. With M. Hermite, it is just the opposite; his eyes seem to shun contact with the world; it is not without, it is within he seeks the vision of truth.

While speaking, M. Bertrand is always animated; sometimes he seems to be battling an external foe, and other times he uses hand gestures to illustrate the concepts he’s considering. Clearly, he sees what he wants to express and is eager to convey it, which is why he relies on gestures. With M. Hermite, it’s completely the opposite; his eyes seem to avoid engaging with the outside world; he’s not looking outward but inward for his vision of truth.

Among the German geometers of this century, two names above all are illustrious, those of the two scientists who founded the general theory of functions, Weierstrass and Riemann. Weierstrass leads everything back to the consideration of series and their analytic transformations; to express it better, he reduces analysis to a sort of prolongation of arithmetic; you may turn through all his books without finding a figure. Riemann, on the contrary, at once calls geometry to his aid; each of his conceptions is an image that no one can forget, once he has caught its meaning.

Among the German mathematicians of this century, two names stand out: Weierstrass and Riemann, the pioneers of the general theory of functions. Weierstrass approaches everything through the lens of series and their analytic transformations; to put it more clearly, he simplifies analysis to an extension of arithmetic; you can flip through all his books without coming across a single figure. Riemann, on the other hand, immediately brings geometry into the picture; each of his ideas is a striking image that sticks with you once you grasp its meaning.

More recently, Lie was an intuitionalist; this might have been doubted in reading his books, no one could doubt it after talking with him; you saw at once that he thought in pictures. Madame Kovalevski was a logician.

More recently, Lie was an intuitionalist; this might have been doubted while reading his books, but no one could question it after talking with him; you could immediately tell that he thought in pictures. Madame Kovalevski was a logician.

Among our students we notice the same differences; some prefer to treat their problems 'by analysis,' others 'by geometry.' The first are incapable of 'seeing in space,' the others are quickly tired of long calculations and become perplexed.

Among our students, we notice the same differences; some prefer to tackle their problems 'through analysis,' while others prefer 'through geometry.' The first group struggles with 'visualizing in space,' while the second quickly gets tired of lengthy calculations and becomes confused.

The two sorts of minds are equally necessary for the progress of science; both the logicians and the intuitionalists have achieved great things that others could not have done. Who would venture to say whether he preferred that Weierstrass had never written or that there had never been a Riemann? Analysis and synthesis have then both their legitimate rôles. But it is interesting to study more closely in the history of science the part which belongs to each.

The two types of minds are equally important for the advancement of science; both logicians and intuitionalists have accomplished great things that others couldn't have done. Who would dare to say whether they would rather Weierstrass had never written or that Riemann had never existed? Analysis and synthesis both have their rightful roles. However, it’s fascinating to take a closer look at the contributions of each in the history of science.

II

Strange! If we read over the works of the ancients we are tempted to class them all among the intuitionalists. And yet nature is always the same; it is hardly probable that it has begun in this century to create minds devoted to logic. If we could put ourselves into the flow of ideas which reigned in their time, we should recognize that many of the old geometers were in tendency[Pg 213] analysts. Euclid, for example, erected a scientific structure wherein his contemporaries could find no fault. In this vast construction, of which each piece however is due to intuition, we may still to-day, without much effort, recognize the work of a logician.

Strange! When we look at the works of the ancients, we might be inclined to categorize them all as intuitionalists. Yet, nature remains consistent; it’s unlikely that this century has suddenly started to produce minds focused on logic. If we could immerse ourselves in the flow of ideas that existed during their time, we would see that many of the old geometers were, in fact, analysts. Euclid, for instance, built a scientific structure that his contemporaries couldn’t criticize. In this vast creation, where each part is based on intuition, we can still easily recognize the work of a logician today.

It is not minds that have changed, it is ideas; the intuitional minds have remained the same; but their readers have required of them greater concessions.

It’s not the minds that have changed, it’s the ideas; the intuitive minds have stayed the same; but their readers have demanded greater concessions from them.

What is the cause of this evolution? It is not hard to find. Intuition can not give us rigor, nor even certainty; this has been recognized more and more. Let us cite some examples. We know there exist continuous functions lacking derivatives. Nothing is more shocking to intuition than this proposition which is imposed upon us by logic. Our fathers would not have failed to say: "It is evident that every continuous function has a derivative, since every curve has a tangent."

What is behind this evolution? It's not hard to figure out. Intuition can't provide us with precision or even certainty; this has become increasingly clear. Let's look at some examples. We know that there are continuous functions that don't have derivatives. Nothing is more surprising to our intuition than this claim, which logic demands of us. Our ancestors would have undoubtedly said, "It's obvious that every continuous function has a derivative, since every curve has a tangent."

How can intuition deceive us on this point? It is because when we seek to imagine a curve we can not represent it to ourselves without width; just so, when we represent to ourselves a straight line, we see it under the form of a rectilinear band of a certain breadth. We well know these lines have no width; we try to imagine them narrower and narrower and thus to approach the limit; so we do in a certain measure, but we shall never attain this limit. And then it is clear we can always picture these two narrow bands, one straight, one curved, in a position such that they encroach slightly one upon the other without crossing. We shall thus be led, unless warned by a rigorous analysis, to conclude that a curve always has a tangent.

How can our intuition mislead us here? It's because when we try to picture a curve, we can't help but imagine it with some width; similarly, when we visualize a straight line, we see it as a straight band with a certain thickness. We know these lines have no thickness; we attempt to envision them getting narrower and narrower to reach the limit, and to some extent, we do, but we’ll never actually reach that limit. It’s clear that we can always visualize these two narrow bands, one straight and one curved, positioned so that they slightly overlap without crossing. This could lead us, unless we are careful with a thorough analysis, to conclude that a curve always has a tangent.

I shall take as second example Dirichlet's principle on which rest so many theorems of mathematical physics; to-day we establish it by reasoning very rigorous but very long; heretofore, on the contrary, we were content with a very summary proof. A certain integral depending on an arbitrary function can never vanish. Hence it is concluded that it must have a minimum. The flaw in this reasoning strikes us immediately, since we use the abstract term function and are familiar with all the singularities functions can present when the word is understood in the most general sense.[Pg 214]

I will use Dirichlet's principle as a second example, which underpins many theorems in mathematical physics. Nowadays, we prove it through very rigorous but lengthy reasoning; in the past, however, we were satisfied with a much simpler proof. An integral that depends on any arbitrary function can never be zero. Therefore, we conclude that it must have a minimum. The flaw in this reasoning becomes apparent right away, as we use the abstract term function and are aware of all the unique behaviors functions can exhibit when the term is taken in its broadest sense.[Pg 214]

But it would not be the same had we used concrete images, had we, for example, considered this function as an electric potential; it would have been thought legitimate to affirm that electrostatic equilibrium can be attained. Yet perhaps a physical comparison would have awakened some vague distrust. But if care had been taken to translate the reasoning into the language of geometry, intermediate between that of analysis and that of physics, doubtless this distrust would not have been produced, and perhaps one might thus, even to-day, still deceive many readers not forewarned.

But it wouldn’t be the same if we had used concrete images; if we had, for instance, viewed this function as an electric potential, it would have seemed valid to say that electrostatic equilibrium can be achieved. However, a physical comparison might have raised some uncertainty. But if we had made an effort to express the reasoning in the language of geometry, which sits between analysis and physics, this doubt likely wouldn’t have arisen, and perhaps even today, many unsuspecting readers could still be misled.

Intuition, therefore, does not give us certainty. This is why the evolution had to happen; let us now see how it happened.

Intuition, therefore, doesn't provide us with certainty. That's why the evolution had to take place; now let's see how it happened.

It was not slow in being noticed that rigor could not be introduced in the reasoning unless first made to enter into the definitions. For the most part the objects treated of by mathematicians were long ill defined; they were supposed to be known because represented by means of the senses or the imagination; but one had only a crude image of them and not a precise idea on which reasoning could take hold. It was there first that the logicians had to direct their efforts.

It quickly became clear that you couldn't add rigor to reasoning without first incorporating it into the definitions. Most of the concepts dealt with by mathematicians were poorly defined for a long time; they were assumed to be understood because they could be represented through the senses or imagination. However, people only had a rough idea of them rather than a clear concept that reasoning could grasp. This was where logicians needed to focus their efforts first.

So, in the case of incommensurable numbers. The vague idea of continuity, which we owe to intuition, resolved itself into a complicated system of inequalities referring to whole numbers.

So, in the case of incommensurable numbers. The unclear concept of continuity, which we owe to intuition, turned into a complex system of inequalities related to whole numbers.

By that means the difficulties arising from passing to the limit, or from the consideration of infinitesimals, are finally removed. To-day in analysis only whole numbers are left or systems, finite or infinite, of whole numbers bound together by a net of equality or inequality relations. Mathematics, as they say, is arithmetized.

By doing this, the challenges that come from approaching the limit or dealing with infinitesimals are completely eliminated. Nowadays, in analysis, only whole numbers remain, or systems—whether finite or infinite—of whole numbers connected by a network of equality or inequality relationships. As people say, mathematics has been reduced to arithmetic.

III

A first question presents itself. Is this evolution ended? Have we finally attained absolute rigor? At each stage of the evolution our fathers also thought they had reached it. If they deceived themselves, do we not likewise cheat ourselves?

A first question comes to mind. Is this evolution over? Have we finally achieved complete rigor? At every stage of the evolution, our predecessors also believed they had reached it. If they were mistaken, are we not also fooling ourselves?

We believe that in our reasonings we no longer appeal to intuition; the philosophers will tell us this is an illusion. Pure logic could never lead us to anything but tautologies; it could[Pg 215] create nothing new; not from it alone can any science issue. In one sense these philosophers are right; to make arithmetic, as to make geometry, or to make any science, something else than pure logic is necessary. To designate this something else we have no word other than intuition. But how many different ideas are hidden under this same word?

We think that in our reasoning, we no longer rely on intuition; philosophers would argue this is an illusion. Pure logic can only lead us to tautologies; it can[Pg 215] create nothing new, and from it alone, no science can arise. In one way, these philosophers are correct; to develop arithmetic, geometry, or any science, we need something beyond pure logic. The only term we have to describe this something else is intuition. But how many different concepts are concealed under this same term?

Compare these four axioms: (1) Two quantities equal to a third are equal to one another; (2) if a theorem is true of the number 1 and if we prove that it is true of n + 1 if true for n, then will it be true of all whole numbers; (3) if on a straight the point C is between A and B and the point D between A and C, then the point D will be between A and B; (4) through a given point there is not more than one parallel to a given straight.

Compare these four axioms: (1) If two quantities are both equal to a third quantity, then they are equal to each other; (2) if a theorem is true for the number 1, and we can prove that if it’s true for n, then it’s also true for n + 1, it will be true for all whole numbers; (3) if point C is on a straight line between A and B, and point D is between A and C, then point D will be between A and B; (4) through a given point, there is only one line parallel to a given straight line.

All four are attributed to intuition, and yet the first is the enunciation of one of the rules of formal logic; the second is a real synthetic a priori judgment, it is the foundation of rigorous mathematical induction; the third is an appeal to the imagination; the fourth is a disguised definition.

All four are based on intuition, yet the first states one of the rules of formal logic; the second is a genuine synthetic a priori judgment and serves as the basis for strict mathematical induction; the third is an appeal to the imagination; the fourth is a hidden definition.

Intuition is not necessarily founded on the evidence of the senses; the senses would soon become powerless; for example, we can not represent to ourselves a chiliagon, and yet we reason by intuition on polygons in general, which include the chiliagon as a particular case.

Intuition isn't always based on what we can sense; our senses could quickly become inadequate. For instance, we can't visualize a chiliagon, yet we can understand polygons through intuition, which includes the chiliagon as a specific instance.

You know what Poncelet understood by the principle of continuity. What is true of a real quantity, said Poncelet, should be true of an imaginary quantity; what is true of the hyperbola whose asymptotes are real, should then be true of the ellipse whose asymptotes are imaginary. Poncelet was one of the most intuitive minds of this century; he was passionately, almost ostentatiously, so; he regarded the principle of continuity as one of his boldest conceptions, and yet this principle did not rest on the evidence of the senses. To assimilate the hyperbola to the ellipse was rather to contradict this evidence. It was only a sort of precocious and instinctive generalization which, moreover, I have no desire to defend.

You know what Poncelet understood by the principle of continuity. Poncelet said that what is true for a real quantity should also be true for an imaginary quantity; what applies to the hyperbola with real asymptotes should also apply to the ellipse with imaginary asymptotes. Poncelet was one of the most intuitive minds of this century; he was passionately and almost showily so. He saw the principle of continuity as one of his most daring ideas, yet this principle wasn't based on sensory evidence. Comparing the hyperbola to the ellipse actually contradicted that evidence. It was more of an early and instinctive generalization, which, by the way, I don’t feel the need to defend.

We have then many kinds of intuition; first, the appeal to the senses and the imagination; next, generalization by induction, copied, so to speak, from the procedures of the experimental[Pg 216] sciences; finally, we have the intuition of pure number, whence arose the second of the axioms just enunciated, which is able to create the real mathematical reasoning. I have shown above by examples that the first two can not give us certainty; but who will seriously doubt the third, who will doubt arithmetic?

We have many types of intuition. First, there's the appeal to our senses and imagination. Next is generalization through induction, which we borrow from the methods used in experimental sciences. Finally, we have the intuition of pure numbers, which led to the second of the axioms just mentioned, enabling genuine mathematical reasoning. I have already demonstrated with examples that the first two cannot provide us with certainty, but who would genuinely doubt the third? Who would question arithmetic?

Now in the analysis of to-day, when one cares to take the trouble to be rigorous, there can be nothing but syllogisms or appeals to this intuition of pure number, the only intuition which can not deceive us. It may be said that to-day absolute rigor is attained.

Now in today's analysis, when one is willing to put in the effort to be thorough, there is nothing but syllogisms or appeals to this intuition of pure number, the only intuition that cannot mislead us. It can be said that today absolute rigor is achieved.

IV

The philosophers make still another objection: "What you gain in rigor," they say, "you lose in objectivity. You can rise toward your logical ideal only by cutting the bonds which attach you to reality. Your science is infallible, but it can only remain so by imprisoning itself in an ivory tower and renouncing all relation with the external world. From this seclusion it must go out when it would attempt the slightest application."

The philosophers raise another point: "While you gain in precision," they say, "you lose in objectivity. You can only reach your logical ideal by severing your ties to reality. Your science is flawless, but it can only stay that way by isolating itself in an ivory tower and rejecting any connection to the outside world. From this isolation, it must emerge whenever it tries to make even the smallest application."

For example, I seek to show that some property pertains to some object whose concept seems to me at first indefinable, because it is intuitive. At first I fail or must content myself with approximate proofs; finally I decide to give to my object a precise definition, and this enables me to establish this property in an irreproachable manner.

For example, I want to illustrate that a certain property relates to an object whose concept initially feels indefinable to me because it's intuitive. At first, I struggle or have to settle for rough proofs; eventually, I choose to give my object a clear definition, which allows me to establish this property in a solid way.

"And then," say the philosophers, "it still remains to show that the object which corresponds to this definition is indeed the same made known to you by intuition; or else that some real and concrete object whose conformity with your intuitive idea you believe you immediately recognize corresponds to your new definition. Only then could you affirm that it has the property in question. You have only displaced the difficulty."

"And then," say the philosophers, "it still needs to be shown that the object that fits this definition is actually the same one you know intuitively; or that there is some real and tangible object, which you think you can immediately identify as matching your intuitive idea, that aligns with your new definition. Only then can you claim that it has the property in question. You've only shifted the problem."

That is not exactly so; the difficulty has not been displaced, it has been divided. The proposition to be established was in reality composed of two different truths, at first not distinguished. The first was a mathematical truth, and it is now rigorously established. The second was an experimental verity. Experience alone can teach us that some real and concrete object corresponds[Pg 217] or does not correspond to some abstract definition. This second verity is not mathematically demonstrated, but neither can it be, no more than can the empirical laws of the physical and natural sciences. It would be unreasonable to ask more.

That's not quite right; the challenge hasn't been eliminated, it's been separated. The claim we aimed to prove actually consisted of two distinct truths that weren't recognized at first. The first was a mathematical truth, and that's now thoroughly established. The second was an experimental truth. Only through experience can we learn whether some real and tangible object matches or doesn't match a certain abstract definition. This second truth isn't demonstrated mathematically, and it can't be, just like the empirical laws of physical and natural sciences. It would be unreasonable to expect more.

Well, is it not a great advance to have distinguished what long was wrongly confused? Does this mean that nothing is left of this objection of the philosophers? That I do not intend to say; in becoming rigorous, mathematical science takes a character so artificial as to strike every one; it forgets its historical origins; we see how the questions can be answered, we no longer see how and why they are put.

Well, isn't it a significant progress to have clarified what was long misunderstood? Does this mean that the philosophers' objections have completely vanished? That's not what I mean to say; in becoming strict, mathematical science takes on such an artificial character that it becomes striking to everyone; it overlooks its historical origins; we can see how the questions can be answered, but we no longer understand how and why they are raised.

This shows us that logic is not enough; that the science of demonstration is not all science and that intuition must retain its rôle as complement, I was about to say as counterpoise or as antidote of logic.

This shows us that logic alone isn’t enough; that the science of demonstration isn’t the whole of science and that intuition must continue to play its role as a complement, I would almost say as a counterbalance or antidote to logic.

I have already had occasion to insist on the place intuition should hold in the teaching of the mathematical sciences. Without it young minds could not make a beginning in the understanding of mathematics; they could not learn to love it and would see in it only a vain logomachy; above all, without intuition they would never become capable of applying mathematics. But now I wish before all to speak of the rôle of intuition in science itself. If it is useful to the student it is still more so to the creative scientist.

I have already had the chance to stress how important intuition is in teaching math. Without it, young people wouldn’t be able to start understanding mathematics; they wouldn’t learn to appreciate it and would only see it as pointless arguments; most importantly, without intuition, they wouldn’t ever be able to apply math. Now, I want to focus on the role of intuition in science itself. If it’s helpful for students, it’s even more essential for the innovative scientist.

V

We seek reality, but what is reality? The physiologists tell us that organisms are formed of cells; the chemists add that cells themselves are formed of atoms. Does this mean that these atoms or these cells constitute reality, or rather the sole reality? The way in which these cells are arranged and from which results the unity of the individual, is not it also a reality much more interesting than that of the isolated elements, and should a naturalist who had never studied the elephant except by means of the microscope think himself sufficiently acquainted with that animal?

We seek reality, but what is reality? The scientists tell us that organisms are made up of cells; the chemists point out that cells are made of atoms. Does this mean that these atoms or cells are what reality is, or the only reality? The way these cells are organized, which creates the unity of the individual, is it not a reality that is far more interesting than that of the individual elements? Should a naturalist who has only studied the elephant through a microscope consider themselves knowledgeable about that animal?

Well, there is something analogous to this in mathematics. The logician cuts up, so to speak, each demonstration into a very great number of elementary operations; when we have examined these[Pg 218] operations one after the other and ascertained that each is correct, are we to think we have grasped the real meaning of the demonstration? Shall we have understood it even when, by an effort of memory, we have become able to repeat this proof by reproducing all these elementary operations in just the order in which the inventor had arranged them? Evidently not; we shall not yet possess the entire reality; that I know not what, which makes the unity of the demonstration, will completely elude us.

Well, there's something similar to this in mathematics. The logician breaks down each demonstration into a large number of basic operations. Once we've examined these[Pg 218] operations one by one and confirmed that each is correct, should we think we've understood the real meaning of the demonstration? Will we really understand it even if, through effort, we can repeat this proof by recreating all these basic operations in the exact order the creator arranged them? Clearly not; we still won't have the complete picture. That elusive something that gives the demonstration its unity will still escape us.

Pure analysis puts at our disposal a multitude of procedures whose infallibility it guarantees; it opens to us a thousand different ways on which we can embark in all confidence; we are assured of meeting there no obstacles; but of all these ways, which will lead us most promptly to our goal? Who shall tell us which to choose? We need a faculty which makes us see the end from afar, and intuition is this faculty. It is necessary to the explorer for choosing his route; it is not less so to the one following his trail who wants to know why he chose it.

Pure analysis offers us a variety of methods that it promises are foolproof; it presents us with countless paths we can confidently take; we are guaranteed that there will be no obstacles on these paths. But out of all these options, which one will get us to our goal the quickest? Who can guide us in our choice? We need an ability that allows us to see the end from a distance, and that ability is intuition. It's essential for the explorer in selecting their route; it’s equally important for the one who is tracing their steps and wants to understand why they made that choice.

If you are present at a game of chess, it will not suffice, for the understanding of the game, to know the rules for moving the pieces. That will only enable you to recognize that each move has been made conformably to these rules, and this knowledge will truly have very little value. Yet this is what the reader of a book on mathematics would do if he were a logician only. To understand the game is wholly another matter; it is to know why the player moves this piece rather than that other which he could have moved without breaking the rules of the game. It is to perceive the inward reason which makes of this series of successive moves a sort of organized whole. This faculty is still more necessary for the player himself, that is, for the inventor.

If you’re watching a chess game, just knowing the rules for moving the pieces isn’t enough to really understand the game. That knowledge will only help you see that each move follows the rules, and that’s not very useful. This is what someone reading a math book would do if they were only focused on logic. Understanding the game is a whole different thing; it’s about knowing why a player makes this move instead of another one they could have legally made. It’s about grasping the deeper purpose that turns a series of moves into a cohesive strategy. This understanding is even more crucial for the player themselves, who is the one creating the strategy.

Let us drop this comparison and return to mathematics. For example, see what has happened to the idea of continuous function. At the outset this was only a sensible image, for example, that of a continuous mark traced by the chalk on a blackboard. Then it became little by little more refined; ere long it was used to construct a complicated system of inequalities, which reproduced, so to speak, all the lines of the original image; this construction finished, the centering of the arch, so to say, was removed, that crude representation which had temporarily served[Pg 219] as support and which was afterward useless was rejected; there remained only the construction itself, irreproachable in the eyes of the logician. And yet if the primitive image had totally disappeared from our recollection, how could we divine by what caprice all these inequalities were erected in this fashion one upon another?

Let’s move away from this comparison and get back to mathematics. For instance, look at how the concept of a continuous function has evolved. Initially, it was just a simple idea, like a continuous line drawn with chalk on a blackboard. Over time, it became more sophisticated; soon it was used to create a complex system of inequalities that represented, so to speak, all the lines of the original image. Once this construction was complete, the central framework, which had been a rough representation serving as a temporary support, was discarded. What remained was only the construction itself, flawless in the eyes of the logician. Yet, if the original image had completely faded from our memory, how would we ever figure out the reasoning behind stacking all these inequalities in such a way?

Perhaps you think I use too many comparisons; yet pardon still another. You have doubtless seen those delicate assemblages of silicious needles which form the skeleton of certain sponges. When the organic matter has disappeared, there remains only a frail and elegant lace-work. True, nothing is there except silica, but what is interesting is the form this silica has taken, and we could not understand it if we did not know the living sponge which has given it precisely this form. Thus it is that the old intuitive notions of our fathers, even when we have abandoned them, still imprint their form upon the logical constructions we have put in their place.

Maybe you think I rely on too many comparisons; still, I ask you to forgive one more. You’ve probably seen those delicate arrangements of silicious needles that create the skeleton of certain sponges. Once the organic material has faded away, what remains is just a fragile and elegant lacework. Sure, it’s just silica left behind, but what’s interesting is the shape that this silica has taken, and we wouldn’t really grasp it if we didn’t know about the living sponge that originally gave it that specific form. In the same way, the old intuitive ideas from our ancestors, even when we’ve moved on from them, still leave their mark on the logical frameworks we’ve created in their place.

This view of the aggregate is necessary for the inventor; it is equally necessary for whoever wishes really to comprehend the inventor. Can logic give it to us? No; the name mathematicians give it would suffice to prove this. In mathematics logic is called analysis and analysis means division, dissection. It can have, therefore, no tool other than the scalpel and the microscope.

This perspective on the whole is essential for the inventor; it’s just as crucial for anyone who truly wants to understand the inventor. Can logic provide this? No; the term that mathematicians use is enough to show this. In mathematics, logic is referred to as analysis and analysis means division, dissection. Therefore, it can only rely on tools like the scalpel and the microscope.

Thus logic and intuition have each their necessary rôle. Each is indispensable. Logic, which alone can give certainty, is the instrument of demonstration; intuition is the instrument of invention.

Thus, logic and intuition each have their essential role. Each is crucial. Logic, which alone can provide certainty, is the tool for demonstration; intuition is the tool for invention.

VI

But at the moment of formulating this conclusion I am seized with scruples. At the outset I distinguished two kinds of mathematical minds, the one sort logicians and analysts, the others intuitionalists and geometers. Well, the analysts also have been inventors. The names I have just cited make my insistence on this unnecessary.

But as I wrap up this conclusion, I’m hit with doubts. At the start, I identified two types of mathematical thinkers: the logicians and analysts, and the intuitionalists and geometers. However, the analysts have also been innovators. The names I’ve just mentioned make it clear that I don’t need to stress this further.

Here is a contradiction, at least apparently, which needs explanation. And first, do you think these logicians have always proceeded from the general to the particular, as the rules of formal[Pg 220] logic would seem to require of them? Not thus could they have extended the boundaries of science; scientific conquest is to be made only by generalization.

Here is a contradiction, at least seemingly, that needs explaining. First, do you think these logicians have always moved from the general to the specific, as the rules of formal[Pg 220] logic would suggest they should? They couldn’t have expanded the limits of science this way; scientific progress can only be achieved through generalization.

In one of the chapters of 'Science and Hypothesis,' I have had occasion to study the nature of mathematical reasoning, and I have shown how this reasoning, without ceasing to be absolutely rigorous, could lift us from the particular to the general by a procedure I have called mathematical induction. It is by this procedure that the analysts have made science progress, and if we examine the detail itself of their demonstrations, we shall find it there at each instant beside the classic syllogism of Aristotle. We, therefore, see already that the analysts are not simply makers of syllogisms after the fashion of the scholastics.

In one of the chapters of 'Science and Hypothesis,' I've had the chance to explore the nature of mathematical reasoning. I've demonstrated how this reasoning, while remaining completely rigorous, can take us from specifics to general concepts through a method I've called mathematical induction. This method is how analysts have advanced science, and if we look closely at their proofs, we'll find it consistently alongside Aristotle's classic syllogism. So, we can see that analysts are not just creators of syllogisms like the scholastics.

Besides, do you think they have always marched step by step with no vision of the goal they wished to attain? They must have divined the way leading thither, and for that they needed a guide. This guide is, first, analogy. For example, one of the methods of demonstration dear to analysts is that founded on the employment of dominant functions. We know it has already served to solve a multitude of problems; in what consists then the rôle of the inventor who wishes to apply it to a new problem? At the outset he must recognize the analogy of this question with those which have already been solved by this method; then he must perceive in what way this new question differs from the others, and thence deduce the modifications necessary to apply to the method.

Besides, do you think they always moved forward without a clear vision of their goal? They must have figured out the way to get there, and for that, they needed guidance. This guide is, first, analogy. For instance, one of the demonstration methods favored by analysts relies on using dominant functions. We know it has already helped solve many problems; so what’s the role of the inventor who wants to use it for a new problem? Initially, they need to recognize the similarities between this question and those that have already been solved with this method; then they must identify how this new question differs from the others, and from that, figure out the adjustments needed to apply the method.

But how does one perceive these analogies and these differences? In the example just cited they are almost always evident, but I could have found others where they would have been much more deeply hidden; often a very uncommon penetration is necessary for their discovery. The analysts, not to let these hidden analogies escape them, that is, in order to be inventors, must, without the aid of the senses and imagination, have a direct sense of what constitutes the unity of a piece of reasoning, of what makes, so to speak, its soul and inmost life.

But how do we recognize these similarities and differences? In the example just mentioned, they are usually clear, but I could have found others where they would be much harder to find; often, a very uncommon insight is needed to uncover them. The analysts, in order to grasp these hidden similarities and truly be innovators, must, without relying on the senses and imagination, have a direct understanding of what makes a piece of reasoning whole, what gives it, so to speak, its essence and deepest vitality.

When one talked with M. Hermite, he never evoked a sensuous image, and yet you soon perceived that the most abstract entities were for him like living beings. He did not see them, but he[Pg 221] perceived that they are not an artificial assemblage and that they have some principle of internal unity.

When you spoke with M. Hermite, he never conjured a sensual image, yet you quickly realized that the most abstract concepts felt to him like living beings. He couldn't see them, but he[Pg 221] sensed that they weren't just a random collection and that they held some fundamental internal unity.

But, one will say, that still is intuition. Shall we conclude that the distinction made at the outset was only apparent, that there is only one sort of mind and that all the mathematicians are intuitionalists, at least those who are capable of inventing?

But, some might argue, that’s still intuition. Should we conclude that the distinction made at the beginning was just an illusion, that there’s only one type of mind, and that all mathematicians are intuitive thinkers, at least those who can create?

No, our distinction corresponds to something real. I have said above that there are many kinds of intuition. I have said how much the intuition of pure number, whence comes rigorous mathematical induction, differs from sensible intuition to which the imagination, properly so called, is the principal contributor.

No, our distinction relates to something real. I mentioned earlier that there are various kinds of intuition. I explained how the intuition of pure number, which is the basis for strict mathematical induction, differs from sensory intuition, where imagination plays the main role.

Is the abyss which separates them less profound than it at first appeared? Could we recognize with a little attention that this pure intuition itself could not do without the aid of the senses? This is the affair of the psychologist and the metaphysician and I shall not discuss the question. But the thing's being doubtful is enough to justify me in recognizing and affirming an essential difference between the two kinds of intuition; they have not the same object and seem to call into play two different faculties of our soul; one would think of two search-lights directed upon two worlds strangers to one another.

Is the gap that separates them less deep than it initially seemed? Could we realize with a bit of attention that this pure intuition itself can't do without the help of the senses? This is the matter for psychologists and metaphysicians, and I won't get into that. But the fact that it's uncertain is enough for me to acknowledge and state a fundamental difference between the two types of intuition; they don’t focus on the same thing and seem to engage two different faculties of our soul; it’s as if there are two searchlights shining on two completely separate worlds.

It is the intuition of pure number, that of pure logical forms, which illumines and directs those we have called analysts. This it is which enables them not alone to demonstrate, but also to invent. By it they perceive at a glance the general plan of a logical edifice, and that too without the senses appearing to intervene. In rejecting the aid of the imagination, which, as we have seen, is not always infallible, they can advance without fear of deceiving themselves. Happy, therefore, are those who can do without this aid! We must admire them; but how rare they are!

It’s the intuition of pure numbers and logical forms that illuminates and guides those we call analysts. This is what allows them not just to demonstrate but also to create. They can see the overall structure of a logical framework at a glance, without the senses getting in the way. By rejecting the help of the imagination, which we’ve seen isn’t always reliable, they can move forward without the fear of misleading themselves. So, those who can manage without this help are fortunate! We should admire them, but they are so rare!

Among the analysts there will then be inventors, but they will be few. The majority of us, if we wished to see afar by pure intuition alone, would soon feel ourselves seized with vertigo. Our weakness has need of a staff more solid, and, despite the exceptions of which we have just spoken, it is none the less true that sensible intuition is in mathematics the most usual instrument of invention.

Among the analysts, there will be inventors, but they will be rare. Most of us, if we tried to see far ahead purely by intuition, would quickly feel overwhelmed. Our limitations require a more solid foundation, and, despite the exceptions we've just mentioned, it's still true that sensible intuition is the most common tool for invention in mathematics.

Apropos of these reflections, a question comes up that I have[Pg 222] not the time either to solve or even to enunciate with the developments it would admit of. Is there room for a new distinction, for distinguishing among the analysts those who above all use pure intuition and those who are first of all preoccupied with formal logic?

A related question arises that I don’t have the time to fully explore or even to articulate with all the implications it would suggest. Is there a place for a new distinction, to differentiate between analysts who primarily rely on pure intuition and those who focus mainly on formal logic?

M. Hermite, for example, whom I have just cited, can not be classed among the geometers who make use of the sensible intuition; but neither is he a logician, properly so called. He does not conceal his aversion to purely deductive procedures which start from the general and end in the particular.

M. Hermite, for example, whom I have just mentioned, cannot be categorized as one of the geometers who rely on practical intuition; however, he is also not a logician in the strict sense. He doesn't hide his dislike for purely deductive methods that start with the general and conclude with the specific.


CHAPTER II

Measuring Time

I

So long as we do not go outside the domain of consciousness, the notion of time is relatively clear. Not only do we distinguish without difficulty present sensation from the remembrance of past sensations or the anticipation of future sensations, but we know perfectly well what we mean when we say that of two conscious phenomena which we remember, one was anterior to the other; or that, of two foreseen conscious phenomena, one will be anterior to the other.

As long as we stay within the realm of consciousness, the idea of time is pretty clear. Not only do we easily tell the difference between what we feel right now and our memories of what we felt before or what we expect to feel later, but we also clearly understand what we mean when we say that one memory of a conscious experience happened before another; or that, among two anticipated conscious experiences, one will happen before the other.

When we say that two conscious facts are simultaneous, we mean that they profoundly interpenetrate, so that analysis can not separate them without mutilating them.

When we say that two conscious experiences happen at the same time, we mean that they deeply influence each other, making it impossible to analyze them separately without losing their essence.

The order in which we arrange conscious phenomena does not admit of any arbitrariness. It is imposed upon us and of it we can change nothing.

The way we organize our conscious experiences isn’t random. It’s determined for us, and we can’t change it.

I have only a single observation to add. For an aggregate of sensations to have become a remembrance capable of classification in time, it must have ceased to be actual, we must have lost the sense of its infinite complexity, otherwise it would have remained present. It must, so to speak, have crystallized around a center of associations of ideas which will be a sort of label. It is only when they thus have lost all life that we can classify our memories in time as a botanist arranges dried flowers in his herbarium.

I only have one observation to add. For a bunch of sensations to turn into a memory we can categorize over time, it has to stop being actual; we must lose the sense of its endless complexity, or else it would still feel present. It has, in a way, had to crystallize around a central idea to create a sort of label. It's only when these sensations lose their vitality that we can organize our memories in time, just like a botanist sorts dried flowers in a herbarium.

But these labels can only be finite in number. On that score, psychologic time should be discontinuous. Whence comes the feeling that between any two instants there are others? We arrange our recollections in time, but we know that there remain empty compartments. How could that be, if time were not a form pre-existent in our minds? How could we know there were empty compartments, if these compartments were revealed to us only by their content?

But these labels can only be limited in number. In that sense, psychological time should be discontinuous. Where does the feeling come from that there are other moments between any two instants? We organize our memories over time, but we know there are still empty spaces. How is that possible if time isn’t a concept that already exists in our minds? How could we recognize there are empty spaces if those spaces were only shown to us through their contents?

II

But that is not all; into this form we wish to put not only the phenomena of our own consciousness, but those of which other consciousnesses are the theater. But more, we wish to put there physical facts, these I know not what with which we people space and which no consciousness sees directly. This is necessary because without it science could not exist. In a word, psychologic time is given to us and must needs create scientific and physical time. There the difficulty begins, or rather the difficulties, for there are two.

But that's not all; we want to include not only the experiences of our own consciousness but also those that belong to other consciousnesses. Furthermore, we aim to incorporate physical facts—those things that we use to make sense of space and that no consciousness directly perceives. This is essential because science wouldn’t exist without it. In short, psychological time is given to us and must to some extent create scientific and physical time. That's where the challenges start, or rather, the challenges, since there are two.

Think of two consciousnesses, which are like two worlds impenetrable one to the other. By what right do we strive to put them into the same mold, to measure them by the same standard? Is it not as if one strove to measure length with a gram or weight with a meter? And besides, why do we speak of measuring? We know perhaps that some fact is anterior to some other, but not by how much it is anterior.

Think of two minds, which are like two worlds that can’t be penetrated by the other. What gives us the right to try to make them fit into the same mold, to judge them by the same standard? Isn’t it like trying to measure length with a gram or weight with a meter? Plus, why do we even talk about measuring? We might know that one fact comes before another, but not by how much it comes before.

Therefore two difficulties: (1) Can we transform psychologic time, which is qualitative, into a quantitative time? (2) Can we reduce to one and the same measure facts which transpire in different worlds?

Therefore two difficulties: (1) Can we change psychological time, which is qualitative, into a quantitative time? (2) Can we measure events that happen in different worlds with the same standard?

III

The first difficulty has long been noticed; it has been the subject of long discussions and one may say the question is settled. We have not a direct intuition of the equality of two intervals of time. The persons who believe they possess this intuition are dupes of an illusion. When I say, from noon to one the same time passes as from two to three, what meaning has this affirmation?

The first difficulty has been recognized for a long time; it has been the topic of extensive discussions, and one could argue that the issue is resolved. We don’t have a direct understanding of the equality of two time intervals. Those who think they have this understanding are falling for an illusion. When I say that the same amount of time passes from noon to one as from two to three, what does this statement actually mean?

The least reflection shows that by itself it has none at all. It will only have that which I choose to give it, by a definition which will certainly possess a certain degree of arbitrariness. Psychologists could have done without this definition; physicists and astronomers could not; let us see how they have managed.

The slightest reflection reveals that it has no inherent meaning. It will only have what I decide to assign to it, based on a definition that will definitely have some level of arbitrariness. Psychologists could have done without this definition; physicists and astronomers couldn’t. Let’s see how they’ve dealt with it.

To measure time they use the pendulum and they suppose by definition that all the beats of this pendulum are of equal duration. But this is only a first approximation; the temperature, the resistance of the air, the barometric pressure, make the pace[Pg 225] of the pendulum vary. If we could escape these sources of error, we should obtain a much closer approximation, but it would still be only an approximation. New causes, hitherto neglected, electric, magnetic or others, would introduce minute perturbations.

To measure time, they use a pendulum and assume by definition that all the beats of the pendulum are the same length. But this is just a rough estimate; temperature, air resistance, and barometric pressure cause the pendulum's timing to change[Pg 225]. If we could eliminate these sources of error, we would get a much more accurate result, but it would still be an estimate. New factors, previously overlooked, like electrical or magnetic influences, would introduce tiny variations.

In fact, the best chronometers must be corrected from time to time, and the corrections are made by the aid of astronomic observations; arrangements are made so that the sidereal clock marks the same hour when the same star passes the meridian. In other words, it is the sidereal day, that is, the duration of the rotation of the earth, which is the constant unit of time. It is supposed, by a new definition substituted for that based on the beats of the pendulum, that two complete rotations of the earth about its axis have the same duration.

In fact, even the best clocks need to be adjusted occasionally, and these adjustments are made using astronomical observations. Systems are set up so that the sidereal clock shows the same hour when the same star crosses the meridian. In other words, it’s the sidereal day—the time it takes for the Earth to complete one rotation on its axis—that serves as the constant unit of time. According to a new definition that replaces the one based on the swings of a pendulum, two full rotations of the Earth around its axis are believed to have the same duration.

However, the astronomers are still not content with this definition. Many of them think that the tides act as a check on our globe, and that the rotation of the earth is becoming slower and slower. Thus would be explained the apparent acceleration of the motion of the moon, which would seem to be going more rapidly than theory permits because our watch, which is the earth, is going slow.

However, the astronomers are still not satisfied with this definition. Many of them believe that the tides play a role in moderating our planet, and that the Earth's rotation is gradually slowing down. This could explain the apparent acceleration of the moon's motion, which seems to be moving faster than theory suggests because our timekeeper, the Earth, is lagging behind.

IV

All this is unimportant, one will say; doubtless our instruments of measurement are imperfect, but it suffices that we can conceive a perfect instrument. This ideal can not be reached, but it is enough to have conceived it and so to have put rigor into the definition of the unit of time.

All this seems unimportant, one might argue; sure, our measuring tools aren't perfect, but it's enough that we can imagine a perfect tool. We may never achieve this ideal, but just having the idea helps us define the unit of time more precisely.

The trouble is that there is no rigor in the definition. When we use the pendulum to measure time, what postulate do we implicitly admit? It is that the duration of two identical phenomena is the same; or, if you prefer, that the same causes take the same time to produce the same effects.

The problem is that there’s no clarity in the definition. When we use the pendulum to keep time, what assumption are we implicitly making? It’s that the duration of two identical events is the same; or, if you prefer, that the same causes take the same amount of time to produce the same effects.

And at first blush, this is a good definition of the equality of two durations. But take care. Is it impossible that experiment may some day contradict our postulate?

And at first glance, this is a solid definition of the equality of two durations. But be cautious. Is it really impossible that experiments might someday challenge our assumption?

Let me explain myself. I suppose that at a certain place in the world the phenomenon α happens, causing as consequence at the end of a certain time the effect α´. At another place in the world[Pg 226] very far away from the first, happens the phenomenon β, which causes as consequence the effect β´. The phenomena α and β are simultaneous, as are also the effects α´ and β´.

Let me clarify. I believe that in one part of the world, phenomenon α occurs, which leads to effect α' after some time. In a different part of the world[Pg 226] far away from the first, phenomenon β takes place, resulting in effect β'. The phenomena α and β happen at the same time, just like effects α' and β'.

Later, the phenomenon α is reproduced under approximately the same conditions as before, and simultaneously the phenomenon β is also reproduced at a very distant place in the world and almost under the same circumstances. The effects α´ and β´ also take place. Let us suppose that the effect α´ happens perceptibly before the effect β´.

Later, phenomenon α is recreated under roughly the same conditions as before, and at the same time phenomenon β is also recreated in a very distant part of the world and nearly under the same circumstances. Effects α´ and β´ also occur. Let's assume that effect α´ happens noticeably before effect β´.

If experience made us witness such a sight, our postulate would be contradicted. For experience would tell us that the first duration αα´ is equal to the first duration ββ´ and that the second duration αα´ is less than the second duration ββ´. On the other hand, our postulate would require that the two durations αα´ should be equal to each other, as likewise the two durations ββ´. The equality and the inequality deduced from experience would be incompatible with the two equalities deduced from the postulate.

If our experiences led us to see such a thing, our assumption would be proven wrong. Experience would indicate that the first time period αα´ is equal to the first time period ββ´ and that the second time period αα´ is shorter than the second time period ββ´. On the flip side, our assumption would demand that the two time periods αα´ be equal to each other, just like the two time periods ββ´. The equality and inequality observed from experience would clash with the two equalities inferred from the assumption.

Now can we affirm that the hypotheses I have just made are absurd? They are in no wise contrary to the principle of contradiction. Doubtless they could not happen without the principle of sufficient reason seeming violated. But to justify a definition so fundamental I should prefer some other guarantee.

Now can we say that the hypotheses I've just made are ridiculous? They're not at all against the principle of contradiction. Of course, they couldn't occur without giving the impression that the principle of sufficient reason is being ignored. But to support such a fundamental definition, I would rather have some other assurance.

V

But that is not all. In physical reality one cause does not produce a given effect, but a multitude of distinct causes contribute to produce it, without our having any means of discriminating the part of each of them.

But that's not all. In physical reality, one cause doesn’t lead to a specific effect; instead, a variety of different causes come together to create it, and we have no way of distinguishing the contribution of each one.

Physicists seek to make this distinction; but they make it only approximately, and, however they progress, they never will make it except approximately. It is approximately true that the motion of the pendulum is due solely to the earth's attraction; but in all rigor every attraction, even of Sirius, acts on the pendulum.

Physicists try to make this distinction, but they only get close to it, and no matter how far they get, it will always be just an approximation. It’s mostly true that the pendulum's motion is caused only by the earth's pull; however, technically speaking, every attraction, even that of Sirius, affects the pendulum.

Under these conditions, it is clear that the causes which have produced a certain effect will never be reproduced except approximately. Then we should modify our postulate and our[Pg 227] definition. Instead of saying: 'The same causes take the same time to produce the same effects,' we should say: 'Causes almost identical take almost the same time to produce almost the same effects.'

Under these conditions, it's clear that the causes that created a certain effect will never occur in exactly the same way again. So we need to change our assumption and our[Pg 227] definition. Instead of saying, "The same causes take the same time to produce the same effects," we should say, "Causes that are almost identical take almost the same time to produce almost the same effects."

Our definition therefore is no longer anything but approximate. Besides, as M. Calinon very justly remarks in a recent memoir:[7]

Our definition is now just an approximation. Besides, as M. Calinon rightly points out in a recent paper:[7]

One of the circumstances of any phenomenon is the velocity of the earth's rotation; if this velocity of rotation varies, it constitutes in the reproduction of this phenomenon a circumstance which no longer remains the same. But to suppose this velocity of rotation constant is to suppose that we know how to measure time.

One factor in any phenomenon is how fast the Earth rotates; if this rotation speed changes, it affects the way this phenomenon is reproduced and no longer stays the same. However, assuming this rotation speed is constant means we believe we know how to measure time.

Our definition is therefore not yet satisfactory; it is certainly not that which the astronomers of whom I spoke above implicitly adopt, when they affirm that the terrestrial rotation is slowing down.

Our definition isn’t quite satisfactory yet; it’s definitely not the one that the astronomers I mentioned earlier implicitly use when they claim that the Earth’s rotation is slowing down.

What meaning according to them has this affirmation? We can only understand it by analyzing the proofs they give of their proposition. They say first that the friction of the tides producing heat must destroy vis viva. They invoke therefore the principle of vis viva, or of the conservation of energy.

What do they mean by this statement? We can only understand it by looking at the evidence they provide for their claim. They first argue that the friction of the tides creates heat, which must reduce vis viva. They therefore refer to the principle of vis viva, or the conservation of energy.

They say next that the secular acceleration of the moon, calculated according to Newton's law, would be less than that deduced from observations unless the correction relative to the slowing down of the terrestrial rotation were made. They invoke therefore Newton's law. In other words, they define duration in the following way: time should be so defined that Newton's law and that of vis viva may be verified. Newton's law is an experimental truth; as such it is only approximate, which shows that we still have only a definition by approximation.

They say next that the secular acceleration of the moon, calculated according to Newton's law, would be less than what we get from observations unless we account for the slowing down of Earth's rotation. They reference Newton's law, meaning they define time like this: time should be defined in a way that confirms both Newton's law and the law of vis viva. Newton's law is an experimental truth; as such, it is only approximate, which indicates that we still only have a definition by approximation.

If now it be supposed that another way of measuring time is adopted, the experiments on which Newton's law is founded would none the less have the same meaning. Only the enunciation of the law would be different, because it would be translated into another language; it would evidently be much less simple. So that the definition implicitly adopted by the astronomers may be summed up thus: Time should be so defined that[Pg 228] the equations of mechanics may be as simple as possible. In other words, there is not one way of measuring time more true than another; that which is generally adopted is only more convenient. Of two watches, we have no right to say that the one goes true, the other wrong; we can only say that it is advantageous to conform to the indications of the first.

If we now assume that a different method of measuring time is used, the experiments that form the basis of Newton's law would still have the same significance. The statement of the law would just be phrased differently, as it would be translated into another language; it would clearly be much less straightforward. Thus, the definition implicitly accepted by astronomers can be summarized as follows: Time should be defined in a way that[Pg 228] keeps the equations of mechanics as simple as possible. In other words, there isn't one method of measuring time that's more accurate than another; the one that is commonly used is simply more convenient. Among two clocks, we can't claim that one is right and the other is wrong; we can only say that it's beneficial to follow the readings of the first.

The difficulty which has just occupied us has been, as I have said, often pointed out; among the most recent works in which it is considered, I may mention, besides M. Calinon's little book, the treatise on mechanics of Andrade.

The challenge we've just discussed has, as I mentioned, been highlighted frequently; among the latest works that address it, I can point out, in addition to M. Calinon's brief book, Andrade's treatise on mechanics.

VI

The second difficulty has up to the present attracted much less attention; yet it is altogether analogous to the preceding; and even, logically, I should have spoken of it first.

The second difficulty has received much less attention so far; however, it is completely similar to the first one; and logically, I should have discussed it first.

Two psychological phenomena happen in two different consciousnesses; when I say they are simultaneous, what do I mean? When I say that a physical phenomenon, which happens outside of every consciousness, is before or after a psychological phenomenon, what do I mean?

Two psychological phenomena occur in two different states of mind; when I say they happen at the same time, what do I mean? When I say that a physical event, which occurs outside of any consciousness, comes before or after a psychological event, what do I mean?

In 1572, Tycho Brahe noticed in the heavens a new star. An immense conflagration had happened in some far distant heavenly body; but it had happened long before; at least two hundred years were necessary for the light from that star to reach our earth. This conflagration therefore happened before the discovery of America. Well, when I say that; when, considering this gigantic phenomenon, which perhaps had no witness, since the satellites of that star were perhaps uninhabited, I say this phenomenon is anterior to the formation of the visual image of the isle of Española in the consciousness of Christopher Columbus, what do I mean?

In 1572, Tycho Brahe observed a new star in the sky. A massive explosion had occurred in some distant celestial body; however, it took place long before, at least two hundred years before its light reached Earth. This explosion, therefore, occurred before the discovery of America. When I say this—when I consider this enormous phenomenon, which might have had no witnesses since the moons of that star were likely uninhabited—I mean that this event happened before Christopher Columbus formed the visual image of the island of Española in his mind. What do I mean by that?

A little reflection is sufficient to understand that all these affirmations have by themselves no meaning. They can have one only as the outcome of a convention.

A little reflection is enough to realize that all these statements have no meaning on their own. They only gain meaning through a convention.

VII

We should first ask ourselves how one could have had the idea of putting into the same frame so many worlds impenetrable to[Pg 229] one another. We should like to represent to ourselves the external universe, and only by so doing could we feel that we understood it. We know we never can attain this representation: our weakness is too great. But at least we desire the ability to conceive an infinite intelligence for which this representation could be possible, a sort of great consciousness which should see all, and which should classify all in its time, as we classify, in our time, the little we see.

We should first ask ourselves how anyone could think to fit so many worlds that are completely separate from each other into the same framework. We want to imagine the external universe, and by doing so, we feel like we can understand it. We know we can never truly achieve this representation; our limitations are too significant. But at the very least, we wish to have the ability to envision an infinite intelligence for which this representation could be possible, a kind of vast consciousness that sees everything and organizes it all in its time, just as we sort the little we see in our time.

This hypothesis is indeed crude and incomplete, because this supreme intelligence would be only a demigod; infinite in one sense, it would be limited in another, since it would have only an imperfect recollection of the past; and it could have no other, since otherwise all recollections would be equally present to it and for it there would be no time. And yet when we speak of time, for all which happens outside of us, do we not unconsciously adopt this hypothesis; do we not put ourselves in the place of this imperfect god; and do not even the atheists put themselves in the place where god would be if he existed?

This hypothesis is definitely simplistic and not complete, because this supreme intelligence would only be a demigod; infinite in one way, it would be limited in another, as it would only have an imperfect memory of the past; and it couldn't have anything else, or else all memories would be equally present to it, and there would be no time for it. Yet, when we talk about time in relation to everything happening outside of us, don't we unconsciously adopt this hypothesis; don't we place ourselves in the role of this imperfect god; and don't even atheists position themselves where god would be if he did exist?

What I have just said shows us, perhaps, why we have tried to put all physical phenomena into the same frame. But that can not pass for a definition of simultaneity, since this hypothetical intelligence, even if it existed, would be for us impenetrable. It is therefore necessary to seek something else.

What I've just said might explain why we've tried to fit all physical phenomena into the same framework. However, that can't really be considered a definition of simultaneity, since this hypothetical intelligence, even if it existed, would be unintelligible to us. So, we need to look for something else.

VIII

The ordinary definitions which are proper for psychologic time would suffice us no more. Two simultaneous psychologic facts are so closely bound together that analysis can not separate without mutilating them. Is it the same with two physical facts? Is not my present nearer my past of yesterday than the present of Sirius?

The usual definitions that work for psychological time aren't enough for us anymore. Two simultaneous psychological events are so closely linked that analyzing them would be like tearing them apart. Is it the same for two physical events? Isn’t my present moment closer to my past from yesterday than it is to the present of Sirius?

It has also been said that two facts should be regarded as simultaneous when the order of their succession may be inverted at will. It is evident that this definition would not suit two physical facts which happen far from one another, and that, in what concerns them, we no longer even understand what this reversibility would be; besides, succession itself must first be defined.

It has also been said that two facts should be seen as happening at the same time when you can switch their order whenever you want. It's clear that this definition wouldn't apply to two physical facts that occur far apart from each other, and in that case, we no longer even know what this reversibility would mean; moreover, succession itself needs to be defined first.

IX

Let us then seek to give an account of what is understood by simultaneity or antecedence, and for this let us analyze some examples.

Let’s try to explain what we mean by simultaneity or antecedence, and to do this, let’s look at some examples.

I write a letter; it is afterward read by the friend to whom I have addressed it. There are two facts which have had for their theater two different consciousnesses. In writing this letter I have had the visual image of it, and my friend has had in his turn this same visual image in reading the letter. Though these two facts happen in impenetrable worlds, I do not hesitate to regard the first as anterior to the second, because I believe it is its cause.

I write a letter; later, my friend reads it. There are two events happening in two different minds. While I’m writing this letter, I have a mental picture of it, and my friend experiences that same mental picture when reading it. Even though these two events occur in separate realms, I don’t hesitate to see the first as coming before the second because I think it’s the reason for it.

I hear thunder, and I conclude there has been an electric discharge; I do not hesitate to consider the physical phenomenon as anterior to the auditory image perceived in my consciousness, because I believe it is its cause.

I hear thunder, and I figure there has been a lightning strike; I don't hesitate to think of the physical event as happening before the sound I notice in my mind, because I believe it causes it.

Behold then the rule we follow, and the only one we can follow: when a phenomenon appears to us as the cause of another, we regard it as anterior. It is therefore by cause that we define time; but most often, when two facts appear to us bound by a constant relation, how do we recognize which is the cause and which the effect? We assume that the anterior fact, the antecedent, is the cause of the other, of the consequent. It is then by time that we define cause. How save ourselves from this petitio principii?

Here’s the rule we follow, and the only one we can follow: when we see one event as the cause of another, we consider it to be earlier in time. That's how we define time based on cause. However, often when two events seem to be constantly linked, how do we figure out which one is the cause and which one is the effect? We assume that the earlier event, the one that comes first, is the cause of the later one, the effect. So, we end up defining cause through time. How do we avoid this petitio principii?

We say now post hoc, ergo propter hoc; now propter hoc, ergo post hoc; shall we escape from this vicious circle?

We say now after this, therefore because of this; now because of this, therefore after this; will we break free from this endless cycle?

X

Let us see, not how we succeed in escaping, for we do not completely succeed, but how we try to escape.

Let’s look at not how we fully escape, because we don’t really succeed, but how we attempt to escape.

I execute a voluntary act A and I feel afterward a sensation D, which I regard as a consequence of the act A; on the other hand, for whatever reason, I infer that this consequence is not immediate, but that outside my consciousness two facts B and C, which I have not witnessed, have happened, and in such a way that B is the effect of A, that C is the effect of B, and D of C.

I perform a voluntary action A and afterward feel a sensation D, which I see as a result of the action A; however, for some reason, I conclude that this result isn't immediate, but that outside my awareness two events B and C, which I haven't observed, have occurred, such that B is the effect of A, C is the effect of B, and D is the result of C.

But why? If I think I have reason to regard the four facts A, B, C, D, as bound to one another by a causal connection, why[Pg 231] range them in the causal order A B C D, and at the same time in the chronologic order A B C D, rather than in any other order?

But why? If I believe I have a reason to see the four facts A, B, C, D, as connected by a cause-and-effect relationship, why[Pg 231] arrange them in the causal order A B C D, and at the same time in the chronological order A B C D, instead of in any other order?

I clearly see that in the act A I have the feeling of having been active, while in undergoing the sensation D I have that of having been passive. This is why I regard A as the initial cause and D as the ultimate effect; this is why I put A at the beginning of the chain and D at the end; but why put B before C rather than C before B?

I clearly see that in the act A, I feel like I’ve been active, while in experiencing the sensation D, I feel like I’ve been passive. That’s why I see A as the initial cause and D as the final effect; that’s why I place A at the start of the chain and D at the end. But why put B before C instead of C before B?

If this question is put, the reply ordinarily is: we know that it is B which is the cause of C because we always see B happen before C. These two phenomena, when witnessed, happen in a certain order; when analogous phenomena happen without witness, there is no reason to invert this order.

If this question is asked, the usual answer is: we know that it is B that causes C because we always see B occur before C. When we observe these two events, they happen in a specific order; when similar events occur without observation, there's no reason to change that order.

Doubtless, but take care; we never know directly the physical phenomena B and C. What we know are sensations and produced respectively by B and C. Our consciousness tells us immediately that precedes and we suppose that B and C succeed one another in the same order.

Doubtless, but be careful; we never directly know the physical phenomena B and C. What we know are the sensations and produced by B and C, respectively. Our consciousness immediately tells us that comes before , and we assume that B and C follow each other in the same order.

This rule appears in fact very natural, and yet we are often led to depart from it. We hear the sound of the thunder only some seconds after the electric discharge of the cloud. Of two flashes of lightning, the one distant, the other near, can not the first be anterior to the second, even though the sound of the second comes to us before that of the first?

This rule seems quite natural, yet we often stray from it. We hear the sound of thunder a few seconds after the lightning from the cloud. With two flashes of lightning, one far away and the other close, can’t the first one happen before the second, even if we hear the sound of the second before the sound of the first?

XI

Another difficulty; have we really the right to speak of the cause of a phenomenon? If all the parts of the universe are interchained in a certain measure, any one phenomenon will not be the effect of a single cause, but the resultant of causes infinitely numerous; it is, one often says, the consequence of the state of the universe a moment before. How enunciate rules applicable to circumstances so complex? And yet it is only thus that these rules can be general and rigorous.

Another difficulty: do we truly have the right to discuss the cause of a phenomenon? If all parts of the universe are interconnected to some extent, then any given phenomenon isn’t the result of a single cause, but rather the outcome of countless causes. It’s often said that it’s the consequence of the state of the universe just a moment before. How can we articulate rules that apply to such complex circumstances? And yet, it’s only in this way that these rules can be both general and precise.

Not to lose ourselves in this infinite complexity, let us make a simpler hypothesis. Consider three stars, for example, the sun, Jupiter and Saturn; but, for greater simplicity, regard them as[Pg 232] reduced to material points and isolated from the rest of the world. The positions and the velocities of three bodies at a given instant suffice to determine their positions and velocities at the following instant, and consequently at any instant. Their positions at the instant t determine their positions at the instant t + h as well as their positions at the instant th.

Not to get lost in this infinite complexity, let's make a simpler assumption. Consider three stars, for example, the sun, Jupiter, and Saturn; but to keep it straightforward, let's think of them as[Pg 232] reduced to point-like objects and separated from everything else. The positions and speeds of these three bodies at a certain moment are enough to determine their positions and speeds at the next moment, and therefore at any moment. Their positions at time t determine their positions at time t + h and also at time th.

Even more; the position of Jupiter at the instant t, together with that of Saturn at the instant t + a, determines the position of Jupiter at any instant and that of Saturn at any instant.

Even more, the position of Jupiter at the moment t, along with that of Saturn at the moment t + a, decides the position of Jupiter at any time and that of Saturn at any time.

The aggregate of positions occupied by Jupiter at the instant t + e and Saturn at the instant t + a + e is bound to the aggregate of positions occupied by Jupiter at the instant t and Saturn at the instant t + a, by laws as precise as that of Newton, though more complicated. Then why not regard one of these aggregates as the cause of the other, which would lead to considering as simultaneous the instant t of Jupiter and the instant t + a of Saturn?

The total positions of Jupiter at the moment t + e and Saturn at the moment t + a + e is linked to the total positions of Jupiter at the moment t and Saturn at the moment t + a, by laws as exact as Newton's, though more complex. So why not see one of these totals as the cause of the other, which would lead to viewing the moment t of Jupiter and the moment t + a of Saturn as simultaneous?

In answer there can only be reasons, very strong, it is true, of convenience and simplicity.

In response, there can only be very strong reasons, it’s true, related to convenience and simplicity.

XII

But let us pass to examples less artificial; to understand the definition implicitly supposed by the savants, let us watch them at work and look for the rules by which they investigate simultaneity.

But let's move on to less artificial examples; to understand the definition that the experts take for granted, let's observe them in action and look for the rules they use to explore simultaneity.

I will take two simple examples, the measurement of the velocity of light and the determination of longitude.

I will give two straightforward examples: measuring the speed of light and figuring out longitude.

When an astronomer tells me that some stellar phenomenon, which his telescope reveals to him at this moment, happened, nevertheless, fifty years ago, I seek his meaning, and to that end I shall ask him first how he knows it, that is, how he has measured the velocity of light.

When an astronomer tells me that some stellar phenomenon, which his telescope is showing him right now, actually happened fifty years ago, I try to understand what he means, and to do that, I’ll first ask him how he knows this, specifically how he has measured the speed of light.

He has begun by supposing that light has a constant velocity, and in particular that its velocity is the same in all directions. That is a postulate without which no measurement of this velocity could be attempted. This postulate could never be verified directly by experiment; it might be contradicted by it if the results of different measurements were not concordant. We[Pg 233] should think ourselves fortunate that this contradiction has not happened and that the slight discordances which may happen can be readily explained.

He has started by assuming that light travels at a constant speed and that this speed is the same in all directions. This is a basic assumption without which we couldn't attempt to measure this speed. This assumption can never be directly tested through experiments; it could be challenged if the results of various measurements didn't match up. We[Pg 233] should consider ourselves lucky that this challenge hasn't occurred and that any small discrepancies that do arise can be easily explained.

The postulate, at all events, resembling the principle of sufficient reason, has been accepted by everybody; what I wish to emphasize is that it furnishes us with a new rule for the investigation of simultaneity, entirely different from that which we have enunciated above.

The idea, in any case, similar to the principle of sufficient reason, has been accepted by everyone; what I want to highlight is that it provides us with a new guideline for examining simultaneity, completely different from the one we stated earlier.

This postulate assumed, let us see how the velocity of light has been measured. You know that Roemer used eclipses of the satellites of Jupiter, and sought how much the event fell behind its prediction. But how is this prediction made? It is by the aid of astronomic laws; for instance Newton's law.

This assumption leads us to examine how the speed of light has been measured. You might know that Roemer used the eclipses of Jupiter's satellites and looked at how much the event lagged behind its prediction. But how is this prediction made? It's done with the help of astronomical laws, like Newton's law.

Could not the observed facts be just as well explained if we attributed to the velocity of light a little different value from that adopted, and supposed Newton's law only approximate? Only this would lead to replacing Newton's law by another more complicated. So for the velocity of light a value is adopted, such that the astronomic laws compatible with this value may be as simple as possible. When navigators or geographers determine a longitude, they have to solve just the problem we are discussing; they must, without being at Paris, calculate Paris time. How do they accomplish it? They carry a chronometer set for Paris. The qualitative problem of simultaneity is made to depend upon the quantitative problem of the measurement of time. I need not take up the difficulties relative to this latter problem, since above I have emphasized them at length.

Couldn’t the observed facts be explained just as well if we assigned a slightly different value to the speed of light and considered Newton's law to be only an approximation? However, this would just mean replacing Newton's law with something more complex. So, a specific value for the speed of light is chosen so that the astronomical laws compatible with it can be as straightforward as possible. When navigators or geographers determine a longitude, they face the same challenge we’re discussing; they need to calculate the time in Paris without being there. How do they do that? They use a chronometer set to Paris time. The qualitative issue of simultaneity depends on the quantitative issue of measuring time. I won't delve into the challenges related to this latter issue, as I’ve already discussed them in detail above.

Or else they observe an astronomic phenomenon, such as an eclipse of the moon, and they suppose that this phenomenon is perceived simultaneously from all points of the earth. That is not altogether true, since the propagation of light is not instantaneous; if absolute exactitude were desired, there would be a correction to make according to a complicated rule.

Or they might witness an astronomical event, like a lunar eclipse, and think that everyone on Earth experiences it at the same time. That's not entirely true because light doesn't travel instantly; if we wanted to be perfectly precise, there would be a correction to apply based on a complex rule.

Or else finally they use the telegraph. It is clear first that the reception of the signal at Berlin, for instance, is after the sending of this same signal from Paris. This is the rule of cause and effect analyzed above. But how much after? In general, the duration of the transmission is neglected and the two events are[Pg 234] regarded as simultaneous. But, to be rigorous, a little correction would still have to be made by a complicated calculation; in practise it is not made, because it would be well within the errors of observation; its theoretic necessity is none the less from our point of view, which is that of a rigorous definition. From this discussion, I wish to emphasize two things: (1) The rules applied are exceedingly various. (2) It is difficult to separate the qualitative problem of simultaneity from the quantitative problem of the measurement of time; no matter whether a chronometer is used, or whether account must be taken of a velocity of transmission, as that of light, because such a velocity could not be measured without measuring a time.

Or they might finally use the telegraph. It's clear that the reception of the signal in Berlin, for example, happens after the signal is sent from Paris. This follows the cause and effect rule we discussed earlier. But how much later? Generally, the transmission time is overlooked, and the two events are[Pg 234] seen as happening at the same time. However, to be precise, a small adjustment should still be made through a complicated calculation; in practice, it isn't done because it's within the margin of observational error. Nonetheless, from our point of view, which seeks a strict definition, its theoretical necessity remains. From this discussion, I want to highlight two things: (1) The rules in play vary significantly. (2) It's tough to distinguish the qualitative issue of simultaneity from the quantitative issue of measuring time, whether using a chronometer or accounting for a transmission speed, like that of light, since measuring that speed would require measuring time.

XIII

To conclude: We have not a direct intuition of simultaneity, nor of the equality of two durations. If we think we have this intuition, this is an illusion. We replace it by the aid of certain rules which we apply almost always without taking count of them.

To conclude: We don't have a direct sense of simultaneity, nor of the equality of two durations. If we think we do, that's an illusion. We substitute it with certain rules that we apply almost always without being aware of them.

But what is the nature of these rules? No general rule, no rigorous rule; a multitude of little rules applicable to each particular case.

But what are the nature of these rules? No general rule, no strict rule; just a lot of small rules that apply to each specific situation.

These rules are not imposed upon us and we might amuse ourselves in inventing others; but they could not be cast aside without greatly complicating the enunciation of the laws of physics, mechanics and astronomy.

These rules aren’t forced upon us, and we can entertain ourselves by coming up with others; however, we can’t just disregard them without making the explanation of the laws of physics, mechanics, and astronomy much more complicated.

We therefore choose these rules, not because they are true, but because they are the most convenient, and we may recapitulate them as follows: "The simultaneity of two events, or the order of their succession, the equality of two durations, are to be so defined that the enunciation of the natural laws may be as simple as possible. In other words, all these rules, all these definitions are only the fruit of an unconscious opportunism."

We choose these rules not because they are true, but because they are the most convenient. We can summarize them like this: "The coincidence of two events or the order in which they happen, and the equality of two durations, should be defined in a way that makes the expression of natural laws as simple as possible. In other words, all these rules and definitions are simply the result of an unconscious opportunism."


CHAPTER III

The Concept of Space

1. Introduction

In the articles I have heretofore devoted to space I have above all emphasized the problems raised by non-Euclidean geometry, while leaving almost completely aside other questions more difficult of approach, such as those which pertain to the number of dimensions. All the geometries I considered had thus a common basis, that tridimensional continuum which was the same for all and which differentiated itself only by the figures one drew in it or when one aspired to measure it.

In the articles I've written so far about space, I've primarily focused on the issues brought up by non-Euclidean geometry, while mostly ignoring other more complex questions, like those related to the number of dimensions. All the geometries I looked at shared a common foundation: that three-dimensional continuum that was the same for everyone, differing only in the shapes drawn within it or in attempts to measure it.

In this continuum, primitively amorphous, we may imagine a network of lines and surfaces, we may then convene to regard the meshes of this net as equal to one another, and it is only after this convention that this continuum, become measurable, becomes Euclidean or non-Euclidean space. From this amorphous continuum can therefore arise indifferently one or the other of the two spaces, just as on a blank sheet of paper may be traced indifferently a straight or a circle.

In this shapeless continuum, we can picture a network of lines and surfaces. We can come together to view the connections in this net as being equal to each other, and it's only after agreeing on this that this continuum becomes measurable, turning into either Euclidean or non-Euclidean space. From this formless continuum, either of the two spaces can emerge, just as you can draw either a straight line or a circle on a blank sheet of paper.

In space we know rectilinear triangles the sum of whose angles is equal to two right angles; but equally we know curvilinear triangles the sum of whose angles is less than two right angles. The existence of the one sort is not more doubtful than that of the other. To give the name of straights to the sides of the first is to adopt Euclidean geometry; to give the name of straights to the sides of the latter is to adopt the non-Euclidean geometry. So that to ask what geometry it is proper to adopt is to ask, to what line is it proper to give the name straight?

In space, we understand that in straight triangles, the sum of the angles equals two right angles; but we also recognize that in curved triangles, the sum of the angles is less than two right angles. The existence of one type is not more uncertain than that of the other. Referring to the sides of the first type as straight is to embrace Euclidean geometry; labeling the sides of the second type as straight means embracing non-Euclidean geometry. Therefore, to question which geometry to adopt is essentially to ask which line should be called straight.

It is evident that experiment can not settle such a question; one would not ask, for instance, experiment to decide whether I should call AB or CD a straight. On the other hand, neither can I say that I have not the right to give the name of straights to the sides of non-Euclidean triangles because they are not in[Pg 236] conformity with the eternal idea of straight which I have by intuition. I grant, indeed, that I have the intuitive idea of the side of the Euclidean triangle, but I have equally the intuitive idea of the side of the non-Euclidean triangle. Why should I have the right to apply the name of straight to the first of these ideas and not to the second? Wherein does this syllable form an integrant part of this intuitive idea? Evidently when we say that the Euclidean straight is a true straight and that the non-Euclidean straight is not a true straight, we simply mean that the first intuitive idea corresponds to a more noteworthy object than the second. But how do we decide that this object is more noteworthy? This question I have investigated in 'Science and Hypothesis.'

It’s clear that experiments can’t determine such a question; for example, you wouldn’t rely on experiments to decide whether I should call AB or CD straight. On the flip side, I can’t say that I don’t have the right to call the sides of non-Euclidean triangles straight just because they don’t match the timeless concept of straight that I have intuitively. I admit that I have an intuitive idea of the side of a Euclidean triangle, but I also have an intuitive idea of the side of a non-Euclidean triangle. Why should I have the right to label the first idea as straight but not the second? What makes that term an essential part of one intuitive idea and not the other? Clearly, when we say that the Euclidean straight is a true straight and that the non-Euclidean straight is not a true straight, we’re simply saying that the first intuitive idea relates to a more significant object than the second. But how do we determine that this object is more significant? I’ve explored this question in 'Science and Hypothesis.'

It is here that we saw experience come in. If the Euclidean straight is more noteworthy than the non-Euclidean straight, it is so chiefly because it differs little from certain noteworthy natural objects from which the non-Euclidean straight differs greatly. But, it will be said, the definition of the non-Euclidean straight is artificial; if we for a moment adopt it, we shall see that two circles of different radius both receive the name of non-Euclidean straights, while of two circles of the same radius one can satisfy the definition without the other being able to satisfy it, and then if we transport one of these so-called straights without deforming it, it will cease to be a straight. But by what right do we consider as equal these two figures which the Euclidean geometers call two circles with the same radius? It is because by transporting one of them without deforming it we can make it coincide with the other. And why do we say this transportation is effected without deformation? It is impossible to give a good reason for it. Among all the motions conceivable, there are some of which the Euclidean geometers say that they are not accompanied by deformation; but there are others of which the non-Euclidean geometers would say that they are not accompanied by deformation. In the first, called Euclidean motions, the Euclidean straights remain Euclidean straights and the non-Euclidean straights do not remain non-Euclidean straights; in the motions of the second sort, or non-Euclidean motions, the non-Euclidean straights remain non-Euclidean straights[Pg 237] and the Euclidean straights do not remain Euclidean straights. It has, therefore, not been demonstrated that it was unreasonable to call straights the sides of non-Euclidean triangles; it has only been shown that that would be unreasonable if one continued to call the Euclidean motions motions without deformation; but it has at the same time been shown that it would be just as unreasonable to call straights the sides of Euclidean triangles if the non-Euclidean motions were called motions without deformation.

It is here that we see experience come into play. If the Euclidean straight line is more significant than the non-Euclidean straight line, it's mainly because it closely resembles certain significant natural objects that the non-Euclidean straight line differs greatly from. However, it could be argued that the definition of the non-Euclidean straight line is artificial; if we briefly accept it, we will find that two circles of different radii are both called non-Euclidean straights, while out of two circles with the same radius, only one can meet the definition. Then, if we move one of these so-called straights without changing its shape, it will stop being a straight line. But what justifies us in considering these two figures that Euclidean geometers call two circles with the same radius as equal? It’s because, by moving one of them without stretching it, we can make it line up with the other. And why do we say this movement is done without any change in shape? There's no solid reason for it. Among all conceivable movements, some of which Euclidean geometers claim occur without deformation, while others are claimed by non-Euclidean geometers to also be without deformation. In the first category, called Euclidean motions, the Euclidean straights remain Euclidean, and the non-Euclidean straights do not remain non-Euclidean. In the second category, or non-Euclidean motions, the non-Euclidean straights remain so, while the Euclidean straights do not. Therefore, it hasn’t been proven that it’s unreasonable to call the sides of non-Euclidean triangles straights; it has only been shown that it would be unreasonable to keep calling Euclidean motions motions without deformation. At the same time, it has also been shown that it would be just as unreasonable to label the sides of Euclidean triangles as straights if non-Euclidean motions were referred to as motions without deformation.[Pg 237]

Now when we say that the Euclidean motions are the true motions without deformation, what do we mean? We simply mean that they are more noteworthy than the others. And why are they more noteworthy? It is because certain noteworthy natural bodies, the solid bodies, undergo motions almost similar.

Now when we say that Euclidean motions are the true motions without deformation, what do we mean? We simply mean that they are more significant than the others. And why are they more significant? It is because certain remarkable natural bodies, the solid bodies, undergo motions that are almost similar.

And then when we ask: Can one imagine non-Euclidean space? That means: Can we imagine a world where there would be noteworthy natural objects affecting almost the form of non-Euclidean straights, and noteworthy natural bodies frequently undergoing motions almost similar to the non-Euclidean motions? I have shown in 'Science and Hypothesis' that to this question we must answer yes.

And then when we ask: Can we imagine non-Euclidean space? That means: Can we picture a world with significant natural objects that influence the shape of non-Euclidean lines, and important natural bodies that often move in ways similar to non-Euclidean movements? I've demonstrated in 'Science and Hypothesis' that to this question, our answer must be yes.

It has often been observed that if all the bodies in the universe were dilated simultaneously and in the same proportion, we should have no means of perceiving it, since all our measuring instruments would grow at the same time as the objects themselves which they serve to measure. The world, after this dilatation, would continue on its course without anything apprising us of so considerable an event. In other words, two worlds similar to one another (understanding the word similitude in the sense of Euclid, Book VI.) would be absolutely indistinguishable. But more; worlds will be indistinguishable not only if they are equal or similar, that is, if we can pass from one to the other by changing the axes of coordinates, or by changing the scale to which lengths are referred; but they will still be indistinguishable if we can pass from one to the other by any 'point-transformation' whatever. I will explain my meaning. I suppose that to each point of one corresponds one point of the other and only one, and inversely; and besides that the[Pg 238] coordinates of a point are continuous functions, otherwise altogether arbitrary, of the corresponding point. I suppose besides that to each object of the first world corresponds in the second an object of the same nature placed precisely at the corresponding point. I suppose finally that this correspondence fulfilled at the initial instant is maintained indefinitely. We should have no means of distinguishing these two worlds one from the other. The relativity of space is not ordinarily understood in so broad a sense; it is thus, however, that it would be proper to understand it.

It has often been noted that if all the objects in the universe expanded at the same time and in the same way, we wouldn’t be able to notice it, as all our measuring tools would also grow alongside the objects they measure. The universe, after this expansion, would keep going without anything indicating such a significant event. In other words, two worlds that are similar to each other (using the term similarity in the sense defined by Euclid in Book VI) would be completely indistinguishable. Furthermore, worlds will be indistinguishable not only if they are equal or similar, meaning we can transition from one to the other by adjusting the coordinate axes or altering the scale for lengths; but they will also remain indistinguishable if we can shift from one to the other via any 'point-transformation' whatsoever. Let me clarify what I mean. I assume that each point in one world corresponds to one point in the other and only one, and vice versa; also, the coordinates of a point are continuous functions, otherwise completely arbitrary, of the corresponding point. Additionally, I assume that for every object in the first world, there is a corresponding object of the same type in the second world, located precisely at the corresponding point. Finally, I assume this correspondence established at the initial moment is maintained indefinitely. We wouldn’t have any way to distinguish these two worlds from one another. The concept of the relativity of space isn’t usually understood in such a broad way; however, that is how it should be understood.

If one of these universes is our Euclidean world, what its inhabitants will call straight will be our Euclidean straight; but what the inhabitants of the second world will call straight will be a curve which will have the same properties in relation to the world they inhabit and in relation to the motions that they will call motions without deformation. Their geometry will, therefore, be Euclidean geometry, but their straight will not be our Euclidean straight. It will be its transform by the point-transformation which carries over from our world to theirs. The straights of these men will not be our straights, but they will have among themselves the same relations as our straights to one another. It is in this sense I say their geometry will be ours. If then we wish after all to proclaim that they deceive themselves, that their straight is not the true straight, if we still are unwilling to admit that such an affirmation has no meaning, at least we must confess that these people have no means whatever of recognizing their error.

If one of these universes is our Euclidean world, what its inhabitants call straight will be our Euclidean straight; however, what the inhabitants of the second world call straight will be a curve that has the same properties in relation to the world they live in and in relation to the motions they describe as motions without deformation. Their geometry will, therefore, be Euclidean geometry, but their straight won't be our Euclidean straight. It will be transformed by the point-transformation that shifts from our world to theirs. The straights of these people won’t be our straights, but they will have the same relationships among themselves as our straights do with one another. In this sense, I say their geometry will be ours. If we still want to insist that they are mistaken, that their straight is not the true straight, and refuse to acknowledge that such a statement has no meaning, we must at least admit that these people have no way of recognizing their error.

2. Qualitative Geometry

All that is relatively easy to understand, and I have already so often repeated it that I think it needless to expatiate further on the matter. Euclidean space is not a form imposed upon our sensibility, since we can imagine non-Euclidean space; but the two spaces, Euclidean and non-Euclidean, have a common basis, that amorphous continuum of which I spoke in the beginning. From this continuum we can get either Euclidean space or Lobachevskian space, just as we can, by tracing upon it a proper graduation, transform an ungraduated thermometer into a Fahrenheit or a Réaumur thermometer.[Pg 239]

All of this is relatively easy to understand, and I’ve already repeated it so many times that I think it’s unnecessary to elaborate further. Euclidean space isn’t just a form imposed on our perception, since we can also imagine non-Euclidean space. However, both Euclidean and non-Euclidean spaces share a common foundation: that formless continuum I mentioned earlier. From this continuum, we can derive either Euclidean space or Lobachevskian space, much like how we can convert an uncalibrated thermometer into a Fahrenheit or Réaumur thermometer by applying the right scale.[Pg 239]

And then comes a question: Is not this amorphous continuum, that our analysis has allowed to survive, a form imposed upon our sensibility? If so, we should have enlarged the prison in which this sensibility is confined, but it would always be a prison.

And then comes a question: Isn't this shapeless continuum that our analysis has managed to keep alive just a way we've imposed on our feelings? If that's the case, we should have expanded the prison that holds these feelings, but it would still always be a prison.

This continuum has a certain number of properties, exempt from all idea of measurement. The study of these properties is the object of a science which has been cultivated by many great geometers and in particular by Riemann and Betti and which has received the name of analysis situs. In this science abstraction is made of every quantitative idea and, for example, if we ascertain that on a line the point B is between the points A and C, we shall be content with this ascertainment and shall not trouble to know whether the line ABC is straight or curved, nor whether the length AB is equal to the length BC, or whether it is twice as great.

This continuum has several properties that are free from any idea of measurement. The study of these properties is the focus of a field that has been explored by many great geometers, particularly Riemann and Betti, and this field is known as topology. In this discipline, we ignore all quantitative notions. For instance, if we determine that point B is located between points A and C on a line, we are satisfied with that information and don’t concern ourselves with whether the line ABC is straight or curved, or whether the length AB is equal to the length BC, or if one is twice the other.

The theorems of analysis situs have, therefore, this peculiarity, that they would remain true if the figures were copied by an inexpert draftsman who should grossly change all the proportions and replace the straights by lines more or less sinuous. In mathematical terms, they are not altered by any 'point-transformation' whatsoever. It has often been said that metric geometry was quantitative, while projective geometry was purely qualitative. That is not altogether true. The straight is still distinguished from other lines by properties which remain quantitative in some respects. The real qualitative geometry is, therefore, analysis situs.

The theorems of topology have this unique feature: they would still hold true even if someone not skilled in drawing completely messed up the proportions and replaced straight lines with wavy ones. In mathematical terms, they aren't changed by any 'point transformation' at all. It's often stated that metric geometry is quantitative, while projective geometry is purely qualitative. However, that's not entirely accurate. Straight lines are still distinct from other lines by properties that remain quantitative in some aspects. Therefore, the true qualitative geometry is topology.

The same questions which came up apropos of the truths of Euclidean geometry, come up anew apropos of the theorems of analysis situs. Are they obtainable by deductive reasoning? Are they disguised conventions? Are they experimental verities? Are they the characteristics of a form imposed either upon our sensibility or upon our understanding?

The same questions that were raised about the truths of Euclidean geometry arise again regarding the theorems of topology. Can they be derived through deductive reasoning? Are they just hidden conventions? Are they proven by experimental facts? Do they represent traits of a structure imposed either on our perception or on our comprehension?

I wish simply to observe that the last two solutions exclude each other. We can not admit at the same time that it is impossible to imagine space of four dimensions and that experience proves to us that space has three dimensions. The experimenter puts to nature a question: Is it this or that? and he can not put[Pg 240] it without imagining the two terms of the alternative. If it were impossible to imagine one of these terms, it would be futile and besides impossible to consult experience. There is no need of observation to know that the hand of a watch is not marking the hour 15 on the dial, because we know beforehand that there are only 12, and we could not look at the mark 15 to see if the hand is there, because this mark does not exist.

I simply want to point out that the last two solutions contradict each other. We can’t accept at the same time that it’s impossible to imagine four-dimensional space and that our experience shows us space has three dimensions. The experimenter asks nature a question: Is it this or that? and can’t pose it without imagining both options. If it were impossible to conceive one of these options, it would be pointless and, beyond that, impossible to turn to experience. There’s no need for observation to know that a watch’s hand isn’t pointing to 15 on the dial, because we already know there are only 12, and we couldn’t even look at the 15 mark to see if the hand is there, since that mark doesn’t exist.

Note likewise that in analysis situs the empiricists are disembarrassed of one of the gravest objections that can be leveled against them, of that which renders absolutely vain in advance all their efforts to apply their thesis to the verities of Euclidean geometry. These verities are rigorous and all experimentation can only be approximate. In analysis situs approximate experiments may suffice to give a rigorous theorem and, for instance, if it is seen that space can not have either two or less than two dimensions, nor four or more than four, we are certain that it has exactly three, since it could not have two and a half or three and a half.

Note also that in topology, empiricists are free from one of the biggest criticisms against them, which makes all their attempts to apply their ideas to the truths of Euclidean geometry basically useless from the start. These truths are strict, and all experimentation can only be somewhat accurate. In topology, approximate experiments can be enough to establish a precise theorem. For example, if we observe that space cannot have two dimensions or less, nor four dimensions or more, we can confidently conclude that it has exactly three dimensions, since it can't have two and a half or three and a half dimensions.

Of all the theorems of analysis situs, the most important is that which is expressed in saying that space has three dimensions. This it is that we are about to consider, and we shall put the question in these terms: When we say that space has three dimensions, what do we mean?

Of all the theorems of topology, the most important one is the idea that space has three dimensions. This is what we are going to explore, and we will frame the question like this: When we say that space has three dimensions, what do we really mean?

3. The Physical Continuum of Several Dimensions

I have explained in 'Science and Hypothesis' whence we derive the notion of physical continuity and how that of mathematical continuity has arisen from it. It happens that we are capable of distinguishing two impressions one from the other, while each is indistinguishable from a third. Thus we can readily distinguish a weight of 12 grams from a weight of 10 grams, while a weight of 11 grams could be distinguished from neither the one nor the other. Such a statement, translated into symbols, may be written:

I have explained in 'Science and Hypothesis' where we get the idea of physical continuity and how the concept of mathematical continuity has emerged from it. We can identify two impressions as different from each other, even though each one is indistinguishable from a third. For example, we can easily tell a weight of 12 grams apart from a weight of 10 grams, while we can’t distinguish a weight of 11 grams from either of them. This statement, expressed in symbols, can be written as:

A = B,      B = C,      A < C.

A = B,      B = C,      A < C.

This would be the formula of the physical continuum, as crude experience gives it to us, whence arises an intolerable contradiction[Pg 241] that has been obviated by the introduction of the mathematical continuum. This is a scale of which the steps (commensurable or incommensurable numbers) are infinite in number but are exterior to one another, instead of encroaching on one another as do the elements of the physical continuum, in conformity with the preceding formula.

This would be the formula of the physical continuum, as basic experience presents it to us, which leads to an unbearable contradiction[Pg 241] that has been addressed by introducing the mathematical continuum. This is a scale with infinite steps (both commensurable and incommensurable numbers) that are separate from each other, unlike the elements of the physical continuum, which overlap with one another according to the previous formula.

The physical continuum is, so to speak, a nebula not resolved; the most perfect instruments could not attain to its resolution. Doubtless if we measured the weights with a good balance instead of judging them by the hand, we could distinguish the weight of 11 grams from those of 10 and 12 grams, and our formula would become:

The physical continuum is, in a way, an unresolved nebula; even the best instruments couldn't achieve its clarity. Surely, if we weighed the objects with a good scale instead of estimating them by hand, we could tell the difference between 11 grams and 10 or 12 grams, and our formula would be:

A < B,      B < C,      A < C.

A < B,      B < C,      A < C.

But we should always find between A and B and between B and C new elements D and E, such that

But we should always find new elements D and E between A and B, and between B and C, such that

A = D,      D = B,      A < B;      B = E,      E = C,      B < C,

A = D,      D = B,      A < B;      B = E,      E = C,      B < C,

and the difficulty would only have receded and the nebula would always remain unresolved; the mind alone can resolve it and the mathematical continuum it is which is the nebula resolved into stars.

and the difficulty would only have faded and the nebula would always stay unresolved; only the mind can solve it, and the mathematical continuum is what turns the nebula into stars.

Yet up to this point we have not introduced the notion of the number of dimensions. What is meant when we say that a mathematical continuum or that a physical continuum has two or three dimensions?

Yet up to this point, we haven't discussed the idea of dimensions. What do we mean when we say that a mathematical continuum or a physical continuum has two or three dimensions?

First we must introduce the notion of cut, studying first physical continua. We have seen what characterizes the physical continuum. Each of the elements of this continuum consists of a manifold of impressions; and it may happen either that an element can not be discriminated from another element of the same continuum, if this new element corresponds to a manifold of impressions not sufficiently different, or, on the contrary, that the discrimination is possible; finally it may happen that two elements indistinguishable from a third may, nevertheless, be distinguished one from the other.

First, we need to introduce the concept of a cut, starting with physical continuums. We've looked at what defines the physical continuum. Each element of this continuum consists of a range of impressions, and it may be the case that one element cannot be distinguished from another element of the same continuum if the new element corresponds to a range of impressions that aren't significantly different. On the other hand, it may be possible to make a distinction; finally, it might happen that two elements that are indistinguishable from a third can still be distinguished from each other.

That postulated, if A and B are two distinguishable elements of a continuum C, a series of elements may be found, E1, E2, ..., En, all belonging to this same continuum C and such that each of[Pg 242] them is indistinguishable from the preceding, that E1 is indistinguishable from A, and En indistinguishable from B. Therefore we can go from A to B by a continuous route and without quitting C. If this condition is fulfilled for any two elements A and B of the continuum C, we may say that this continuum C is all in one piece. Now let us distinguish certain of the elements of C which may either be all distinguishable from one another, or themselves form one or several continua. The assemblage of the elements thus chosen arbitrarily among all those of C will form what I shall call the cut or the cuts.

That proposed idea states that if A and B are two identifiable elements of a continuum C, we can find a series of elements, E1, E2, ..., En, all part of the same continuum C, where each element is similar to the one before it, E1 is similar to A, and En is similar to B. Thus, we can transition from A to B along a continuous path without leaving C. If this condition holds true for any two elements A and B within the continuum C, we can say that this continuum C is whole. Now, let's identify certain elements of C that may either all be distinguishable from each other or could themselves form one or more continua. The collection of elements chosen this way from all those in C will create what I will refer to as the cut or the cuts.

Take on C any two elements A and B. Either we can also find a series of elements E1, E2, ..., En, such: (1) that they all belong to C; (2) that each of them is indistinguishable from the following, E1 indistinguishable from A and En from B; (3) and besides that none of the elements E is indistinguishable from any element of the cut. Or else, on the contrary, in each of the series E1, E2, ..., En satisfying the first two conditions, there will be an element E indistinguishable from one of the elements of the cut. In the first case we can go from A to B by a continuous route without quitting C and without meeting the cuts; in the second case that is impossible.

Take a set C with any two elements A and B. We can either find a series of elements E1, E2, ..., En such that: (1) they all belong to C; (2) each of them is indistinguishable from the next, with E1 being indistinguishable from A and En being indistinguishable from B; (3) and furthermore, none of the elements E is indistinguishable from any element of the cut. Otherwise, in each of the series E1, E2, ..., En that meet the first two conditions, there will be an element E that is indistinguishable from one of the elements of the cut. In the first case, we can move from A to B along a continuous path without leaving C and without encountering the cuts; in the second case, that is not possible.

If then for any two elements A and B of the continuum C, it is always the first case which presents itself, we shall say that C remains all in one piece despite the cuts.

If for any two elements A and B of the continuum C, the first case always occurs, we will say that C remains whole despite the cuts.

Thus, if we choose the cuts in a certain way, otherwise arbitrary, it may happen either that the continuum remains all in one piece or that it does not remain all in one piece; in this latter hypothesis we shall then say that it is divided by the cuts.

Thus, if we select the cuts in a specific way, which may seem random, it could result in the continuum either staying whole or becoming fragmented; in the latter case, we will say that it is divided by the cuts.

It will be noticed that all these definitions are constructed in setting out solely from this very simple fact, that two manifolds of impressions sometimes can be discriminated, sometimes can not be. That postulated, if, to divide a continuum, it suffices to consider as cuts a certain number of elements all distinguishable from one another, we say that this continuum is of one dimension; if, on the contrary, to divide a continuum, it is necessary to consider as cuts a system of elements themselves forming one or several continua, we shall say that this continuum is of several dimensions.[Pg 243]

It should be noted that all these definitions are based on a very simple fact: that two sets of impressions can sometimes be distinguished and sometimes cannot. Given that, if to divide a continuum, it is enough to consider a certain number of elements that are all distinguishable from one another, we say that this continuum is one-dimensional; if, on the other hand, to divide a continuum, it is necessary to consider a system of elements that themselves form one or more continua, we say that this continuum is multidimensional.[Pg 243]

If to divide a continuum C, cuts forming one or several continua of one dimension suffice, we shall say that C is a continuum of two dimensions; if cuts suffice which form one or several continua of two dimensions at most, we shall say that C is a continuum of three dimensions; and so on.

If to split a continuum C, cuts that create one or more continua of one dimension are enough, we will say that C is a continuum of two dimensions; if cuts are enough that create one or more continua of up to two dimensions, we will say that C is a continuum of three dimensions; and so on.

To justify this definition it is proper to see whether it is in this way that geometers introduce the notion of three dimensions at the beginning of their works. Now, what do we see? Usually they begin by defining surfaces as the boundaries of solids or pieces of space, lines as the boundaries of surfaces, points as the boundaries of lines, and they affirm that the same procedure can not be pushed further.

To explain this definition, it's important to consider how geometers introduce the idea of three dimensions at the start of their work. What do we observe? Typically, they start by defining surfaces as the edges of solids or sections of space, lines as the edges of surfaces, points as the edges of lines, and they assert that this process cannot be extended any further.

This is just the idea given above: to divide space, cuts that are called surfaces are necessary; to divide surfaces, cuts that are called lines are necessary; to divide lines, cuts that are called points are necessary; we can go no further, the point can not be divided, so the point is not a continuum. Then lines which can be divided by cuts which are not continua will be continua of one dimension; surfaces which can be divided by continuous cuts of one dimension will be continua of two dimensions; finally, space which can be divided by continuous cuts of two dimensions will be a continuum of three dimensions.

This is just the idea mentioned above: to split space, we need cuts called surfaces; to split surfaces, we need cuts called lines; to split lines, we need cuts called points; we can't go any further because a point can't be divided, so a point is not a continuum. Then, lines that can be divided by cuts that aren't continua will be one-dimensional continua; surfaces that can be divided by continuous cuts of one dimension will be two-dimensional continua; finally, space that can be divided by continuous cuts of two dimensions will be a three-dimensional continuum.

Thus the definition I have just given does not differ essentially from the usual definitions; I have only endeavored to give it a form applicable not to the mathematical continuum, but to the physical continuum, which alone is susceptible of representation, and yet to retain all its precision. Moreover, we see that this definition applies not alone to space; that in all which falls under our senses we find the characteristics of the physical continuum, which would allow of the same classification; that it would be easy to find there examples of continua of four, of five, dimensions, in the sense of the preceding definition; such examples occur of themselves to the mind.

The definition I just provided isn’t fundamentally different from the typical definitions; I’ve just aimed to frame it in a way that’s relevant not to the mathematical continuum, but to the physical continuum, which is the only one that can be represented, while still keeping all its precision. Furthermore, we see that this definition applies not just to space; in everything we can perceive through our senses, we find characteristics of the physical continuum that would allow for the same classification. It would be easy to find examples of continua with four or five dimensions based on the earlier definition; such examples readily come to mind.

I should explain finally, if I had the time, that this science, of which I spoke above and to which Riemann gave the name of analysis situs, teaches us to make distinctions among continua of the same number of dimensions and that the classification of these continua rests also on the consideration of cuts.[Pg 244]

I should finally explain, if I had the time, that this science, which I mentioned earlier and which Riemann called analysis situs, teaches us to differentiate between continua that have the same number of dimensions, and that the classification of these continua is also based on the idea of cuts.[Pg 244]

From this notion has arisen that of the mathematical continuum of several dimensions in the same way that the physical continuum of one dimension engendered the mathematical continuum of one dimension. The formula

From this idea has come the concept of the mathematical continuum of several dimensions just like the physical continuum of one dimension created the mathematical continuum of one dimension. The formula

A > C,      A = B,      B = C,

A > C,      A = B,      B = C,

which summed up the data of crude experience, implied an intolerable contradiction. To get free from it, it was necessary to introduce a new notion while still respecting the essential characteristics of the physical continuum of several dimensions. The mathematical continuum of one dimension admitted of a scale whose divisions, infinite in number, corresponded to the different values, commensurable or not, of one same magnitude. To have the mathematical continuum of n dimensions, it will suffice to take n like scales whose divisions correspond to different values of n independent magnitudes called coordinates. We thus shall have an image of the physical continuum of n dimensions, and this image will be as faithful as it can be after the determination not to allow the contradiction of which I spoke above.

which summed up the data of raw experience, suggested an unacceptable contradiction. To escape it, a new concept needed to be introduced while still honoring the essential features of the physical continuum of multiple dimensions. The mathematical continuum of one dimension allowed for a scale with divisions, infinite in number, that corresponded to the various values, whether comparable or not, of the same quantity. To create the mathematical continuum of n dimensions, it is enough to have n similar scales with divisions that correspond to different values of n independent quantities known as coordinates. This will provide us with a representation of the physical continuum of n dimensions, and this representation will be as accurate as possible once we decide not to accept the contradiction I mentioned earlier.

4. The Notion of Point

It seems now that the question we put to ourselves at the start is answered. When we say that space has three dimensions, it will be said, we mean that the manifold of points of space satisfies the definition we have just given of the physical continuum of three dimensions. To be content with that would be to suppose that we know what is the manifold of points of space, or even one point of space.

It seems now that the question we asked ourselves at the beginning has been answered. When we say that space has three dimensions, we mean that the collection of points in space meets the definition we've just given of a physical continuum with three dimensions. To be satisfied with that would mean thinking we understand what the collection of points in space is, or even just one point in space.

Now that is not as simple as one might think. Every one believes he knows what a point is, and it is just because we know it too well that we think there is no need of defining it. Surely we can not be required to know how to define it, because in going back from definition to definition a time must come when we must stop. But at what moment should we stop?

Now, that's not as simple as it seems. Everyone thinks they know what a point is, and it's precisely because we know it so well that we believe there's no need to define it. Surely, we can't be expected to know how to define it, because if we keep going back from definition to definition, there will come a time when we need to stop. But when should we stop?

We shall stop first when we reach an object which falls under our senses or that we can represent to ourselves; definition then will become useless; we do not define the sheep to a child; we say to him: See the sheep.[Pg 245]

We’ll pause first when we come across something we can see or picture in our minds; at that point, definitions won't be necessary; we don’t define a sheep to a child; we simply say to them: Look at the sheep.[Pg 245]

So, then, we should ask ourselves if it is possible to represent to ourselves a point of space. Those who answer yes do not reflect that they represent to themselves in reality a white spot made with the chalk on a blackboard or a black spot made with a pen on white paper, and that they can represent to themselves only an object or rather the impressions that this object made on their senses.

So, let's ask ourselves if it's possible to visualize a point in space. Those who say yes don’t realize that they’re actually picturing a white dot made with chalk on a blackboard or a black dot made with a pen on white paper, and that they can only imagine an object or, more accurately, the impressions that this object made on their senses.

When they try to represent to themselves a point, they represent the impressions that very little objects made them feel. It is needless to add that two different objects, though both very little, may produce extremely different impressions, but I shall not dwell on this difficulty, which would still require some discussion.

When they try to visualize a point, they recall the feelings that very small objects gave them. It's important to note that two different small objects can create very different impressions, but I won't focus on this issue, which would require further discussion.

But it is not a question of that; it does not suffice to represent one point, it is necessary to represent a certain point and to have the means of distinguishing it from an other point. And in fact, that we may be able to apply to a continuum the rule I have above expounded and by which one may recognize the number of its dimensions, we must rely upon the fact that two elements of this continuum sometimes can and sometimes can not be distinguished. It is necessary therefore that we should in certain cases know how to represent to ourselves a specific element and to distinguish it from an other element.

But it’s not just about that; it’s not enough to point out one thing, we need to identify a specific thing and be able to tell it apart from another thing. In fact, for us to apply the rule I mentioned earlier to a continuum, which helps us determine the number of its dimensions, we have to be aware that sometimes we can distinguish between two elements in this continuum, and sometimes we can’t. Therefore, we must sometimes know how to represent a specific element and tell it apart from another element.

The question is to know whether the point that I represented to myself an hour ago is the same as this that I now represent to myself, or whether it is a different point. In other words, how do we know whether the point occupied by the object A at the instant α is the same as the point occupied by the object B at the instant β, or still better, what this means?

The question is whether the point I imagined an hour ago is the same as the one I'm thinking of now, or if it's a different point. In other words, how can we tell if the point held by object A at time α is the same as the point held by object B at time β? And, even better, what does that actually mean?

I am seated in my room; an object is placed on my table; during a second I do not move, no one touches the object. I am tempted to say that the point A which this object occupied at the beginning of this second is identical with the point B which it occupies at its end. Not at all; from the point A to the point B is 30 kilometers, because the object has been carried along in the motion of the earth. We can not know whether an object, be it large or small, has not changed its absolute position in space, and not only can we not affirm it, but this affirmation has no[Pg 246] meaning and in any case can not correspond to any representation.

I’m sitting in my room; there’s an object on my table; for a moment, I don’t move, and no one touches the object. I’m tempted to say that the point A where this object was at the start of that moment is the same as the point B where it is at the end. But that’s not true; the distance from point A to point B is 30 kilometers, because the object has been carried along by the motion of the earth. We can’t know if an object, no matter its size, hasn’t changed its absolute position in space, and not only can we not confirm it, but that claim has no[Pg 246] meaning and certainly can’t correspond to any representation.

But then we may ask ourselves if the relative position of an object with regard to other objects has changed or not, and first whether the relative position of this object with regard to our body has changed. If the impressions this object makes upon us have not changed, we shall be inclined to judge that neither has this relative position changed; if they have changed, we shall judge that this object has changed either in state or in relative position. It remains to decide which of the two. I have explained in 'Science and Hypothesis' how we have been led to distinguish the changes of position. Moreover, I shall return to that further on. We come to know, therefore, whether the relative position of an object with regard to our body has or has not remained the same.

But then we might ask ourselves if the position of an object in relation to other objects has changed or not, and first whether its position in relation to our body has changed. If the impressions this object makes on us haven’t changed, we’re likely to think that its relative position hasn’t changed either; if they have changed, we’ll conclude that the object has changed either in state or in its relative position. It’s important to determine which of the two. I explained in 'Science and Hypothesis' how we’ve learned to distinguish changes in position. Additionally, I’ll revisit that later. Thus, we can know whether the position of an object in relation to our body has remained the same or not.

If now we see that two objects have retained their relative position with regard to our body, we conclude that the relative position of these two objects with regard to one another has not changed; but we reach this conclusion only by indirect reasoning. The only thing that we know directly is the relative position of the objects with regard to our body. A fortiori it is only by indirect reasoning that we think we know (and, moreover, this belief is delusive) whether the absolute position of the object has changed.

If we observe that two objects have maintained their relative position to our body, we can conclude that the relative position between these two objects has not changed; however, we arrive at this conclusion through indirect reasoning. The only thing we know directly is the relative position of the objects to our body. A fortiori, it is also only through indirect reasoning that we believe (and this belief can be misleading) that we know whether the absolute position of the object has changed.

In a word, the system of coordinate axes to which we naturally refer all exterior objects is a system of axes invariably bound to our body, and carried around with us.

In short, the coordinate system we instinctively use to relate to all external objects is a system of axes that is always connected to our body and moves with us.

It is impossible to represent to oneself absolute space; when I try to represent to myself simultaneously objects and myself in motion in absolute space, in reality I represent to myself my own self motionless and seeing move around me different objects and a man that is exterior to me, but that I convene to call me.

It’s impossible to fully imagine absolute space. When I try to picture both myself and objects moving in absolute space at the same time, I actually see myself as still, watching various objects and a person outside of me whom I refer to as me.

Will the difficulty be solved if we agree to refer everything to these axes bound to our body? Shall we know then what is a point thus defined by its relative position with regard to ourselves? Many persons will answer yes and will say that they 'localize' exterior objects.

Will the difficulty be solved if we agree to refer everything to these axes connected to our body? Will we then understand what a point is when defined by its relative position to ourselves? Many people will answer yes and will say that they 'locate' external objects.

What does this mean? To localize an object simply means to represent to oneself the movements that would be necessary to[Pg 247] reach it. I will explain myself. It is not a question of representing the movements themselves in space, but solely of representing to oneself the muscular sensations which accompany these movements and which do not presuppose the preexistence of the notion of space.

What does this mean? To localize an object just means to imagine the movements needed to [Pg 247] get to it. Let me clarify. It’s not about visualizing the actual movements in space, but rather about imagining the muscular sensations that come with these movements, which don’t require the prior notion of space.

If we suppose two different objects which successively occupy the same relative position with regard to ourselves, the impressions that these two objects make upon us will be very different; if we localize them at the same point, this is simply because it is necessary to make the same movements to reach them; apart from that, one can not just see what they could have in common.

If we consider two different objects that take up the same position relative to us one after the other, the impressions they leave on us will be quite different. If we place them in the same spot, it’s only because we have to make the same movements to get to them; other than that, it's hard to see what they could possibly have in common.

But, given an object, we can conceive many different series of movements which equally enable us to reach it. If then we represent to ourselves a point by representing to ourselves the series of muscular sensations which accompany the movements which enable us to reach this point, there will be many ways entirely different of representing to oneself the same point. If one is not satisfied with this solution, but wishes, for instance, to bring in the visual sensations along with the muscular sensations, there will be one or two more ways of representing to oneself this same point and the difficulty will only be increased. In any case the following question comes up: Why do we think that all these representations so different from one another still represent the same point?

But given an object, we can imagine many different series of movements that allow us to reach it. If we picture a point by thinking of the series of muscle sensations that come with the movements needed to get to this point, there will be many completely different ways to visualize the same point. If one isn't satisfied with this solution and wants to include visual sensations along with muscle sensations, there will be one or two additional ways to visualize the same point, and the complexity will only increase. In any case, the question arises: Why do we believe that all these very different representations still refer to the same point?

Another remark: I have just said that it is to our own body that we naturally refer exterior objects; that we carry about everywhere with us a system of axes to which we refer all the points of space and that this system of axes seems to be invariably bound to our body. It should be noticed that rigorously we could not speak of axes invariably bound to the body unless the different parts of this body were themselves invariably bound to one another. As this is not the case, we ought, before referring exterior objects to these fictitious axes, to suppose our body brought back to the initial attitude.

Another point: I've just mentioned that we naturally relate outside objects to our own body; that we carry a system of axes with us everywhere to reference all points in space, and this system seems to be consistently tied to our body. It's important to note that technically, we can't talk about axes consistently tied to the body unless the different parts of that body are also consistently connected to each other. Since this isn't true, we should, before referencing outside objects to these imaginary axes, imagine our body returned to the starting position.

5. The Notion of Displacement

I have shown in 'Science and Hypothesis' the preponderant rôle played by the movements of our body in the genesis of the[Pg 248] notion of space. For a being completely immovable there would be neither space nor geometry; in vain would exterior objects be displaced about him, the variations which these displacements would make in his impressions would not be attributed by this being to changes of position, but to simple changes of state; this being would have no means of distinguishing these two sorts of changes, and this distinction, fundamental for us, would have no meaning for him.

I have demonstrated in 'Science and Hypothesis' the significant role that our body movements play in the development of the[Pg 248] concept of space. For a being that could not move at all, there would be no space or geometry; no matter how objects shifted around him, he wouldn’t interpret those changes as movements but simply as changes in his condition; this being would have no way to differentiate between these two types of changes, and this distinction, which is essential for us, would hold no significance for him.

The movements that we impress upon our members have as effect the varying of the impressions produced on our senses by external objects; other causes may likewise make them vary; but we are led to distinguish the changes produced by our own motions and we easily discriminate them for two reasons: (1) because they are voluntary; (2) because they are accompanied by muscular sensations.

The movements we make affect the different impressions that external objects create on our senses; other factors can also cause them to change; however, we can identify the changes caused by our own actions and easily recognize them for two reasons: (1) because they are intentional; (2) because they come with muscle sensations.

So we naturally divide the changes that our impressions may undergo into two categories to which perhaps I have given an inappropriate designation: (1) the internal changes, which are voluntary and accompanied by muscular sensations; (2) the external changes, having the opposite characteristics.

So we naturally categorize the changes that our impressions can go through into two groups, which I might have labeled inappropriately: (1) internal changes, which are voluntary and accompanied by muscle sensations; (2) external changes, which have the opposite characteristics.

We then observe that among the external changes are some which can be corrected, thanks to an internal change which brings everything back to the primitive state; others can not be corrected in this way (it is thus that, when an exterior object is displaced, we may then by changing our own position replace ourselves as regards this object in the same relative position as before, so as to reestablish the original aggregate of impressions; if this object was not displaced, but changed its state, that is impossible). Thence comes a new distinction among external changes: those which may be so corrected we call changes of position; and the others, changes of state.

We can see that among the external changes, some can be fixed by making an internal change that returns everything to its original state, while others can't be fixed this way. For example, when an external object moves, we can change our own position to regain the same relative position to that object and restore the original set of impressions. However, if the object didn't move but simply changed its state, that's not possible. This leads us to a new distinction among external changes: those that can be corrected are called changes of position, while those that cannot are called changes of state.

Think, for example, of a sphere with one hemisphere blue and the other red; it first presents to us the blue hemisphere, then it so revolves as to present the red hemisphere. Now think of a spherical vase containing a blue liquid which becomes red in consequence of a chemical reaction. In both cases the sensation of red has replaced that of blue; our senses have experienced the same impressions which have succeeded each other in the same[Pg 249] order, and yet these two changes are regarded by us as very different; the first is a displacement, the second a change of state. Why? Because in the first case it is sufficient for me to go around the sphere to place myself opposite the blue hemisphere and reestablish the original blue sensation.

Consider a sphere with one half blue and the other half red; it first shows us the blue half, then it rotates to reveal the red half. Now imagine a spherical vase filled with a blue liquid that turns red due to a chemical reaction. In both instances, the experience of red has taken over from blue; our senses have gone through the same impressions in the same[Pg 249] order, yet we view these two changes as quite different; the first is a simple shift, while the second is a transformation. Why is that? Because in the first scenario, all I need to do is walk around the sphere to face the blue half again and restore the original blue sensation.

Still more; if the two hemispheres, in place of being red and blue, had been yellow and green, how should I have interpreted the revolution of the sphere? Before, the red succeeded the blue, now the green succeeds the yellow; and yet I say that the two spheres have undergone the same revolution, that each has turned about its axis; yet I can not say that the green is to yellow as the red is to blue; how then am I led to decide that the two spheres have undergone the same displacement? Evidently because, in one case as in the other, I am able to reestablish the original sensation by going around the sphere, by making the same movements, and I know that I have made the same movements because I have felt the same muscular sensations; to know it, I do not need, therefore, to know geometry in advance and to represent to myself the movements of my body in geometric space.

Still more; if the two hemispheres, instead of being red and blue, had been yellow and green, how would I have interpreted the revolution of the sphere? Before, red followed blue, now green follows yellow; and yet I claim that the two spheres have gone through the same revolution, that each has rotated around its axis; still, I can’t say that green relates to yellow the same way red relates to blue; so how do I conclude that the two spheres have experienced the same displacement? Clearly because, in both cases, I can restore the original sensation by moving around the sphere, performing the same actions, and I know I’ve done the same actions because I’ve felt the same muscular sensations; to know this, I don’t need to have prior knowledge of geometry or to visualize my body’s movements in geometric space.

Another example: An object is displaced before my eye; its image was first formed at the center of the retina; then it is formed at the border; the old sensation was carried to me by a nerve fiber ending at the center of the retina; the new sensation is carried to me by another nerve fiber starting from the border of the retina; these two sensations are qualitatively different; otherwise, how could I distinguish them?

Another example: An object moves in front of me; its image is first created at the center of my retina; then it appears at the edge; the previous sensation was transmitted to me by a nerve fiber that ends at the center of the retina; the new sensation is sent to me by another nerve fiber that starts from the edge of the retina; these two sensations feel different; otherwise, how could I tell them apart?

Why then am I led to decide that these two sensations, qualitatively different, represent the same image, which has been displaced? It is because I can follow the object with the eye and by a displacement of the eye, voluntary and accompanied by muscular sensations, bring back the image to the center of the retina and reestablish the primitive sensation.

Why then do I feel that these two sensations, which are different in quality, represent the same image that has been shifted? It's because I can track the object with my eyes and by moving my eyes, which is intentional and paired with muscular sensations, I can bring the image back to the center of my retina and restore the original sensation.

I suppose that the image of a red object has gone from the center A to the border B of the retina, then that the image of a blue object goes in its turn from the center A to the border B of the retina; I shall decide that these two objects have undergone the same displacement. Why? Because in both cases I shall have been able to reestablish the primitive sensation, and[Pg 250] that to do it I shall have had to execute the same movement of the eye, and I shall know that my eye has executed the same movement because I shall have felt the same muscular sensations.

I guess that the image of a red object has moved from the center A to the edge B of the retina, and then the image of a blue object follows the same path from the center A to the edge B of the retina; I will conclude that these two objects have experienced the same displacement. Why? Because in both cases I will have been able to restore the original sensation, and[Pg 250] to do that, I will have had to make the same eye movement, and I will know that my eye made the same movement because I will have felt the same muscular sensations.

If I could not move my eye, should I have any reason to suppose that the sensation of red at the center of the retina is to the sensation of red at the border of the retina as that of blue at the center is to that of blue at the border? I should only have four sensations qualitatively different, and if I were asked if they are connected by the proportion I have just stated, the question would seem to me ridiculous, just as if I were asked if there is an analogous proportion between an auditory sensation, a tactile sensation and an olfactory sensation.

If I couldn't move my eye, would I have any reason to think that the feeling of red at the center of my retina is related to the feeling of red at the edge of the retina in the same way that the feeling of blue at the center is related to the feeling of blue at the edge? I would only have four different sensations, and if someone asked me if they're connected in the way I just described, I'd find the question absurd, just like if someone asked me if there's a similar relationship between a sound sensation, a touch sensation, and a smell sensation.

Let us now consider the internal changes, that is, those which are produced by the voluntary movements of our body and which are accompanied by muscular changes. They give rise to the two following observations, analogous to those we have just made on the subject of external changes.

Let’s now look at the internal changes, meaning those caused by our body’s voluntary movements and linked to muscle changes. They lead to two observations that are similar to the ones we just made about external changes.

1. I may suppose that my body has moved from one point to another, but that the same attitude is retained; all the parts of the body have therefore retained or resumed the same relative situation, although their absolute situation in space may have varied. I may suppose that not only has the position of my body changed, but that its attitude is no longer the same, that, for instance, my arms which before were folded are now stretched out.

1. I might think that my body has shifted from one spot to another, but the same attitude is still there; all parts of my body have kept or taken back the same relative position, even though their exact location in space may have changed. I might also think that not just the position of my body has changed, but that its attitude isn’t the same anymore, like when my arms, which were once crossed, are now stretched out.

I should therefore distinguish the simple changes of position without change of attitude, and the changes of attitude. Both would appear to me under form of muscular sensations. How then am I led to distinguish them? It is that the first may serve to correct an external change, and that the others can not, or at least can only give an imperfect correction.

I should therefore differentiate between simple changes in position without a change in attitude and changes in attitude itself. Both seem to me to manifest as muscle sensations. So how do I differentiate between them? It's because the first can help correct an external change, while the others cannot, or at least can only provide an incomplete correction.

This fact I proceed to explain as I would explain it to some one who already knew geometry, but it need not thence be concluded that it is necessary already to know geometry to make this distinction; before knowing geometry I ascertain the fact (experimentally, so to speak), without being able to explain it. But merely to make the distinction between the two kinds of change, I do not need to explain the fact, it suffices me to ascertain it.

This fact I will explain as if I were speaking to someone who already understands geometry, but that doesn't mean you have to know geometry to recognize this distinction. Before I understood geometry, I figured this fact out (in a way, through experience), even though I couldn't explain it. However, just to distinguish between the two types of change, I don't need to explain the fact; it's enough for me to ascertain it.

However that may be, the explanation is easy. Suppose that[Pg 251] an exterior object is displaced; if we wish the different parts of our body to resume with regard to this object their initial relative position, it is necessary that these different parts should have resumed likewise their initial relative position with regard to one another. Only the internal changes which satisfy this latter condition will be capable of correcting the external change produced by the displacement of that object. If, therefore, the relative position of my eye with regard to my finger has changed, I shall still be able to replace the eye in its initial relative situation with regard to the object and reestablish thus the primitive visual sensations, but then the relative position of the finger with regard to the object will have changed and the tactile sensations will not be reestablished.

However that may be, the explanation is simple. Suppose that[Pg 251] an outside object is moved; if we want the different parts of our body to return to their original relative positions regarding this object, it's necessary for these different parts to also return to their original relative positions with each other. Only the internal changes that satisfy this condition can correct the external change caused by the movement of that object. So, if the relative position of my eye to my finger has changed, I can still place my eye back in its original position relative to the object and thus restore the original visual sensations, but then the relative position of my finger to the object will have changed, and the tactile sensations will not be restored.

2. We ascertain likewise that the same external change may be corrected by two internal changes corresponding to different muscular sensations. Here again I can ascertain this without knowing geometry; and I have no need of anything else; but I proceed to give the explanation of the fact, employing geometrical language. To go from the position A to the position B I may take several routes. To the first of these routes will correspond a series S of muscular sensations; to a second route will correspond another series S´´, of muscular sensations which generally will be completely different, since other muscles will be used.

2. We also find that the same external change can be adjusted by two internal changes linked to different muscular sensations. Once again, I can figure this out without knowing geometry; I don’t need anything else. However, I’m going to explain the fact using geometrical terms. To move from position A to position B, I can take several paths. The first path corresponds to a series S of muscular sensations; the second path corresponds to another series S´´, of muscular sensations that will generally be completely different since different muscles will be used.

How am I led to regard these two series S and S´´ as corresponding to the same displacement AB? It is because these two series are capable of correcting the same external change. Apart from that, they have nothing in common.

How do I come to see these two series S and S´´ as corresponding to the same displacement AB? It's because both series can correct the same external change. Other than that, they have nothing else in common.

Let us now consider two external changes: α and β, which shall be, for instance, the rotation of a sphere half blue, half red, and that of a sphere half yellow, half green; these two changes have nothing in common, since the one is for us the passing of blue into red and the other the passing of yellow into green. Consider, on the other hand, two series of internal changes S and S´´; like the others, they will have nothing in common. And yet I say that α and β correspond to the same displacement, and that S and S´´ correspond also to the same displacement. why? Simply because S can correct α as well as β and because α can be corrected by S´´ as well as by S. And then a question suggests itself:[Pg 252]

Let’s think about two external changes: α and β, which could be the rotation of a sphere that’s half blue and half red, and the rotation of another sphere that’s half yellow and half green. These two changes are completely different because one represents blue turning into red, while the other represents yellow turning into green. Now, if we consider two series of internal changes S and S´´, they also have nothing in common. Yet, I claim that α and β correspond to the same displacement, and that S and S´´ also correspond to the same displacement. Why is that? Simply because S can correct both α and β, and α can be corrected by S´´ just as well as by S. This raises a question:[Pg 252]

If I have ascertained that S corrects α and β and that S´´ corrects α, am I certain that S´´ likewise corrects β? Experiment alone can teach us whether this law is verified. If it were not verified, at least approximately, there would be no geometry, there would be no space, because we should have no more interest in classifying the internal and external changes as I have just done, and, for instance, in distinguishing changes of state from changes of position.

If I’ve figured out that S fixes α and β, and that S´´ fixes α, can I be sure that S´´ also fixes β? Only experiments can show us whether this principle holds true. If it didn’t hold true, at least not closely enough, there would be no geometry, no space, because we wouldn’t care about classifying the internal and external changes as I just did, like distinguishing between changes of state and changes of position.

It is interesting to see what has been the rôle of experience in all this. It has shown me that a certain law is approximately verified. It has not told me how space is, and that it satisfies the condition in question. I knew, in fact, before all experience, that space satisfied this condition or that it would not be; nor have I any right to say that experience told me that geometry is possible; I very well see that geometry is possible, since it does not imply contradiction; experience only tells me that geometry is useful.

It’s interesting to see what role experience has played in all this. It has shown me that a certain law is almost verified. However, it hasn’t explained how space is, or that it meets the condition in question. In fact, I knew, even before any experience, that space met this condition or else it wouldn’t exist; nor do I have any right to say that experience confirmed for me that geometry is possible; I clearly see that geometry is possible since it doesn’t imply contradiction; experience only tells me that geometry is useful.

6. Visual Space

Although motor impressions have had, as I have just explained, an altogether preponderant influence in the genesis of the notion of space, which never would have taken birth without them, it will not be without interest to examine also the rôle of visual impressions and to investigate how many dimensions 'visual space' has, and for that purpose to apply to these impressions the definition of § 3.

Although motor impressions have significantly influenced the development of the concept of space, which wouldn’t have emerged without them, it’s also interesting to look at the role of visual impressions and explore how many dimensions 'visual space' has. To do this, we will apply the definition from § 3 to these impressions.

A first difficulty presents itself: consider a red color sensation affecting a certain point of the retina; and on the other hand a blue color sensation affecting the same point of the retina. It is necessary that we have some means of recognizing that these two sensations, qualitatively different, have something in common. Now, according to the considerations expounded in the preceding paragraph, we have been able to recognize this only by the movements of the eye and the observations to which they have given rise. If the eye were immovable, or if we were unconscious of its movements, we should not have been able to recognize that these two sensations, of different quality, had something in common; we should not have been able to disengage from them what[Pg 253] gives them a geometric character. The visual sensations, without the muscular sensations, would have nothing geometric, so that it may be said there is no pure visual space.

A first challenge arises: think about a red color sensation impacting a specific spot on the retina, and a blue color sensation at that same spot on the retina. We need a way to recognize that these two sensations, which are qualitatively different, share something in common. As discussed in the previous paragraph, we can only recognize this through the movements of the eye and the observations that come from them. If the eye were motionless, or if we were unaware of its movements, we wouldn’t be able to see that these two sensations, though different in quality, have something in common; we wouldn’t be able to extract from them what[Pg 253] gives them a geometric aspect. Visual sensations, without the sensations of movement, would lack any geometric quality, meaning that there is no such thing as pure visual space.

To do away with this difficulty, consider only sensations of the same nature, red sensations, for instance, differing one from another only as regards the point of the retina that they affect. It is clear that I have no reason for making such an arbitrary choice among all the possible visual sensations, for the purpose of uniting in the same class all the sensations of the same color, whatever may be the point of the retina affected. I should never have dreamt of it, had I not before learned, by the means we have just seen, to distinguish changes of state from changes of position, that is, if my eye were immovable. Two sensations of the same color affecting two different parts of the retina would have appeared to me as qualitatively distinct, just as two sensations of different color.

To eliminate this issue, let’s only consider sensations of the same type, like red sensations, that differ only in the point of the retina they impact. It's clear that I have no reason to make such an arbitrary choice among all possible visual sensations to group together all sensations of the same color, regardless of which part of the retina is involved. I would never have thought of it if I hadn’t already learned, through the methods we just discussed, to differentiate between changes in state and changes in position, assuming my eye remains still. Two sensations of the same color affecting two different areas of the retina would have seemed qualitatively different to me, just like sensations of different colors.

In restricting myself to red sensations, I therefore impose upon myself an artificial limitation and I neglect systematically one whole side of the question; but it is only by this artifice that I am able to analyze visual space without mingling any motor sensation.

By limiting myself to red sensations, I'm setting an artificial boundary for myself, and I'm systematically ignoring one entire aspect of the issue. However, it's only through this trick that I'm able to analyze visual space without mixing in any physical sensations.

Imagine a line traced on the retina and dividing in two its surface; and set apart the red sensations affecting a point of this line, or those differing from them too little to be distinguished from them. The aggregate of these sensations will form a sort of cut that I shall call C, and it is clear that this cut suffices to divide the manifold of possible red sensations, and that if I take two red sensations affecting two points situated on one side and the other of the line, I can not pass from one of these sensations to the other in a continuous way without passing at a certain moment through a sensation belonging to the cut.

Imagine a line drawn on the retina that splits its surface in two; now, separate the red sensations affecting a point on this line, or those that are too similar to be distinguished from them. The collection of these sensations will create a sort of division that I'll call C. It's clear that this division is enough to separate the various possible red sensations. If I take two red sensations that affect points on either side of the line, I can't move from one sensation to the other continuously without encountering a sensation that belongs to the division at some point.

If, therefore, the cut has n dimensions, the total manifold of my red sensations, or if you wish, the whole visual space, will have n + 1.

If the cut has n dimensions, then the total set of my red sensations, or the entire visual space, will have n + 1.

Now, I distinguish the red sensations affecting a point of the cut C. The assemblage of these sensations will form a new cut . It is clear that this will divide the cut C, always giving to the word divide the same meaning.[Pg 254]

Now, I can identify the red sensations affecting a point of the cut C. The combination of these sensations will create a new cut . It's obvious that this will split the cut C, always keeping the word divide in the same sense.[Pg 254]

If, therefore, the cut has n dimensions, the cut C will have n + 1 and the whole of visual space n + 2.

If the cut has n dimensions, then the cut C will have n + 1 dimensions, and the entire visual space will have n + 2 dimensions.

If all the red sensations affecting the same point of the retina were regarded as identical, the cut reducing to a single element would have 0 dimensions, and visual space would have 2.

If all the red sensations hitting the same spot on the retina were considered the same, the cut would be reduced to a single element with 0 dimensions, and visual space would have 2.

And yet most often it is said that the eye gives us the sense of a third dimension, and enables us in a certain measure to recognize the distance of objects. When we seek to analyze this feeling, we ascertain that it reduces either to the consciousness of the convergence of the eyes, or to that of the effort of accommodation which the ciliary muscle makes to focus the image.

And yet it's often said that our eyes provide us with a sense of depth and help us gauge the distance of objects. When we try to break down this feeling, we realize it comes down to either being aware of how our eyes converge or the effort our ciliary muscle makes to focus the image.

Two red sensations affecting the same point of the retina will therefore be regarded as identical only if they are accompanied by the same sensation of convergence and also by the same sensation of effort of accommodation or at least by sensations of convergence and accommodation so slightly different as to be indistinguishable.

Two red sensations hitting the same spot on the retina will only be considered the same if they are accompanied by the same feeling of convergence and the same feeling of accommodation effort, or at least by sensations of convergence and accommodation that are so slightly different they can't be distinguished.

On this account the cut is itself a continuum and the cut C has more than one dimension.

On this basis, the cut is a continuum itself, while the cut C has multiple dimensions.

But it happens precisely that experience teaches us that when two visual sensations are accompanied by the same sensation of convergence, they are likewise accompanied by the same sensation of accommodation. If then we form a new cut C´´ with all those of the sensations of the cut , which are accompanied by a certain sensation of convergence, in accordance with the preceding law they will all be indistinguishable and may be regarded as identical. Therefore C´´ will not be a continuum and will have 0 dimension; and as C´´ divides it will thence result that has one, C two and the whole visual space three dimensions.

But experience teaches us that when two visual sensations are accompanied by the same feeling of convergence, they are also linked to the same feeling of accommodation. If we create a new cut C´´ using all the sensations from the cut that come with a specific sensation of convergence, according to the earlier rule, they will all be indistinguishable and can be considered identical. Therefore, C´´ will not be a continuum and will have 0 dimensions; since C´´ divides , it follows that has one dimension, C has two dimensions, and the whole visual space has three dimensions.

But would it be the same if experience had taught us the contrary and if a certain sensation of convergence were not always accompanied by the same sensation of accommodation? In this case two sensations affecting the same point of the retina and accompanied by the same sense of convergence, two sensations which consequently would both appertain to the cut C´´, could nevertheless be distinguished since they would be accompanied by two different sensations of accommodation. Therefore C´´ would be in its turn a continuum and would have one dimension (at[Pg 255] least); then would have two, C three and the whole visual space would have four dimensions.

But would it be the same if our experiences had taught us differently and if a certain feeling of convergence didn't always come with the same feeling of accommodation? In this case, two sensations affecting the same spot on the retina and accompanied by the same sense of convergence—two sensations that would both relate to C´´—could still be distinguished because they would be paired with two different sensations of accommodation. Therefore, C´´ would in turn be a continuum and would have one dimension (at[Pg 255] least); then would have two, C three, and the whole visual space would have four dimensions.

Will it then be said that it is experience which teaches us that space has three dimensions, since it is in setting out from an experimental law that we have come to attribute three to it? But we have therein performed, so to speak, only an experiment in physiology; and as also it would suffice to fit over the eyes glasses of suitable construction to put an end to the accord between the feelings of convergence and of accommodation, are we to say that putting on spectacles is enough to make space have four dimensions and that the optician who constructed them has given one more dimension to space? Evidently not; all we can say is that experience has taught us that it is convenient to attribute three dimensions to space.

Will it then be said that experience teaches us that space has three dimensions, since we came to attribute three to it based on an experimental law? But what we have done is essentially just a physiological experiment; similarly, if we were to wear specially designed glasses to eliminate the alignment between our feelings of convergence and accommodation, would we say that wearing glasses is enough to make space have four dimensions and that the optician who made them has added another dimension to space? Clearly not; all we can say is that experience has taught us it’s convenient to attribute three dimensions to space.

But visual space is only one part of space, and in even the notion of this space there is something artificial, as I have explained at the beginning. The real space is motor space and this it is that we shall examine in the following chapter.

But visual space is just one aspect of space, and even the idea of this space has some artificiality, as I mentioned at the start. The true space is motor space, and this is what we will explore in the next chapter.


CHAPTER IV

Space and its 3 Dimensions

1. The Group of Displacements

Let us sum up briefly the results obtained. We proposed to investigate what was meant in saying that space has three dimensions and we have asked first what is a physical continuum and when it may be said to have n dimensions. If we consider different systems of impressions and compare them with one another, we often recognize that two of these systems of impressions are indistinguishable (which is ordinarily expressed in saying that they are too close to one another, and that our senses are too crude, for us to distinguish them) and we ascertain besides that two of these systems can sometimes be discriminated from one another though indistinguishable from a third system. In that case we say the manifold of these systems of impressions forms a physical continuum C. And each of these systems is called an element of the continuum C.

Let’s briefly summarize the results we’ve obtained. We set out to explore what it means to say that space has three dimensions, and we first asked what a physical continuum is and when it can be said to have n dimensions. When we consider different systems of impressions and compare them, we often find that two of these systems are indistinguishable (which usually means they are too similar for our senses to tell apart), and we also discover that sometimes two of these systems can be told apart from each other, even if they can’t be distinguished from a third system. In that situation, we say that the collection of these systems of impressions forms a physical continuum C, with each system referred to as an element of the continuum C.

How many dimensions has this continuum? Take first two elements A and B of C, and suppose there exists a series Σ of elements, all belonging to the continuum C, of such a sort that A and B are the two extreme terms of this series and that each term of the series is indistinguishable from the preceding. If such a series Σ can be found, we say that A and B are joined to one another; and if any two elements of C are joined to one another, we say that C is all of one piece.

How many dimensions does this continuum have? Let's take the first two elements A and B from C, and assume there’s a series Σ of elements, all part of the continuum C, such that A and B are the two endpoints of this series and each term in the series is indistinguishable from the one before it. If we can find such a series Σ, we say that A and B are connected to each other; and if any two elements of C are connected to one another, we say that C is entirely unified.

Now take on the continuum C a certain number of elements in a way altogether arbitrary. The aggregate of these elements will be called a cut. Among the various series Σ which join A to B, we shall distinguish those of which an element is indistinguishable from one of the elements of the cut (we shall say that these are they which cut the cut) and those of which all the elements are distinguishable from all those of the cut. If all the series Σ which join A to B cut the cut, we shall say that A and B are[Pg 257] separated by the cut, and that the cut divides C. If we can not find on C two elements which are separated by the cut, we shall say that the cut does not divide C.

Now consider the continuum C with a certain number of elements in a completely arbitrary way. The collection of these elements will be called a cut. Among the various series Σ that connect A and B, we will identify those where an element is indistinguishable from one of the elements of the cut (we will refer to these as those that cut the cut) and those where all the elements are distinguishable from every element of the cut. If all the series Σ connecting A to B cut the cut, we will say that A and B are [Pg 257] separated by the cut, and that the cut divides C. If we cannot find two elements on C that are separated by the cut, we will say that the cut does not divide C.

These definitions laid down, if the continuum C can be divided by cuts which do not themselves form a continuum, this continuum C has only one dimension; in the contrary case it has several. If a cut forming a continuum of 1 dimension suffices to divide C, C will have 2 dimensions; if a cut forming a continuum of 2 dimensions suffices, C will have 3 dimensions, etc. Thanks to these definitions, we can always recognize how many dimensions any physical continuum has. It only remains to find a physical continuum which is, so to speak, equivalent to space, of such a sort that to every point of space corresponds an element of this continuum, and that to points of space very near one another correspond indistinguishable elements. Space will have then as many dimensions as this continuum.

These definitions established, if the continuum C can be divided by cuts that do not themselves create a continuum, then this continuum C has only one dimension; otherwise, it has multiple dimensions. If a single cut that creates a 1-dimensional continuum is enough to divide C, then C will have 2 dimensions; if a cut that creates a 2-dimensional continuum is enough, C will have 3 dimensions, and so on. Thanks to these definitions, we can always determine how many dimensions any physical continuum has. We just need to find a physical continuum that is, in a way, equivalent to space, such that each point in space corresponds to an element of this continuum, and points in space that are very close to each other correspond to indistinguishable elements. Therefore, space will have as many dimensions as this continuum.

The intermediation of this physical continuum, capable of representation, is indispensable; because we can not represent space to ourselves, and that for a multitude of reasons. Space is a mathematical continuum, it is infinite, and we can represent to ourselves only physical continua and finite objects. The different elements of space, which we call points, are all alike, and, to apply our definition, it is necessary that we know how to distinguish the elements from one another, at least if they are not too close. Finally absolute space is nonsense, and it is necessary for us to begin by referring space to a system of axes invariably bound to our body (which we must always suppose put back in the initial attitude).

The way we understand this physical continuum, which we can represent, is essential; we cannot visualize space ourselves, and there are many reasons for that. Space is a mathematical continuum; it’s infinite, and we can only conceive of physical continua and finite objects. The various elements of space, which we call points, are all the same, and to apply our definition, we need to know how to differentiate the elements from one another, at least if they aren’t too close together. Ultimately, absolute space is meaningless, and we must start by relating space to a system of axes that is always tied to our body (which we must always assume is positioned in the original stance).

Then I have sought to form with our visual sensations a physical continuum equivalent to space; that certainly is easy and this example is particularly appropriate for the discussion of the number of dimensions; this discussion has enabled us to see in what measure it is allowable to say that 'visual space' has three dimensions. Only this solution is incomplete and artificial. I have explained why, and it is not on visual space but on motor space that it is necessary to bring our efforts to bear. I have then recalled what is the origin of the distinction we make between[Pg 258] changes of position and changes of state. Among the changes which occur in our impressions, we distinguish, first the internal changes, voluntary and accompanied by muscular sensations, and the external changes, having opposite characteristics. We ascertain that it may happen that an external change may be corrected by an internal change which reestablishes the primitive sensations. The external changes, capable of being corrected by an internal change are called changes of position, those not capable of it are called changes of state. The internal changes capable of correcting an external change are called displacements of the whole body; the others are called changes of attitude.

Then I’ve tried to create a physical continuum with our visual sensations that’s similar to space; that’s definitely simple, and this example is especially relevant for discussing the number of dimensions. This discussion has helped us understand to what extent we can claim that ‘visual space’ has three dimensions. However, this solution is incomplete and somewhat artificial. I’ve explained why, and it’s crucial to focus our efforts not on visual space but on motor space. I then recalled the origin of the distinction we make between[Pg 258]changes in position and changes in state. Among the changes that happen in our impressions, we first distinguish between the internal changes, which are voluntary and come with muscular sensations, and the external changes, which have opposite characteristics. We find that sometimes an external change can be corrected by an internal change that restores the original sensations. The external changes that can be corrected by an internal change are called changes of position; those that can’t be corrected are called changes of state. The internal changes that can correct an external change are called displacements of the whole body; the others are referred to as changes of attitude.

Now let α and β be two external changes, α´ and β´ two internal changes. Suppose that a may be corrected either by α´ or by β', and that α´ can correct either α or β; experience tells us then that β´ can likewise correct β. In this case we say that α and β correspond to the same displacement and also that α´ and β´ correspond to the same displacement. That postulated, we can imagine a physical continuum which we shall call the continuum or group of displacements and which we shall define in the following manner. The elements of this continuum shall be the internal changes capable of correcting an external change. Two of these internal changes α´ and β´ shall be regarded as indistinguishable: (1) if they are so naturally, that is, if they are too close to one another; (2) if α´ is capable of correcting the same external change as a third internal change naturally indistinguishable from β'. In this second case, they will be, so to speak, indistinguishable by convention, I mean by agreeing to disregard circumstances which might distinguish them.

Now let α and β be two external changes, and α´ and β´ be two internal changes. Suppose that α can be corrected either by α´ or by β', and that α´ can correct either α or β; experience tells us that β´ can also correct β. In this case, we say that α and β correspond to the same displacement, and that α´ and β´ also correspond to the same displacement. Assuming that, we can imagine a physical continuum which we will call the continuum or group of displacements and we will define it in the following way. The elements of this continuum will be the internal changes capable of correcting an external change. Two of these internal changes, α´ and β´, will be considered indistinguishable: (1) if they are naturally indistinguishable, meaning they are too close to each other; (2) if α´ can correct the same external change as a third internal change that is naturally indistinguishable from β'. In this second case, they will be, so to speak, indistinguishable by convention, meaning by agreeing to ignore circumstances that might distinguish them.

Our continuum is now entirely defined, since we know its elements and have fixed under what conditions they may be regarded as indistinguishable. We thus have all that is necessary to apply our definition and determine how many dimensions this continuum has. We shall recognize that it has six. The continuum of displacements is, therefore, not equivalent to space, since the number of dimensions is not the same; it is only related to space. Now how do we know that this continuum of displacements has six dimensions? We know it by experience.

Our continuum is fully defined now that we understand its elements and have established the conditions under which they can be seen as indistinguishable. We have everything we need to apply our definition and figure out how many dimensions this continuum has. We will recognize that it has six. Therefore, the continuum of displacements is not the same as space, since the number of dimensions differs; it only relates to space. So, how do we know this continuum of displacements has six dimensions? We know it from experience.

It would be easy to describe the experiments by which we[Pg 259] could arrive at this result. It would be seen that in this continuum cuts can be made which divide it and which are continua; that these cuts themselves can be divided by other cuts of the second order which yet are continua, and that this would stop only after cuts of the sixth order which would no longer be continua. From our definitions that would mean that the group of displacements has six dimensions.

It would be straightforward to outline the experiments that led us to this result. It would show that within this continuum, we can make cuts that divide it, and those cuts are still continua; that these cuts can also be subdivided by other cuts of the second order, which remain continua, and this process would continue only until we reach cuts of the sixth order, which would no longer be continua. According to our definitions, this would indicate that the group of displacements has six dimensions.

That would be easy, I have said, but that would be rather long; and would it not be a little superficial? This group of displacements, we have seen, is related to space, and space could be deduced from it, but it is not equivalent to space, since it has not the same number of dimensions; and when we shall have shown how the notion of this continuum can be formed and how that of space may be deduced from it, it might always be asked why space of three dimensions is much more familiar to us than this continuum of six dimensions, and consequently doubted whether it was by this detour that the notion of space was formed in the human mind.

That would be easy, I said, but it would take quite a while; and wouldn’t it be a bit shallow? This group of displacements, as we have seen, is connected to space, and we could derive space from it, but it's not the same as space since it doesn't have the same number of dimensions. When we show how we can form this idea of a continuum and how we can derive the concept of space from it, one might still wonder why three-dimensional space is so much more familiar to us than this six-dimensional continuum. This leads to questioning whether it was actually through this roundabout way that the idea of space developed in the human mind.

2. Identity of Two Points

What is a point? How do we know whether two points of space are identical or different? Or, in other words, when I say: The object A occupied at the instant α the point which the object B occupies at the instant β, what does that mean?

What is a point? How can we tell if two points in space are the same or different? In other words, when I say: The object A occupied, at the moment α, the point that the object B occupies at the moment β, what does that mean?

Such is the problem we set ourselves in the preceding chapter, §4. As I have explained it, it is not a question of comparing the positions of the objects A and B in absolute space; the question then would manifestly have no meaning. It is a question of comparing the positions of these two objects with regard to axes invariably bound to my body, supposing always this body replaced in the same attitude.

Such is the problem we discussed in the previous chapter, §4. As I explained, it's not about comparing the positions of objects A and B in absolute space; that wouldn't make any sense. It's about comparing the positions of these two objects in relation to axes that are always tied to my body, assuming that my body is always in the same position.

I suppose that between the instants α and β I have moved neither my body nor my eye, as I know from my muscular sense. Nor have I moved either my head, my arm or my hand. I ascertain that at the instant α impressions that I attributed to the object A were transmitted to me, some by one of the fibers of my optic nerve, the others by one of the sensitive tactile nerves of my finger; I ascertain that at the instant β other impressions which I attribute to the object B are transmitted to me, some by[Pg 260] this same fiber of the optic nerve, the others by this same tactile nerve.

I think that between moments α and β, I haven't moved my body or my eyes, as I can tell from my muscle sense. I also haven't moved my head, my arm, or my hand. I can see that at moment α, the impressions I connected to object A were sent to me, some through one of the fibers of my optic nerve and others through one of the sensory touch nerves in my finger; I can see that at moment β, different impressions that I connect to object B are sent to me, some through the same fiber of the optic nerve and others through the same touch nerve.

Here I must pause for an explanation; how am I told that this impression which I attribute to A, and that which I attribute to B, impressions which are qualitatively different, are transmitted to me by the same nerve? Must we suppose, to take for example the visual sensations, that A produces two simultaneous sensations, a sensation purely luminous a and a colored sensation , that B produces in the same way simultaneously a luminous sensation b and a colored sensation , that if these different sensations are transmitted to me by the same retinal fiber, a is identical with b, but that in general the colored sensations and produced by different bodies are different? In that case it would be the identity of the sensation a which accompanies with the sensation b which accompanies , which would tell that all these sensations are transmitted to me by the same fiber.

Here I need to pause for an explanation: how is it that the impression I associate with A, and the one I associate with B, which are qualitatively different, are transmitted to me by the same nerve? Should we assume, for example, with visual sensations, that A creates two simultaneous sensations: a purely luminous sensation a and a colored sensation ? Similarly, does B produce a luminous sensation b and a colored sensation at the same time? If these different sensations are conveyed to me by the same retinal fiber, then a is the same as b, but generally the colored sensations and created by different objects are distinct. In that case, it would be the identity of the sensation a that accompanies with the sensation b that accompanies that indicates all these sensations are transmitted to me by the same fiber.

However it may be with this hypothesis and although I am led to prefer to it others considerably more complicated, it is certain that we are told in some way that there is something in common between these sensations a + and b +, without which we should have no means of recognizing that the object B has taken the place of the object A.

However it may be with this hypothesis, and even though I tend to prefer other, more complicated ones, it's clear that there is something shared between these sensations a + and b + . Without this commonality, we would have no way of recognizing that object B has replaced object A.

Therefore I do not further insist and I recall the hypothesis I have just made: I suppose that I have ascertained that the impressions which I attribute to B are transmitted to me at the instant β by the same fibers, optic as well as tactile, which, at the instant α, had transmitted to me the impressions that I attributed to A. If it is so, we shall not hesitate to declare that the point occupied by B at the instant β is identical with the point occupied by A at the instant α.

Therefore, I won’t insist any further, and I’ll revisit the assumption I just made: I believe I’ve determined that the impressions I attribute to B are received by me at time β through the same fibers, both optical and tactile, that, at time α, transmitted the impressions I attributed to A. If that’s the case, we can confidently say that the position held by B at time β is the same as the position held by A at time α.

I have just enunciated two conditions for these points being identical; one is relative to sight, the other to touch. Let us consider them separately. The first is necessary, but is not sufficient. The second is at once necessary and sufficient. A person knowing geometry could easily explain this in the following manner: Let O be the point of the retina where is formed at the instant α the image of the body A; let M be the point of space occupied at the instant α by this body A; let be the point of[Pg 261] space occupied at the instant β by the body B. For this body B to form its image in O, it is not necessary that the points M and coincide; since vision acts at a distance, it suffices for the three points O M to be in a straight line. This condition that the two objects form their image on O is therefore necessary, but not sufficient for the points M and to coincide. Let now P be the point occupied by my finger and where it remains, since it does not budge. As touch does not act at a distance, if the body A touches my finger at the instant α, it is because M and P coincide; if B touches my finger at the instant β, it is because and P coincide. Therefore M and coincide. Thus this condition that if A touches my finger at the instant α, B touches it at the instant β, is at once necessary and sufficient for M and to coincide.

I have just outlined two conditions for these points to be the same; one relates to sight, and the other to touch. Let's consider them one at a time. The first is necessary, but not enough on its own. The second is both necessary and sufficient. A person familiar with geometry could explain this like so: Let O be the point on the retina where the image of the object A is formed at time α; let M be the location in space occupied by object A at time α; and let be the location in space occupied by object B at time β. For object B to create its image at O, it’s not necessary for points M and to overlap; since vision works at a distance, it’s enough for the three points O, M, and to be in a straight line. So, the condition that both objects form their image at O is necessary but not sufficient for the points M and to overlap. Now let P be the point where my finger is, and it stays there since it doesn’t move. Since touch doesn’t work at a distance, if object A touches my finger at time α, it’s because M and P overlap; if B touches my finger at time β, it’s because and P overlap. Therefore, M and must overlap. So, the condition that if A touches my finger at time α, and B touches it at time β, is both necessary and sufficient for M and to overlap.

But we who, as yet, do not know geometry can not reason thus; all that we can do is to ascertain experimentally that the first condition relative to sight may be fulfilled without the second, which is relative to touch, but that the second can not be fulfilled without the first.

But we who, as of now, don’t know geometry can’t reason like that; all we can do is experimentally determine that the first condition related to sight can be met without the second condition, which is related to touch, but the second cannot be met without the first.

Suppose experience had taught us the contrary, as might well be; this hypothesis contains nothing absurd. Suppose, therefore, that we had ascertained experimentally that the condition relative to touch may be fulfilled without that of sight being fulfilled and that, on the contrary, that of sight can not be fulfilled without that of touch being also. It is clear that if this were so we should conclude that it is touch which may be exercised at a distance, and that sight does not operate at a distance.

Suppose experience had shown us the opposite, which is definitely possible; this idea isn’t absurd. So, let’s say we found out through experiments that the requirements for touch can be met without those for sight being met, while, on the other hand, the requirements for sight cannot be met without those for touch being met as well. It's clear that if this were true, we would conclude that touch can happen at a distance, and that sight does not work at a distance.

But this is not all; up to this time I have supposed that to determine the place of an object I have made use only of my eye and a single finger; but I could just as well have employed other means, for example, all my other fingers.

But that's not all; until now I've thought that to find the location of an object, I’ve only used my eye and one finger; but I could have just as easily used other methods, like all my other fingers.

I suppose that my first finger receives at the instant α a tactile impression which I attribute to the object A. I make a series of movements, corresponding to a series S of muscular sensations. After these movements, at the instant α', my second finger receives a tactile impression that I attribute likewise to A. Afterward, at the instant β, without my having budged, as my muscular sense tells me, this same second finger transmits to me[Pg 262] anew a tactile impression which I attribute this time to the object B; I then make a series of movements, corresponding to a series of muscular sensations. I know that this series is the inverse of the series S and corresponds to contrary movements. I know this because many previous experiences have shown me that if I made successively the two series of movements corresponding to S and to , the primitive impressions would be reestablished, in other words, that the two series mutually compensate. That settled, should I expect that at the instant β', when the second series of movements is ended, my first finger would feel a tactile impression attributable to the object B?

I think that my first finger gets a tactile sensation at time α that I connect to the object A. I then perform a set of movements that correspond to a set S of muscle sensations. After these movements, at time α', my second finger feels a tactile sensation that I also attribute to A. Later, at time β, without moving, as my muscle sense indicates, this same second finger sends me[Pg 262] another tactile sensation that I attribute this time to the object B; I then perform a new set of movements that correspond to a new set of muscle sensations. I understand that this set is the opposite of the set S and corresponds to opposite movements. I know this because from past experiences, I’ve learned that if I perform both sets of movements, S and , the original sensations will be restored, meaning that the two sets balance each other out. With that in mind, should I expect that at time β', when the second set of movements is done, my first finger would feel a tactile sensation connected to the object B?

To answer this question, those already knowing geometry would reason as follows: There are chances that the object A has not budged, between the instants α and α', nor the object B between the instants β and β'; assume this. At the instant α, the object A occupied a certain point M of space. Now at this instant it touched my first finger, and as touch does not operate at a distance, my first finger was likewise at the point M. I afterward made the series S of movements and at the end of this series, at the instant α', I ascertained that the object A touched my second finger. I thence conclude that this second finger was then at M, that is, that the movements S had the result of bringing the second finger to the place of the first. At the instant β the object B has come in contact with my second finger: as I have not budged, this second finger has remained at M; therefore the object B has come to M; by hypothesis it does not budge up to the instant β'. But between the instants β and β' I have made the movements ; as these movements are the inverse of the movements S, they must have for effect bringing the first finger in the place of the second. At the instant β´ this first finger will, therefore, be at M; and as the object B is likewise at M, this object B will touch my first finger. To the question put, the answer should therefore be yes.

To answer this question, those who already understand geometry would think like this: There’s a possibility that object A hasn’t moved from the time between moments α and α', nor has object B from moments β and β'; let’s assume that. At moment α, object A was at a specific point M in space. At that moment, it touched my first finger, and since touch doesn’t happen at a distance, my first finger was also at point M. Later, I performed a series of movements S and at the end of this series, at moment α', I found that object A was touching my second finger. I conclude that this second finger was then at M, meaning the movements S effectively moved the second finger to the position of the first. At moment β, object B made contact with my second finger: since I haven’t moved, this second finger has stayed at M; thus, object B has arrived at M; by assumption, it hasn’t moved until moment β'. However, between moments β and β', I made the movements ; since these movements are the reverse of movements S, they must move the first finger to the spot of the second. At moment β’, the first finger will therefore be at M; and since object B is also at M, object B will touch my first finger. So, to the question asked, the answer should be yes.

We who do not yet know geometry can not reason thus; but we ascertain that this anticipation is ordinarily realized; and we can always explain the exceptions by saying that the object A has moved between the instants α and α', or the object B between the instants β and β'.[Pg 263]

We who don't yet know geometry can't reason like this; but we find that this expectation usually holds true; and we can always clarify the exceptions by saying that object A has moved between moments α and α', or object B between moments β and β'.[Pg 263]

But could not experience have given a contrary result? Would this contrary result have been absurd in itself? Evidently not. What should we have done then if experience had given this contrary result? Would all geometry thus have become impossible? Not the least in the world. We should have contented ourselves with concluding that touch can operate at a distance.

But could experience have led to a different outcome? Would that different outcome have been unreasonable? Clearly not. What would we have done if experience had provided this different outcome? Would all of geometry have become impossible? Not at all. We would have simply concluded that touch can operate at a distance.

When I say, touch does not operate at a distance, but sight operates at a distance, this assertion has only one meaning, which is as follows: To recognize whether B occupies at the instant β the point occupied by A at the instant α, I can use a multitude of different criteria. In one my eye intervenes, in another my first finger, in another my second finger, etc. Well, it is sufficient for the criterion relative to one of my fingers to be satisfied in order that all the others should be satisfied, but it is not sufficient that the criterion relative to the eye should be. This is the sense of my assertion. I content myself with affirming an experimental fact which is ordinarily verified.

When I say that touch doesn't work at a distance but sight does, I mean one thing: to determine if B is at the same spot that A was at an earlier time, I can use many different methods. In some cases, I rely on my eye, in others, I use my first finger, my second finger, and so on. For any one of these finger-based methods to be valid, it’s enough for just one to be satisfied for the others to be too. However, that isn’t the case with sight; the eye’s method alone isn't sufficient. This is what I mean. I’m just pointing out an experimental fact that can usually be observed.

At the end of the preceding chapter we analyzed visual space; we saw that to engender this space it is necessary to bring in the retinal sensations, the sensation of convergence and the sensation of accommodation; that if these last two were not always in accord, visual space would have four dimensions in place of three; we also saw that if we brought in only the retinal sensations, we should obtain 'simple visual space,' of only two dimensions. On the other hand, consider tactile space, limiting ourselves to the sensations of a single finger, that is in sum to the assemblage of positions this finger can occupy. This tactile space that we shall analyze in the following section and which consequently I ask permission not to consider further for the moment, this tactile space, I say, has three dimensions. Why has space properly so called as many dimensions as tactile space and more than simple visual space? It is because touch does not operate at a distance, while vision does operate at a distance. These two assertions have the same meaning and we have just seen what this is.

At the end of the last chapter, we analyzed visual space; we discovered that to create this space, it's necessary to include retinal sensations, the sensation of convergence, and the sensation of accommodation. If these last two aren't always in sync, visual space would have four dimensions instead of three. We also noted that if we only accounted for retinal sensations, we would only achieve 'simple visual space,' which has only two dimensions. On the other hand, let's look at tactile space, focusing on the sensations of a single finger, which essentially refers to all the positions this finger can occupy. This tactile space we'll analyze in the next section, so I’d like to set it aside for now, has three dimensions. Why does space, as we properly define it, have as many dimensions as tactile space and more than simple visual space? It's because touch doesn't work from a distance, while vision does. These two statements mean the same thing, and we have just seen what that is.

Now I return to a point over which I passed rapidly in order not to interrupt the discussion. How do we know that the impressions made on our retina by A at the instant α and B at the[Pg 264] instant β are transmitted by the same retinal fiber, although these impressions are qualitatively different? I have suggested a simple hypothesis, while adding that other hypotheses, decidedly more complex, would seem to me more probably true. Here then are these hypotheses, of which I have already said a word. How do we know that the impressions produced by the red object A at the instant α, and by the blue object B at the instant β, if these two objects have been imaged on the same point of the retina, have something in common? The simple hypothesis above made may be rejected and we may suppose that these two impressions, qualitatively different, are transmitted by two different though contiguous nervous fibers. What means have I then of knowing that these fibers are contiguous? It is probable that we should have none, if the eye were immovable. It is the movements of the eye which have told us that there is the same relation between the sensation of blue at the point A and the sensation of blue at the point B of the retina as between the sensation of red at the point A and the sensation of red at the point B. They have shown us, in fact, that the same movements, corresponding to the same muscular sensations, carry us from the first to the second, or from the third to the fourth. I do not emphasize these considerations, which belong, as one sees, to the question of local signs raised by Lotze.

Now I'm going back to a point I quickly skipped to keep the discussion flowing. How do we know that the images of A at moment α and B at moment β are sent through the same retinal fiber, even though these images are qualitatively different? I've proposed a simple hypothesis, but I think there are other, more complex hypotheses that are likely more accurate. So, here are those hypotheses, which I've already mentioned. How do we know that the impressions from the red object A at moment α and the blue object B at moment β, if these two objects are focused on the same part of the retina, have something in common? We might dismiss the simple hypothesis and assume these two distinct impressions are sent by two different but adjacent nerve fibers. What evidence do I have then to show that these fibers are adjacent? It seems we wouldn’t have any if the eye didn’t move. It's actually the movements of the eye that reveal the same relationship between the sensation of blue at point A and the sensation of blue at point B on the retina, just like the relationship between the sensation of red at point A and the sensation of red at point B. These movements have shown us that the same actions, linked to the same muscular feelings, take us from the first to the second, or from the third to the fourth. I don’t want to dwell on these thoughts, which relate, as you can see, to the issue of local signs raised by Lotze.

3. Tactile Space

Thus I know how to recognize the identity of two points, the point occupied by A at the instant α and the point occupied by B at the instant β, but only on one condition, namely, that I have not budged between the instants α and β. That does not suffice for our object. Suppose, therefore, that I have moved in any manner in the interval between these two instants, how shall I know whether the point occupied by A at the instant α is identical with the point occupied by B at the instant β? I suppose that at the instant α, the object A was in contact with my first finger and that in the same way, at the instant β, the object B touches this first finger; but at the same time my muscular sense has told me that in the interval my body has moved. I have considered above two series of muscular sensations S and , and[Pg 265] I have said it sometimes happens that we are led to consider two such series S and as inverse one of the other, because we have often observed that when these two series succeed one another our primitive impressions are reestablished.

So, I know how to identify the location of two points: the point where A is at time α and the point where B is at time β, but only if I haven't moved between times α and β. That's not enough for our purpose. Now, let's say I've moved in some way during the time between these two points; how can I tell if the point where A is at time α is the same as the point where B is at time β? I imagine that at time α, the object A was touching my first finger and that, similarly, at time β, the object B also touches this first finger; however, my muscle sense tells me that my body has moved in the meantime. I've discussed two sets of muscle sensations, S and , and[Pg 265] I've noted that sometimes we see these two sets S and as inverses of each other, because we've often found that when these two sets follow one another, our original impressions are restored.

If then my muscular sense tells me that I have moved between the two instants α and β, but so as to feel successively the two series of muscular sensations S and that I consider inverses, I shall still conclude, just as if I had not budged, that the points occupied by A at the instant α and by B at the instant β are identical, if I ascertain that my first finger touches A at the instant α, and B at the instant β.

If my sense of movement tells me that I've moved between the two moments α and β, but I still feel the two sets of muscle sensations S and that I see as opposites, I'll still conclude, just as if I hadn't moved, that the positions of A at moment α and B at moment β are the same, as long as I confirm that my index finger touches A at moment α and B at moment β.

This solution is not yet completely satisfactory, as one will see. Let us see, in fact, how many dimensions it would make us attribute to space. I wish to compare the two points occupied by A and B at the instants α and β, or (what amounts to the same thing since I suppose that my finger touches A at the instant α and B at the instant β) I wish to compare the two points occupied by my finger at the two instants α and β. The sole means I use for this comparison is the series Σ of muscular sensations which have accompanied the movements of my body between these two instants. The different imaginable series Σ form evidently a physical continuum of which the number of dimensions is very great. Let us agree, as I have done, not to consider as distinct the two series Σ and Σ + S + , when S and are inverses one of the other in the sense above given to this word; in spite of this agreement, the aggregate of distinct series Σ will still form a physical continuum and the number of dimensions will be less but still very great.

This solution isn’t fully satisfactory yet, as you’ll see. Let’s look at how many dimensions it would require us to attribute to space. I want to compare the two points occupied by A and B at moments α and β, or (which is the same since I assume my finger touches A at moment α and B at moment β) I want to compare the two points my finger occupies at those two moments. The only way I’m making this comparison is through the series Σ of muscle sensations that I experienced while moving my body between those two moments. The different possible series Σ clearly create a physical continuum with a very high number of dimensions. Let's agree, as I have, not to regard the two series Σ and Σ + S + as distinct when S and are inverses of each other as defined above; despite this agreement, the collection of distinct series Σ will still create a physical continuum, and the number of dimensions will be lower but still very large.

To each of these series Σ corresponds a point of space; to two series Σ and Σ´ thus correspond two points M and . The means we have hitherto used enable us to recognize that M and are not distinct in two cases: (1) if Σ is identical with Σ´; (2) if Σ´ = Σ + S + , S and being inverses one of the other. If in all the other cases we should regard M and as distinct, the manifold of points would have as many dimensions as the aggregate of distinct series Σ, that is, much more than three.

To each of these series Σ corresponds a point in space; to the two series Σ and Σ' correspond two points M and M'. The methods we have used so far allow us to see that M and M' are not distinct in two situations: (1) if Σ is the same as Σ'; (2) if Σ' = Σ + S + S', where S and S' are inverses of each other. If in all other cases we consider M and M' as distinct, the collection of points would have as many dimensions as the total number of distinct series Σ, which is much more than three.

For those who already know geometry, the following explanation would be easily comprehensible. Among the imaginable[Pg 266] series of muscular sensations, there are those which correspond to series of movements where the finger does not budge. I say that if one does not consider as distinct the series Σ and Σ + σ, where the series σ corresponds to movements where the finger does not budge, the aggregate of series will constitute a continuum of three dimensions, but that if one regards as distinct two series Σ and Σ´ unless Σ´ = Σ + S + , S and being inverses, the aggregate of series will constitute a continuum of more than three dimensions.

For those who already understand geometry, the following explanation will be easy to grasp. Among the countless[Pg 266] sequences of muscular sensations, there are those that match sequences of movements where the finger doesn’t move. I claim that if one does not view the sequences Σ and Σ + σ as separate, where the sequence σ corresponds to movements where the finger doesn’t move, the total of these sequences will form a three-dimensional continuum. However, if one considers two sequences Σ and Σ´ to be distinct unless Σ´ = Σ + S + , with S and being opposites, then the total of these sequences will create a continuum of more than three dimensions.

In fact, let there be in space a surface A, on this surface a line B, on this line a point M. Let C0 be the aggregate of all series Σ. Let C1 be the aggregate of all the series Σ, such that at the end of corresponding movements the finger is found upon the surface A, and C2 or C3 the aggregate of series Σ such that at the end the finger is found on B, or at M. It is clear, first that C1 will constitute a cut which will divide C0, that C2 will be a cut which will divide C1, and C3 a cut which will divide C2. Thence it results, in accordance with our definitions, that if C3 is a continuum of n dimensions, C0 will be a physical continuum of n + 3 dimensions.

In fact, let’s consider a surface A in space, a line B on this surface, and a point M on that line. Let C0 represent the total collection of all series Σ. Let C1 represent the collection of all series Σ, where at the end of the corresponding movements, the finger lands on surface A. Let C2 and C3 represent the collections of series Σ where the finger ends up on B or at M, respectively. It’s clear that C1 will create a division in C0, C2 will divide C1, and C3 will divide C2. Consequently, according to our definitions, if C3 is a continuum of n dimensions, then C0 will be a physical continuum of n + 3 dimensions.

Therefore, let Σ and Σ´ = Σ + σ be two series forming part of C3; for both, at the end of the movements, the finger is found at M; thence results that at the beginning and at the end of the series σ the finger is at the same point M. This series σ is therefore one of those which correspond to movements where the finger does not budge. If Σ and Σ + σ are not regarded as distinct, all the series of C3 blend into one; therefore C3 will have 0 dimension, and C0 will have 3, as I wished to prove. If, on the contrary, I do not regard Σ and Σ + σ as blending (unless σ = S + , S and being inverses), it is clear that C3 will contain a great number of series of distinct sensations; because, without the finger budging, the body may take a multitude of different attitudes. Then C3 will form a continuum and C0 will have more than three dimensions, and this also I wished to prove.

Therefore, let Σ and Σ' = Σ + σ be two series that are part of C3; for both, at the end of the movements, the finger ends up at M; this means that at the start and finish of the series σ, the finger is at the same point M. This series σ corresponds to movements where the finger doesn’t move. If Σ and Σ + σ are seen as the same, all the series of C3 merge into one; thus, C3 will have 0 dimensions, and C0 will have 3, which is what I aimed to show. On the other hand, if I do not consider Σ and Σ + σ as merging (unless σ = S + , with S and being opposites), it’s clear that C3 will consist of many series of distinct sensations; because, even without the finger moving, the body can adopt a wide range of different positions. Then, C3 will create a continuum and C0 will have more than three dimensions, which is also what I aimed to prove.

We who do not yet know geometry can not reason in this way; we can only verify. But then a question arises; how, before knowing geometry, have we been led to distinguish from the others these series σ where the finger does not budge? It is, in[Pg 267] fact, only after having made this distinction that we could be led to regard Σ and Σ + σ as identical, and it is on this condition alone, as we have just seen, that we can arrive at space of three dimensions.

We who don’t know geometry yet can’t reason like this; we can only confirm things. But then a question comes up: how, before learning geometry, were we able to distinguish these series σ where nothing changes? It’s actually only after making this distinction that we can see Σ and Σ + σ as the same, and it’s only on this condition, as we just observed, that we can reach three-dimensional space.

We are led to distinguish the series σ, because it often happens that when we have executed the movements which correspond to these series σ of muscular sensations, the tactile sensations which are transmitted to us by the nerve of the finger that we have called the first finger, persist and are not altered by these movements. Experience alone tells us that and it alone could tell us.

We need to recognize the series σ because it often occurs that after performing the actions related to these series σ of muscle feelings, the touch sensations sent to us by the nerve in the finger we refer to as the first finger remain unchanged and are not affected by these movements. Only experience reveals this, and it is the only one that can.

If we have distinguished the series of muscular sensations S + formed by the union of two inverse series, it is because they preserve the totality of our impressions; if now we distinguish the series σ, it is because they preserve certain of our impressions. (When I say that a series of muscular sensations S 'preserves' one of our impressions A, I mean that we ascertain that if we feel the impression A, then the muscular sensations S, we still feel the impression A after these sensations S.)

If we have identified the series of muscle sensations S + created by combining two opposite series, it’s because they capture the entirety of our experiences; now that we recognize the series σ, it’s because they capture some of our experiences. (When I say that a series of muscle sensations S 'captures' one of our experiences A, I mean that we can confirm that if we feel the experience A and then the muscle sensations S, we still feel the experience A after these sensations S.)

I have said above it often happens that the series σ do not alter the tactile impressions felt by our first finger; I said often, I did not say always. This it is that we express in our ordinary language by saying that the tactile impressions would not be altered if the finger has not moved, on the condition that neither has the object A, which was in contact with this finger, moved. Before knowing geometry, we could not give this explanation; all we could do is to ascertain that the impression often persists, but not always.

I mentioned earlier that it's common for the series σ not to change the tactile sensations experienced by our index finger; I said common, not always. This is what we mean in everyday language when we say that the tactile sensations wouldn’t change if the finger hasn’t moved, provided that neither has the object A, which was touching this finger, moved either. Before we learned geometry, we couldn’t explain this; we could only observe that the impression often remains, but not always.

But that the impression often continues is enough to make the series σ appear remarkable to us, to lead us to put in the same class the series Σ and Σ + σ, and hence not regard them as distinct. Under these conditions we have seen that they will engender a physical continuum of three dimensions.

But the fact that the impression often lasts is enough to make the series σ seem remarkable to us, leading us to categorize the series Σ and Σ + σ in the same group, and thus not see them as separate. In this context, we have observed that they will create a three-dimensional physical continuum.

Behold then a space of three dimensions engendered by my first finger. Each of my fingers will create one like it. It remains to consider how we are led to regard them as identical with visual space, as identical with geometric space.

Behold a three-dimensional space created by my index finger. Each of my fingers will create one like it. Now, we need to think about how we come to see them as the same as visual space, as the same as geometric space.

But one reflection before going further; according to the foregoing, we know the points of space, or more generally the final[Pg 268] situation of our body, only by the series of muscular sensations revealing to us the movements which have carried us from a certain initial situation to this final situation. But it is clear that this final situation will depend, on the one hand, upon these movements and, on the other hand, upon the initial situation from which we set out. Now these movements are revealed to us by our muscular sensations; but nothing tells us the initial situation; nothing can distinguish it for us from all the other possible situations. This puts well in evidence the essential relativity of space.

But before we proceed, let's consider this: based on what we've discussed so far, we understand our position in space, or more generally, the final position of our body, only through a series of muscular sensations that show us the movements that took us from a certain starting point to this final position. However, it’s clear that this final position depends, on one hand, on these movements and, on the other hand, on the initial position from which we began. Now, these movements are communicated to us through our muscular sensations, but nothing informs us about the initial position; nothing sets it apart from all the other possible positions. This highlights the fundamental relativity of space.

4. Identity of the Different Spaces

We are therefore led to compare the two continua C and engendered, for instance, one by my first finger D, the other by my second finger . These two physical continua both have three dimensions. To each element of the continuum C, or, if you prefer, to each point of the first tactile space, corresponds a series of muscular sensations Σ, which carry me from a certain initial situation to a certain final situation.[8] Moreover, the same point of this first space will correspond to Σ and Σ + σ, if σ is a series of which we know that it does not make the finger D move.

We are therefore led to compare the two continua C and generated, for example, one by my first finger D, and the other by my second finger . Both of these physical continua have three dimensions. For each element of the continuum C, or if you prefer, for each point in the first tactile space, there corresponds a series of muscle sensations Σ, which take me from a specific starting point to a specific ending point.[8] Additionally, the same point in this first space will correspond to Σ and Σ + σ, if σ is a series known to not cause finger D to move.

Similarly to each element of the continuum , or to each point of the second tactile space, corresponds a series of sensations Σ´, and the same point will correspond to Σ´ and to Σ´ + σ´, if σ´ is a series which does not make the finger move.

Similarly to each element of the continuum , or to each point of the second tactile space, there is a corresponding series of sensations Σ´. The same point will correspond to Σ´ and to Σ´ + σ´, provided that σ´ is a series that doesn't cause the finger to move.

What makes us distinguish the various series designated σ from those called σ´ is that the first do not alter the tactile impressions felt by the finger D and the second preserve those the finger feels.

What sets the different series labeled σ apart from those called σ´ is that the first ones do not change the sensations felt by the finger D, while the second ones maintain those that the finger experiences.

Now see what we ascertain: in the beginning my finger feels a sensation ; I make movements which produce muscular sensations S; my finger D feels the impression A; I make movements which produce a series of sensations σ; my finger D continues to feel the impression A, since this is the characteristic[Pg 269] property of the series σ; I then make movements which produce the series of muscular sensations, inverse to S in the sense above given to this word. I ascertain then that my finger feels anew the impression . (It is of course understood that S has been suitably chosen.)

Now let's see what we discover: at first, my finger experiences a sensation ; I make movements that create muscular sensations S; my finger D feels the impression A; I make movements that create a series of sensations σ; my finger D continues to feel the impression A, as this is the defining[Pg 269] feature of the series σ; I then make movements that create the series of muscular sensations, which are inverse to S in the sense previously mentioned. I then determine that my finger feels the impression again. (Of course, it is understood that S has been appropriately chosen.)

This means that the series S + σ + , preserving the tactile impressions of the finger , is one of the series I have called σ´. Inversely, if one takes any series σ´, + σ´ + S will be one of the series that we call σ´.

This means that the series S + σ + , keeping the tactile impressions of the finger , is one of the series I’ve referred to as σ´. Conversely, if you take any series σ´, + σ´ + S will be one of the series we call σ´.

Thus if S is suitably chosen, S + σ + will be a series σ´, and by making σ vary in all possible ways, we shall obtain all the possible series σ´.

Thus if S is chosen wisely, S + σ + will form a series σ´, and by allowing σ to change in every possible way, we will obtain all the possible series σ´.

Not yet knowing geometry, we limit ourselves to verifying all that, but here is how those who know geometry would explain the fact. In the beginning my finger is at the point M, in contact with the object a, which makes it feel the impression . I make the movements corresponding to the series S; I have said that this series should be suitably chosen, I should so make this choice that these movements carry the finger D to the point originally occupied by the finger , that is, to the point M; this finger D will thus be in contact with the object a, which will make it feel the impression A.

Not knowing geometry yet, we can only verify what we see, but here’s how someone who understands geometry would explain it. At first, my finger is at point M, in contact with object a, which makes it feel the impression . I then make movements that correspond to the series S; I mentioned that this series should be chosen carefully, so that these movements move finger D back to the original position of finger , which is point M; this finger D will then be in contact with object a, allowing it to feel the impression A.

I then make the movements corresponding to the series σ; in these movements, by hypothesis, the position of the finger D does not change, this finger therefore remains in contact with the object a and continues to feel the impression A. Finally I make the movements corresponding to the series . As is inverse to S, these movements carry the finger to the point previously occupied by the finger D, that is, to the point M. If, as may be supposed, the object a has not budged, this finger will be in contact with this object and will feel anew the impression .... Q.E.D.

I then make the movements corresponding to the series σ; during these movements, the position of the finger D does not change, so this finger stays in contact with the object a and continues to feel the impression A. Finally, I make the movements corresponding to the series . Since is the reverse of S, these movements move the finger to the spot where the finger D was, which is at point M. If, as we can assume, the object a hasn’t moved, the finger will be in contact with this object and will feel the impression .... Q.E.D.

Let us see the consequences. I consider a series of muscular sensations Σ. To this series will correspond a point M of the first tactile space. Now take again the two series S and , inverses of one another, of which we have just spoken. To the series S + Σ + will correspond a point N of the second tactile space, since to any series of muscular sensations corresponds,[Pg 270] as we have said, a point, whether in the first space or in the second.

Let’s look at the results. I’m thinking about a series of muscle sensations Σ. This series corresponds to a point M in the first tactile space. Now, let’s consider the two series S and , which are inverses of each other, as we just discussed. The series S + Σ + corresponds to a point N in the second tactile space, since, as we mentioned before, any series of muscle sensations corresponds,[Pg 270] to a point, whether in the first space or the second.

I am going to consider the two points N and M, thus defined, as corresponding. What authorizes me so to do? For this correspondence to be admissible, it is necessary that if two points M and , corresponding in the first space to two series Σ and Σ´, are identical, so also are the two corresponding points of the second space N and , that is, the two points which correspond to the two series S + Σ + and S + Σ´ + . Now we shall see that this condition is fulfilled.

I am going to consider the two points N and M, as defined, to be corresponding. What gives me the right to do this? For this correspondence to be valid, it is necessary that if two points M and , corresponding in the first space to two series Σ and Σ´, are the same, then the two corresponding points in the second space N and must also be the same. This means the two points that correspond to the two series S + Σ + and S + Σ´ + must match as well. Now we shall see that this condition is met.

First a remark. As S and are inverses of one another, we shall have S + = 0, and consequently S + + Σ = Σ + S + = Σ, or again Σ + S + + Σ´ = Σ + Σ´; but it does not follow that we have S + Σ + = Σ; because, though we have used the addition sign to represent the succession of our sensations, it is clear that the order of this succession is not indifferent: we can not, therefore, as in ordinary addition, invert the order of the terms; to use abridged language, our operations are associative, but not commutative.

First, a note. Since S and are inverses of each other, we have S + = 0, and as a result, S + + Σ = Σ + S + = Σ, or again Σ + S + + Σ´ = Σ + Σ´; however, it doesn't mean that S + Σ + = Σ; because, although we've used the addition sign to show the sequence of our sensations, it's clear that the order of this sequence matters: we cannot, therefore, as in regular addition, change the order of the terms; to put it simply, our operations are associative, but not commutative.

That fixed, in order that Σ and Σ´ should correspond to the same point M = of the first space, it is necessary and sufficient for us to have Σ´ = Σ + σ. We shall then have: S + Σ´ + = S + Σ + σ + = S + Σ + + S + σ + .

That fixed, for Σ and Σ' to correspond to the same point M = M' of the first space, it is necessary and sufficient that Σ' = Σ + σ. We will then have: S + Σ' + S' = S + Σ + σ + S' = S + Σ + S' + S + σ + S'.

But we have just ascertained that S + σ + was one of the series σ´. We shall therefore have: S + Σ´ + = S + Σ + + σ´, which means that the series S + Σ´ + and S + Σ + correspond to the same point N = of the second space. Q.E.D.

But we've just established that S + σ + was one of the series σ´. Therefore, we can say: S + Σ´ + = S + Σ + + σ´, which means that the series S + Σ´ + and S + Σ + correspond to the same point N = of the second space. Q.E.D.

Our two spaces therefore correspond point for point; they can be 'transformed' one into the other; they are isomorphic. How are we led to conclude thence that they are identical?

Our two spaces correspond perfectly; they can be 'transformed' into one another; they are isomorphic. How do we conclude from this that they are identical?

Consider the two series σ and S + σ + = σ´. I have said that often, but not always, the series σ preserves the tactile impression A felt by the finger D; and similarly it often happens, but not always, that the series σ´ preserves the tactile impression felt by the finger . Now I ascertain that it happens very often (that is, much more often than what I have just called 'often') that when the series σ has preserved the impression A of the[Pg 271] finger D, the series σ´ preserves at the same time the impression of the finger ; and, inversely, that if the first impression is altered, the second is likewise. That happens very often, but not always.

Consider the two series σ and S + σ + = σ´. I've mentioned that, often but not always, the series σ retains the tactile impression A felt by the finger D; similarly, it often happens, but not always, that the series σ´ retains the tactile impression felt by the finger . Now I find that it happens very often (that is, much more often than what I just called 'often') that when the series σ has preserved the impression A of the finger D, the series σ´ preserves at the same time the impression of the finger ; and vice versa, that if the first impression changes, the second does too. That happens very often, but not always.

We interpret this experimental fact by saying that the unknown object a which gives the impression A to the finger D is identical with the unknown object which gives the impression to the finger . And in fact when the first object moves, which the disappearance of the impression A tells us, the second likewise moves, since the impression disappears likewise. When the first object remains motionless, the second remains motionless. If these two objects are identical, as the first is at the point M of the first space and the second at the point N of the second space, these two points are identical. This is how we are led to regard these two spaces as identical; or better, this is what we mean when we say that they are identical.

We understand this experimental fact by stating that the unknown object a which creates the sensation A on finger D is the same as the unknown object which creates the sensation on finger . In fact, when the first object moves, as indicated by the loss of sensation A, the second one also moves because the sensation similarly disappears. When the first object stays still, the second one stays still as well. If these two objects are identical, with the first one at point M in the first space and the second one at point N in the second space, then these two points are also identical. This is how we come to view these two spaces as identical; or more accurately, this is what we mean when we say they are identical.

What we have just said of the identity of the two tactile spaces makes unnecessary our discussing the question of the identity of tactile space and visual space, which could be treated in the same way.

What we've just mentioned about the identity of the two tactile spaces makes it unnecessary to discuss the question of the identity between tactile space and visual space, which could be approached in the same manner.

5. Space and Empiricism

It seems that I am about to be led to conclusions in conformity with empiristic ideas. I have, in fact, sought to put in evidence the rôle of experience and to analyze the experimental facts which intervene in the genesis of space of three dimensions. But whatever may be the importance of these facts, there is one thing we must not forget and to which besides I have more than once called attention. These experimental facts are often verified but not always. That evidently does not mean that space has often three dimensions, but not always.

It looks like I'm being guided toward conclusions that align with empirical ideas. I've actually tried to highlight the role of experience and to analyze the experimental facts that contribute to the creation of three-dimensional space. However important these facts may be, we must not forget one thing, which I've pointed out multiple times. These experimental facts are often confirmed, but not always. That clearly doesn’t mean that space sometimes has three dimensions and sometimes doesn’t.

I know well that it is easy to save oneself and that, if the facts do not verify, it will be easily explained by saying that the exterior objects have moved. If experience succeeds, we say that it teaches us about space; if it does not succeed, we hie to exterior objects which we accuse of having moved; in other words, if it does not succeed, it is given a fillip.

I know it's easy to justify one's actions and that if the facts don't line up, we can just say that outside factors have changed. If our experiences work out, we claim they teach us about space; if they don't, we blame external factors for shifting; in other words, if things go wrong, we make excuses.

These fillips are legitimate; I do not refuse to admit them; but[Pg 272] they suffice to tell us that the properties of space are not experimental truths, properly so called. If we had wished to verify other laws, we could have succeeded also, by giving other analogous fillips. Should we not always have been able to justify these fillips by the same reasons? One could at most have said to us: 'Your fillips are doubtless legitimate, but you abuse them; why move the exterior objects so often?'

These boosts are valid; I won't deny that; but[Pg 272] they only show us that the properties of space aren't genuine experimental truths. If we wanted to confirm other laws, we could have done that too, just by applying similar boosts. Shouldn't we always be able to justify these boosts with the same reasoning? Someone could have said to us: 'Your boosts are certainly valid, but you're overusing them; why change the external objects so frequently?'

To sum up, experience does not prove to us that space has three dimensions; it only proves to us that it is convenient to attribute three to it, because thus the number of fillips is reduced to a minimum.

To sum up, experience doesn’t show us that space has three dimensions; it just shows us that it’s useful to think of it that way because it keeps things simple and manageable.

I will add that experience brings us into contact only with representative space, which is a physical continuum, never with geometric space, which is a mathematical continuum. At the very most it would appear to tell us that it is convenient to give to geometric space three dimensions, so that it may have as many as representative space.

I will add that experience only connects us with representative space, which is a physical continuum, never with geometric space, which is a mathematical continuum. At most, it seems to suggest that it's useful to give geometric space three dimensions so that it has as many as representative space.

The empiric question may be put under another form. Is it impossible to conceive physical phenomena, the mechanical phenomena, for example, otherwise than in space of three dimensions? We should thus have an objective experimental proof, so to speak, independent of our physiology, of our modes of representation.

The empirical question can be rephrased. Is it impossible to imagine physical phenomena, such as mechanical phenomena, in anything other than three-dimensional space? This would provide an objective experimental proof, so to speak, that exists independently of our physiology and ways of representation.

But it is not so; I shall not here discuss the question completely, I shall confine myself to recalling the striking example given us by the mechanics of Hertz. You know that the great physicist did not believe in the existence of forces, properly so called; he supposed that visible material points are subjected to certain invisible bonds which join them to other invisible points and that it is the effect of these invisible bonds that we attribute to forces.

But that’s not the case; I won’t go into this question in detail here. I’ll just mention the notable example provided by Hertz’s mechanics. You know that the great physicist didn’t believe in forces as we typically understand them; he thought that visible material points are connected by certain invisible links to other invisible points, and that it’s the effect of these invisible links that we refer to as forces.

But that is only a part of his ideas. Suppose a system formed of n material points, visible or not; that will give in all 3n coordinates; let us regard them as the coordinates of a single point in space of 3n dimensions. This single point would be constrained to remain upon a surface (of any number of dimensions < 3n) in virtue of the bonds of which we have just spoken; to go on this surface from one point to another, it would always[Pg 273] take the shortest way; this would be the single principle which would sum up all mechanics.

But that's just part of his ideas. Imagine a system made up of n material points, whether they're visible or not; that gives us all 3n coordinates. Let's think of them as the coordinates of a single point in a space with 3n dimensions. This single point would have to stay on a surface (with any number of dimensions less than 3n) due to the connections we just mentioned; when moving across this surface from one point to another, it would always[Pg 273] take the shortest path; this would be the one principle that sums up all mechanics.

Whatever should be thought of this hypothesis, whether we be allured by its simplicity, or repelled by its artificial character, the simple fact that Hertz was able to conceive it, and to regard it as more convenient than our habitual hypotheses, suffices to prove that our ordinary ideas, and, in particular, the three dimensions of space, are in no wise imposed upon mechanics with an invincible force.

Whatever we might think about this hypothesis, whether we're drawn to its simplicity or put off by its artificial nature, the mere fact that Hertz could conceive it and see it as more convenient than our usual hypotheses is enough to show that our everyday ideas, especially the three dimensions of space, are not imposed on mechanics with any kind of unbreakable force.

6. Mind and Space

Experience, therefore, has played only a single rôle, it has served as occasion. But this rôle was none the less very important; and I have thought it necessary to give it prominence. This rôle would have been useless if there existed an a priori form imposing itself upon our sensitivity, and which was space of three dimensions.

Experience has only played one role; it has acted as an occasion. But this role was still very important, and I felt it necessary to highlight it. This role would have been pointless if there were an a priori form that imposed itself on our sensitivity, which was three-dimensional space.

Does this form exist, or, if you choose, can we represent to ourselves space of more than three dimensions? And first what does this question mean? In the true sense of the word, it is clear that we can not represent to ourselves space of four, nor space of three, dimensions; we can not first represent them to ourselves empty, and no more can we represent to ourselves an object either in space of four, or in space of three, dimensions: (1) Because these spaces are both infinite and we can not represent to ourselves a figure in space, that is, the part in the whole, without representing the whole, and that is impossible, because it is infinite; (2) because these spaces are both mathematical continua, and we can represent to ourselves only the physical continuum; (3) because these spaces are both homogeneous, and the frames in which we enclose our sensations, being limited, can not be homogeneous.

Does this form exist, or can we imagine a space with more than three dimensions? First, what does this question mean? It's clear that we can't truly visualize four-dimensional space, nor can we fully grasp three-dimensional space. We can't first visualize them as empty, and we can't visualize an object in either four-dimensional or three-dimensional space: (1) Because both of these spaces are infinite, we can't depict a figure in space, meaning the part in the whole, without also depicting the whole, which is impossible since it's infinite; (2) because both spaces are mathematical continua, and we can only visualize the physical continuum; (3) because both spaces are homogeneous, and our perceptions, being limited, can't be homogeneous.

Thus the question put can only be understood in one way; is it possible to imagine that, the results of the experiences related above having been different, we might have been led to attribute to space more than three dimensions; to imagine, for instance, that the sensation of accommodation might not be constantly in accord with the sensation of convergence of the eyes;[Pg 274] or indeed that the experiences of which we have spoken in § 2, and of which we express the result by saying 'that touch does not operate at a distance,' might have led us to an inverse conclusion.

So the question can only be understood in one way: is it possible to imagine that if the results of the experiences mentioned earlier had been different, we might have started to think of space as having more than three dimensions? For example, could we imagine that the sensation of accommodation might not always match up with the sensation of the eyes converging?[Pg 274] Or, could the experiences we discussed in § 2, which we summarize by saying 'that touch does not work at a distance,' have led us to the opposite conclusion?

And then yes evidently that is possible; from the moment one imagines an experience, one imagines just thereby the two contrary results it may give. That is possible, but that is difficult, because we have to overcome a multitude of associations of ideas, which are the fruit of a long personal experience and of the still longer experience of the race. Is it these associations (or at least those of them that we have inherited from our ancestors), which constitute this a priori form of which it is said that we have pure intuition? Then I do not see why one should declare it refractory to analysis and should deny me the right of investigating its origin.

And yes, that's clearly possible; from the moment you imagine an experience, you also think about the two opposing outcomes it might produce. That is possible, but it's challenging because we have to get past a multitude of associations that stem from our own long personal experiences and the even longer history of our species. Are these associations (or at least the ones we've inherited from our ancestors) what make up this a priori form that's said to represent pure intuition? If so, I don't understand why it should be considered impossible to analyze and why I shouldn't have the right to explore its origins.

When it is said that our sensations are 'extended' only one thing can be meant, that is that they are always associated with the idea of certain muscular sensations, corresponding to the movements which enable us to reach the object which causes them, which enable us, in other words, to defend ourselves against it. And it is just because this association is useful for the defense of the organism, that it is so old in the history of the species and that it seems to us indestructible. Nevertheless, it is only an association and we can conceive that it may be broken; so that we may not say that sensation can not enter consciousness without entering in space, but that in fact it does not enter consciousness without entering in space, which means, without being entangled in this association.

When we say our sensations are 'extended,' we mean that they're always linked to certain muscle sensations that go along with the movements we make to reach the things that cause those sensations, allowing us to protect ourselves from them. This connection is deeply rooted in our evolution because it's useful for our survival, which is why it feels so fundamental. However, it is just an association, and we can imagine that it could be disrupted; so we shouldn’t claim that sensation can't enter awareness without being spatial, but rather that in reality, it doesn’t enter awareness without being spatial, meaning it’s tied up in this association.

No more can I understand one's saying that the idea of time is logically subsequent to space, since we can represent it to ourselves only under the form of a straight line; as well say that time is logically subsequent to the cultivation of the prairies, since it is usually represented armed with a scythe. That one can not represent to himself simultaneously the different parts of time, goes without saying, since the essential character of these parts is precisely not to be simultaneous. That does not mean that we have not the intuition of time. So far as that goes, no more should we have that of space, because neither can we[Pg 275] represent it, in the proper sense of the word, for the reasons I have mentioned. What we represent to ourselves under the name of straight is a crude image which as ill resembles the geometric straight as it does time itself.

I can no longer understand why some say that the concept of time comes after space, since we can only visualize it as a straight line; it’s like saying time follows the farming of prairies just because it’s usually depicted with a scythe. The fact that we can’t grasp the different parts of time at the same moment is obvious, since those parts are defined by their non-simultaneity. That doesn’t mean we lack an intuition for time. Similarly, we shouldn’t claim to have an intuition for space, because we can’t truly visualize it either, for the reasons I mentioned. What we picture as straight is a rough image that bears little resemblance to geometric straightness or to time itself.

Why has it been said that every attempt to give a fourth dimension to space always carries this one back to one of the other three? It is easy to understand. Consider our muscular sensations and the 'series' they may form. In consequence of numerous experiences, the ideas of these series are associated together in a very complex woof, our series are classed. Allow me, for convenience of language, to express my thought in a way altogether crude and even inexact by saying that our series of muscular sensations are classed in three classes corresponding to the three dimensions of space. Of course this classification is much more complicated than that, but that will suffice to make my reasoning understood. If I wish to imagine a fourth dimension, I shall suppose another series of muscular sensations, making part of a fourth class. But as all my muscular sensations have already been classed in one of the three pre-existent classes, I can only represent to myself a series belonging to one of these three classes, so that my fourth dimension is carried back to one of the other three.

Why is it said that every attempt to add a fourth dimension to space always brings us back to one of the other three? It's simple to understand. Think about our muscle sensations and the 'series' they can create. Through many experiences, the ideas of these series are linked together in a very complex way; our series are classified. For the sake of clarity, let me put it in a straightforward and even somewhat imprecise way: our series of muscle sensations are grouped into three categories that correspond to the three dimensions of space. Of course, this classification is much more intricate than that, but it’s enough to convey my point. If I want to imagine a fourth dimension, I would have to consider another series of muscle sensations, creating a fourth category. However, since all my muscle sensations have already been categorized into one of the three existing classes, I can only visualize a series that fits into one of these three categories, which means my fourth dimension is ultimately reduced to one of the other three.

What does that prove? This: that it would have been necessary first to destroy the old classification and replace it by a new one in which the series of muscular sensations should have been distributed into four classes. The difficulty would have disappeared.

What does that prove? This: that it would have been necessary to first eliminate the old classification and replace it with a new one where the series of muscle sensations would be divided into four categories. The difficulty would have vanished.

It is presented sometimes under a more striking form. Suppose I am enclosed in a chamber between the six impassable boundaries formed by the four walls, the floor and the ceiling; it will be impossible for me to get out and to imagine my getting out. Pardon, can you not imagine that the door opens, or that two of these walls separate? But of course, you answer, one must suppose that these walls remain immovable. Yes, but it is evident that I have the right to move; and then the walls that we suppose absolutely at rest will be in motion with regard to me. Yes, but such a relative motion can not be arbitrary; when objects are at rest, their relative motion with regard to any axes[Pg 276] is that of a rigid solid; now, the apparent motions that you imagine are not in conformity with the laws of motion of a rigid solid. Yes, but it is experience which has taught us the laws of motion of a rigid solid; nothing would prevent our imagining them different. To sum up, for me to imagine that I get out of my prison, I have only to imagine that the walls seem to open, when I move.

Sometimes it's presented in a more dramatic way. Imagine I'm stuck in a room surrounded by six unbreakable barriers formed by the four walls, the floor, and the ceiling; it's impossible for me to escape or even picture escaping. But wait, can't you imagine that the door opens or that two of these walls part? Of course, you say, but we have to assume these walls stay fixed. Yes, but clearly I have the right to move; and then the walls we believe are completely still will be moving in relation to me. True, but such relative motion can't just be random; when objects are still, their relative motion with respect to any axes[Pg 276] behaves like a solid object. However, the motions you're envisioning don't follow the laws of motion for a solid object. Yes, but it's our experiences that taught us the laws of motion for solids; nothing stops us from imagining them differently. In conclusion, to picture myself escaping my confinement, I just need to imagine that the walls appear to open when I move.

I believe, therefore, that if by space is understood a mathematical continuum of three dimensions, were it otherwise amorphous, it is the mind which constructs it, but it does not construct it out of nothing; it needs materials and models. These materials, like these models, preexist within it. But there is not a single model which is imposed upon it; it has choice; it may choose, for instance, between space of four and space of three dimensions. What then is the rôle of experience? It gives the indications following which the choice is made.

I believe that if we understand space as a mathematical continuum with three dimensions, then if it were otherwise formless, it's the mind that shapes it, but it doesn't create it out of nothing; it requires materials and models. These materials, like these models, already exist within it. However, there's not just one model that has to be followed; it has choice; it can choose, for example, between four-dimensional space and three-dimensional space. So, what role does experience play? It provides the guidance that informs the choice.

Another thing: whence does space get its quantitative character? It comes from the rôle which the series of muscular sensations play in its genesis. These are series which may repeat themselves, and it is from their repetition that number comes; it is because they can repeat themselves indefinitely that space is infinite. And finally we have seen, at the end of section 3, that it is also because of this that space is relative. So it is repetition which has given to space its essential characteristics; now, repetition supposes time; this is enough to tell that time is logically anterior to space.

Another thing: where does space get its quantitative nature? It comes from the role that the series of muscle sensations play in its creation. These are series that can repeat themselves, and it is from their repetition that numbers arise; it is because they can repeat indefinitely that space is infinite. Finally, as we saw at the end of section 3, this also makes space relative. So, repetition is what gives space its essential features; now, repetition requires time; this is enough to suggest that time is logically prior to space.

7. Rôle of the Semicircular Canals

I have not hitherto spoken of the rôle of certain organs to which the physiologists attribute with reason a capital importance, I mean the semicircular canals. Numerous experiments have sufficiently shown that these canals are necessary to our sense of orientation; but the physiologists are not entirely in accord; two opposing theories have been proposed, that of Mach-Delage and that of M. de Cyon.

I haven't yet talked about the role of certain organs that physiologists reasonably consider to be critically important, specifically the semicircular canals. Numerous experiments have clearly demonstrated that these canals are essential for our sense of orientation; however, physiologists do not completely agree on this matter. Two conflicting theories have been put forward: Mach-Delage's theory and that of M. de Cyon.

M. de Cyon is a physiologist who has made his name illustrious by important discoveries on the innervation of the heart; I can not, however, agree with his ideas on the question before us. Not[Pg 277] being a physiologist, I hesitate to criticize the experiments he has directed against the adverse theory of Mach-Delage; it seems to me, however, that they are not convincing, because in many of them the total pressure was made to vary in one of the canals, while, physiologically, what varies is the difference between the pressures on the two extremities of the canal; in others the organs were subjected to profound lesions, which must alter their functions.

M. de Cyon is a physiologist who has made a name for himself with significant discoveries about the heart's innervation; however, I can't agree with his ideas on the issue at hand. Not being a physiologist, I hesitate to critique the experiments he conducted against Mach-Delage's opposing theory; still, I find them unconvincing because, in many of them, the total pressure was manipulated in one of the canals, whereas, physiologically, what really changes is the difference in pressure between the two ends of the canal. In other experiments, the organs were subjected to severe damage, which must affect their functions.

Besides, this is not important; the experiments, if they were irreproachable, might be convincing against the old theory. They would not be convincing for the new theory. In fact, if I have rightly understood the theory, my explaining it will be enough for one to understand that it is impossible to conceive of an experiment confirming it.

Besides, this isn't important; the experiments, even if they were flawless, might challenge the old theory. They wouldn’t support the new theory. In fact, if I understand the theory correctly, just explaining it should make it clear that it’s impossible to imagine an experiment that confirms it.

The three pairs of canals would have as sole function to tell us that space has three dimensions. Japanese mice have only two pairs of canals; they believe, it would seem, that space has only two dimensions, and they manifest this opinion in the strangest way; they put themselves in a circle, and, so ordered, they spin rapidly around. The lampreys, having only one pair of canals, believe that space has only one dimension, but their manifestations are less turbulent.

The three pairs of canals exist solely to show us that space has three dimensions. Japanese mice have only two pairs of canals; they seem to believe that space has just two dimensions, and they express this belief in a peculiar way: they form a circle and spin around quickly. Lampreys, with only one pair of canals, believe that space has only one dimension, but their expressions are less frenzied.

It is evident that such a theory is inadmissible. The sense-organs are designed to tell us of changes which happen in the exterior world. We could not understand why the Creator should have given us organs destined to cry without cease: Remember that space has three dimensions, since the number of these three dimensions is not subject to change.

It is clear that this kind of theory doesn't hold up. Our senses are made to inform us about changes that occur in the outside world. We wouldn't understand why the Creator would give us organs that would just keep crying: Keep in mind that space has three dimensions, and the number of these three dimensions doesn’t change.

We must, therefore, come back to the theory of Mach-Delage. What the nerves of the canals can tell us is the difference of pressure on the two extremities of the same canal, and thereby: (1) the direction of the vertical with regard to three axes rigidly bound to the head; (2) the three components of the acceleration of translation of the center of gravity of the head; (3) the centrifugal forces developed by the rotation of the head; (4) the acceleration of the motion of rotation of the head.

We need to return to Mach-Delage's theory. The nerves in the canals inform us about the pressure differences at both ends of the same canal, and this can tell us: (1) the vertical direction in relation to three axes fixed to the head; (2) the three components of the acceleration of the head's center of gravity; (3) the centrifugal forces created by the head's rotation; (4) the acceleration of the head's rotational movement.

It follows from the experiments of M. Delage that it is this last indication which is much the most important; doubtless[Pg 278] because the nerves are less sensible to the difference of pressure itself than to the brusque variations of this difference. The first three indications may thus be neglected.

It follows from M. Delage's experiments that this last indication is by far the most important; likely[Pg 278] because the nerves are less sensitive to the pressure difference itself than to the sudden changes in this difference. Therefore, the first three indications can be disregarded.

Knowing the acceleration of the motion of rotation of the head at each instant, we deduce from it, by an unconscious integration, the final orientation of the head, referred to a certain initial orientation taken as origin. The circular canals contribute, therefore, to inform us of the movements that we have executed, and that on the same ground as the muscular sensations. When, therefore, above we speak of the series S or of the series Σ, we should say, not that these were series of muscular sensations alone, but that they were series at the same time of muscular sensations and of sensations due to the semicircular canals. Apart from this addition, we should have nothing to change in what precedes.

Knowing the acceleration of the head's rotational motion at any moment, we unconsciously integrate it to determine the final orientation of the head, based on a certain initial position considered as the starting point. The circular canals help us understand the movements we've made, alongside the muscular sensations. So, when we refer to the series S or the series Σ, we shouldn't say they are just series of muscular sensations, but rather that they are series that include both muscular sensations and sensations from the semicircular canals. Aside from this addition, we shouldn't change anything from what has been stated before.

In the series S and Σ, these sensations of the semicircular canals evidently hold a very important place. Yet alone they would not suffice, because they can tell us only of the movements of the head; they tell us nothing of the relative movements of the body or of the members in regard to the head. And more, it seems that they tell us only of the rotations of the head and not of the translations it may undergo.

In the series S and Σ, these feelings from the semicircular canals clearly play a significant role. However, by themselves, they aren't enough, as they only inform us about the movements of the head; they don’t give us any information about the body's or limbs' movements in relation to the head. Furthermore, it appears they only indicate the head's rotations and not any translations it might experience.


PART II

THE PHYSICAL SCIENCES


CHAPTER V

Analysis and Physics

I

You have doubtless often been asked of what good is mathematics and whether these delicate constructions entirely mind-made are not artificial and born of our caprice.

You’ve probably been asked many times what the point of mathematics is and whether these intricate concepts, which seem entirely created by our minds, are just artificial and a product of our whims.

Among those who put this question I should make a distinction; practical people ask of us only the means of money-making. These merit no reply; rather would it be proper to ask of them what is the good of accumulating so much wealth and whether, to get time to acquire it, we are to neglect art and science, which alone give us souls capable of enjoying it, 'and for life's sake to sacrifice all reasons for living.'

Among those who ask this question, I should make a distinction; practical people only want to know how to make money. They don’t deserve a response; instead, it would be better to ask them what the point is of accumulating so much wealth and whether, in order to make that money, we should ignore art and science, which are the only things that give us the ability to enjoy it, and, for the sake of life, sacrifice all the reasons to live.

Besides, a science made solely in view of applications is impossible; truths are fecund only if bound together. If we devote ourselves solely to those truths whence we expect an immediate result, the intermediary links are wanting and there will no longer be a chain.

Besides, a science focused only on practical applications is impossible; truths can only thrive when they're connected. If we only invest in those truths that we think will yield quick results, the connections between them will be missing, and there will no longer be a cohesive structure.

The men most disdainful of theory get from it, without suspecting it, their daily bread; deprived of this food, progress would quickly cease, and we should soon congeal into the immobility of old China.

The men who look down on theory actually rely on it for their daily needs, without even realizing it; without this support, progress would grind to a halt, and we'd soon freeze in the same old ways as ancient China.

But enough of uncompromising practicians! Besides these, there are those who are only interested in nature and who ask us if we can enable them to know it better.

But enough of strict practitioners! In addition to these, there are those who are simply interested in nature and who ask us if we can help them understand it better.

To answer these, we have only to show them the two monuments already rough-hewn, Celestial Mechanics and Mathematical Physics.[Pg 280]

To answer these, we just need to show them the two monuments that are already roughly shaped, Celestial Mechanics and Mathematical Physics.[Pg 280]

They would doubtless concede that these structures are well worth the trouble they have cost us. But this is not enough. Mathematics has a triple aim. It must furnish an instrument for the study of nature. But that is not all: it has a philosophic aim and, I dare maintain, an esthetic aim. It must aid the philosopher to fathom the notions of number, of space, of time. And above all, its adepts find therein delights analogous to those given by painting and music. They admire the delicate harmony of numbers and forms; they marvel when a new discovery opens to them an unexpected perspective; and has not the joy they thus feel the esthetic character, even though the senses take no part therein? Only a privileged few are called to enjoy it fully, it is true, but is not this the case for all the noblest arts?

They would certainly agree that these structures are worth the effort and resources they required. But that's not enough. Mathematics has three main goals. It needs to provide a tool for exploring nature. But that's just one aspect: it also has a philosophical purpose, and I would argue, an aesthetic purpose as well. It should help philosophers understand concepts like numbers, space, and time. Most importantly, those who study it find joy similar to that which painting and music provide. They appreciate the subtle harmony of numbers and shapes; they are amazed when a new discovery reveals an unforeseen perspective; and doesn't the joy they experience have an aesthetic quality, even if it doesn’t involve their senses? It's true that only a select few get to experience it fully, but isn't that the case for all the greatest arts?

This is why I do not hesitate to say that mathematics deserves to be cultivated for its own sake, and the theories inapplicable to physics as well as the others. Even if the physical aim and the esthetic aim were not united, we ought not to sacrifice either.

This is why I confidently state that mathematics should be valued for its own sake, including theories that don't apply to physics and those that do. Even if the goals of physics and aesthetics weren't aligned, we shouldn't give up on either one.

But more: these two aims are inseparable and the best means of attaining one is to aim at the other, or at least never to lose sight of it. This is what I am about to try to demonstrate in setting forth the nature of the relations between the pure science and its applications.

But more: these two goals are connected, and the best way to achieve one is to focus on the other, or at least to never forget about it. This is what I’m going to demonstrate by explaining the relationship between pure science and its applications.

The mathematician should not be for the physicist a mere purveyor of formulas; there should be between them a more intimate collaboration. Mathematical physics and pure analysis are not merely adjacent powers, maintaining good neighborly relations; they mutually interpenetrate and their spirit is the same. This will be better understood when I have shown what physics gets from mathematics and what mathematics, in return, borrows from physics.

The mathematician shouldn't just be a supplier of formulas for the physicist; there should be a closer collaboration between them. Mathematical physics and pure analysis aren’t just neighboring fields that maintain a friendly relationship; they deeply interconnect and share the same essence. This will be clearer once I demonstrate what physics gains from mathematics and what mathematics, in turn, takes from physics.

II

The physicist can not ask of the analyst to reveal to him a new truth; the latter could at most only aid him to foresee it. It is a long time since one still dreamt of forestalling experiment, or of constructing the entire world on certain premature hypotheses. Since all those constructions in which one yet took a naïve delight it is an age, to-day only their ruins remain.[Pg 281]

The physicist cannot expect the analyst to reveal a new truth; the analyst can at most help him anticipate it. It's been a long time since anyone dreamed of predicting experiments or building the whole world on some hasty assumptions. Those naive constructions that once brought joy are now just ruins.[Pg 281]

All laws are therefore deduced from experiment; but to enunciate them, a special language is needful; ordinary language is too poor, it is besides too vague, to express relations so delicate, so rich, and so precise.

All laws are therefore derived from experiment; however, to state them, a specific language is necessary; regular language is too limited and too vague to express such delicate, rich, and precise relationships.

This therefore is one reason why the physicist can not do without mathematics; it furnishes him the only language he can speak. And a well-made language is no indifferent thing; not to go beyond physics, the unknown man who invented the word heat devoted many generations to error. Heat has been treated as a substance, simply because it was designated by a substantive, and it has been thought indestructible.

This is one reason why a physicist can't do without mathematics; it provides the only language they can use. And a well-crafted language is really important; staying within the realm of physics, the unknown person who came up with the word heat caused many generations to get it wrong. Heat has been treated as a substance just because it was named with a noun, and people thought it was indestructible.

On the other hand, he who invented the word electricity had the unmerited good fortune to implicitly endow physics with a new law, that of the conservation of electricity, which, by a pure chance, has been found exact, at least until now.

On the other hand, the person who came up with the word electricity had the undeserved luck to implicitly give physics a new law, the law of conservation of electricity, which, purely by chance, has been proven correct, at least up to now.

Well, to continue the simile, the writers who embellish a language, who treat it as an object of art, make of it at the same time a more supple instrument, more apt for rendering shades of thought.

Well, to continue the comparison, writers who enhance a language, who treat it as a form of art, also make it a more flexible tool, better suited for expressing subtle nuances of thought.

We understand, then, how the analyst, who pursues a purely esthetic aim, helps create, just by that, a language more fit to satisfy the physicist.

We see, then, how the analyst, who focuses solely on aesthetic goals, contributes to the creation of a language that better meets the needs of the physicist.

But this is not all: law springs from experiment, but not immediately. Experiment is individual, the law deduced from it is general; experiment is only approximate, the law is precise, or at least pretends to be. Experiment is made under conditions always complex, the enunciation of the law eliminates these complications. This is what is called 'correcting the systematic errors.'

But that's not all: law comes from experimentation, but not directly. Experimentation is personal, while the law derived from it is universal; experimentation is only rough, the law is exact—or at least claims to be. Experiments are conducted under always complicated conditions, while the statement of the law removes these complications. This is what is known as 'correcting the systematic errors.'

In a word, to get the law from experiment, it is necessary to generalize; this is a necessity imposed upon the most circumspect observer. But how generalize? Every particular truth may evidently be extended in an infinity of ways. Among these thousand routes opening before us, it is necessary to make a choice, at least provisional; in this choice, what shall guide us?

In short, to derive laws from experiments, we need to generalize; this is a requirement for even the most careful observer. But how do we generalize? Each specific truth can clearly be expanded in countless ways. Among the many paths available to us, we need to make a choice, at least temporarily; what should guide us in this choice?

It can only be analogy. But how vague is this word! Primitive man knew only crude analogies, those which strike the senses, those of colors or of sounds. He never would have dreamt of likening light to radiant heat.[Pg 282]

It can only be an analogy. But how vague is this word! Primitive man only understood simple analogies, those that appeal to the senses, like colors or sounds. He would never have thought to compare light to radiant heat.[Pg 282]

What has taught us to know the true, profound analogies, those the eyes do not see but reason divines?

What has taught us to understand the real, deep connections that the eyes can't see but reason reveals?

It is the mathematical spirit, which disdains matter to cling only to pure form. This it is which has taught us to give the same name to things differing only in material, to call by the same name, for instance, the multiplication of quaternions and that of whole numbers.

It is the mathematical spirit that looks down on matter and focuses solely on pure form. This is what has taught us to use the same name for things that differ only in substance, such as calling both the multiplication of quaternions and the multiplication of whole numbers by the same name.

If quaternions, of which I have just spoken, had not been so promptly utilized by the English physicists, many persons would doubtless see in them only a useless fancy, and yet, in teaching us to liken what appearances separate, they would have already rendered us more apt to penetrate the secrets of nature.

If quaternions, which I just mentioned, hadn't been so quickly used by English physicists, many people would probably see them as just a pointless idea. However, by helping us compare things that appearances keep apart, they have already made us better at uncovering nature's mysteries.

Such are the services the physicist should expect of analysis; but for this science to be able to render them, it must be cultivated in the broadest fashion without immediate expectation of utility—the mathematician must have worked as artist.

Such are the services that a physicist should expect from analysis; however, for this science to deliver, it must be developed in the broadest way without any immediate expectation of usefulness—the mathematician must work as an artist.

What we ask of him is to help us to see, to discern our way in the labyrinth which opens before us. Now, he sees best who stands highest. Examples abound, and I limit myself to the most striking.

What we ask of him is to help us see and find our way through the maze that lies ahead. The person who sees the clearest is the one who stands at the highest point. There are many examples, but I'll just mention the most notable ones.

The first will show us how to change the language suffices to reveal generalizations not before suspected.

The first will show us how to change the language, which will reveal generalizations that we hadn't suspected before.

When Newton's law has been substituted for Kepler's we still know only elliptic motion. Now, in so far as concerns this motion, the two laws differ only in form; we pass from one to the other by a simple differentiation. And yet from Newton's law may be deduced by an immediate generalization all the effects of perturbations and the whole of celestial mechanics. If, on the other hand, Kepler's enunciation had been retained, no one would ever have regarded the orbits of the perturbed planets, those complicated curves of which no one has ever written the equation, as the natural generalizations of the ellipse. The progress of observations would only have served to create belief in chaos.

When Newton's law replaced Kepler's, we still only understood elliptical motion. In terms of this motion, the two laws are different only in how they are expressed; we transition from one to the other with a straightforward differentiation. Yet, from Newton's law, we can immediately derive all the effects of disturbances and everything about celestial mechanics. If we had kept Kepler's statements, no one would have ever seen the orbits of the perturbed planets—those complex curves that no one has managed to write an equation for—as natural extensions of the ellipse. The advancement of observations would have only reinforced the idea of chaos.

The second example is equally deserving of consideration.

The second example is just as worthy of consideration.

When Maxwell began his work, the laws of electro-dynamics admitted up to his time accounted for all the known facts. It was not a new experiment which came to invalidate them. But in looking at them under a new bias, Maxwell saw that the equations[Pg 283] became more symmetrical when a term was added, and besides, this term was too small to produce effects appreciable with the old methods.

When Maxwell started his work, the laws of electro-dynamics that existed at that time explained all the known facts. It wasn't a new experiment that disproved them. However, by viewing the equations from a different perspective, Maxwell realized that they became more symmetrical when he added a term, and this term was also too small to cause noticeable effects with the previous methods.

You know that Maxwell's a priori views awaited for twenty years an experimental confirmation; or, if you prefer, Maxwell was twenty years ahead of experiment. How was this triumph obtained?

You know that Maxwell's a priori theories waited twenty years for experimental proof; or, if you prefer, Maxwell was twenty years ahead of the experiments. How was this achievement accomplished?

It was because Maxwell was profoundly steeped in the sense of mathematical symmetry; would he have been so, if others before him had not studied this symmetry for its own beauty?

It was because Maxwell was deeply immersed in the idea of mathematical symmetry; would he have been so if others before him hadn't explored this symmetry for its own beauty?

It was because Maxwell was accustomed to 'think in vectors,' and yet it was through the theory of imaginaries (neomonics) that vectors were introduced into analysis. And those who invented imaginaries hardly suspected the advantage which would be obtained from them for the study of the real world, of this the name given them is proof sufficient.

It was because Maxwell was used to 'thinking in vectors,' and yet it was through the theory of imaginaries (neomonics) that vectors were brought into analysis. Those who created imaginaries probably didn't realize the benefits they would bring to the study of the real world, as the name they were given shows.

In a word, Maxwell was perhaps not an able analyst, but this ability would have been for him only a useless and bothersome baggage. On the other hand, he had in the highest degree the intimate sense of mathematical analogies. Therefore it is that he made good mathematical physics.

In short, Maxwell might not have been a skilled analyst, but that skill would have just been a useless and annoying burden for him. On the flip side, he possessed a strong instinct for mathematical analogies. That's why he was able to excel in mathematical physics.

Maxwell's example teaches us still another thing.

Maxwell's example teaches us something else as well.

How should the equations of mathematical physics be treated? Should we simply deduce all the consequences and regard them as intangible realities? Far from it; what they should teach us above all is what can and what should be changed. It is thus that we get from them something useful.

How should we approach the equations of mathematical physics? Should we just figure out all their consequences and treat them as unchangeable truths? Definitely not; what they should primarily teach us is what can be changed and what should be changed. That’s how we gain something useful from them.

The third example goes to show us how we may perceive mathematical analogies between phenomena which have physically no relation either apparent or real, so that the laws of one of these phenomena aid us to divine those of the other.

The third example shows us how we can find mathematical similarities between phenomena that seem to have no connection, either obvious or real, allowing us to use the rules of one to understand the rules of the other.

The very same equation, that of Laplace, is met in the theory of Newtonian attraction, in that of the motion of liquids, in that of the electric potential, in that of magnetism, in that of the propagation of heat and in still many others. What is the result? These theories seem images copied one from the other; they are mutually illuminating, borrowing their language from each other; ask electricians if they do not felicitate themselves on[Pg 284] having invented the phrase flow of force, suggested by hydrodynamics and the theory of heat.

The same equation from Laplace appears in the theory of Newtonian attraction, in the motion of liquids, in electric potential, in magnetism, in heat propagation, and in many other areas. What’s the conclusion? These theories seem to reflect one another; they shed light on each other and share terminology. Just ask electricians if they don’t take pride in coining the term “flow of force,” inspired by hydrodynamics and heat theory.[Pg 284]

Thus mathematical analogies not only may make us foresee physical analogies, but besides do not cease to be useful when these latter fail.

Thus, mathematical analogies can help us anticipate physical analogies, but they remain useful even when those physical ones fall short.

To sum up, the aim of mathematical physics is not only to facilitate for the physicist the numerical calculation of certain constants or the integration of certain differential equations. It is besides, it is above all, to reveal to him the hidden harmony of things in making him see them in a new way.

To sum up, the goal of mathematical physics isn't just to help physicists with the numerical calculations of specific constants or the integration of certain differential equations. More importantly, it's to uncover the hidden harmony of things by allowing them to see these concepts in a new light.

Of all the parts of analysis, the most elevated, the purest, so to speak, will be the most fruitful in the hands of those who know how to use them.

Of all the aspects of analysis, the highest and most refined will be the most productive for those who know how to utilize them.

III

Let us now see what analysis owes to physics.

Let’s now look at what analysis owes to physics.

It would be necessary to have completely forgotten the history of science not to remember that the desire to understand nature has had on the development of mathematics the most constant and happiest influence.

It would be impossible to have completely forgotten the history of science and not recall that the desire to understand nature has had the most consistent and positive impact on the development of mathematics.

In the first place the physicist sets us problems whose solution he expects of us. But in proposing them to us, he has largely paid us in advance for the service we shall render him, if we solve them.

In the first place, the physicist gives us problems that he expects us to solve. But by presenting these problems to us, he has mostly compensated us in advance for the help we will provide him if we find the solutions.

If I may be allowed to continue my comparison with the fine arts, the pure mathematician who should forget the existence of the exterior world would be like a painter who knew how to harmoniously combine colors and forms, but who lacked models. His creative power would soon be exhausted.

If I can continue my comparison with the fine arts, the pure mathematician who forgets the existence of the outside world would be like a painter who knows how to combine colors and shapes harmoniously but lacks models. His creative energy would quickly run out.

The combinations which numbers and symbols may form are an infinite multitude. In this multitude how shall we choose those which are worthy to fix our attention? Shall we let ourselves be guided solely by our caprice? This caprice, which itself would besides soon tire, would doubtless carry us very far apart and we should quickly cease to understand each other.

The combinations that numbers and symbols can create are endless. Among this multitude, how do we select the ones that deserve our focus? Should we let our whims guide us? This whim, which would soon become tiresome, would likely take us in vastly different directions, and we would quickly stop understanding one another.

But this is only the smaller side of the question. Physics will doubtless prevent our straying, but it will also preserve us from a danger much more formidable; it will prevent our ceaselessly going around in the same circle.[Pg 285]

But this is just the smaller part of the issue. Physics will surely keep us on track, but it will also protect us from a much bigger threat; it will stop us from endlessly going around in the same cycle.[Pg 285]

History proves that physics has not only forced us to choose among problems which came in a crowd; it has imposed upon us such as we should without it never have dreamed of. However varied may be the imagination of man, nature is still a thousand times richer. To follow her we must take ways we have neglected, and these paths lead us often to summits whence we discover new countries. What could be more useful!

History shows that physics has not only made us choose from a multitude of problems; it has presented us with challenges we would never have imagined without it. No matter how diverse human imagination may be, nature is still infinitely richer. To explore her wonders, we must take paths we've overlooked, and these routes often lead us to heights from which we find new territories. What could be more beneficial!

It is with mathematical symbols as with physical realities; it is in comparing the different aspects of things that we are able to comprehend their inner harmony, which alone is beautiful and consequently worthy of our efforts.

It’s like mathematical symbols and physical realities; by comparing different aspects of things, we can understand their inner harmony, which is what’s truly beautiful and deserving of our efforts.

The first example I shall cite is so old we are tempted to forget it; it is nevertheless the most important of all.

The first example I'll mention is so old that we might forget it; still, it’s the most important of all.

The sole natural object of mathematical thought is the whole number. It is the external world which has imposed the continuum upon us, which we doubtless have invented, but which it has forced us to invent. Without it there would be no infinitesimal analysis; all mathematical science would reduce itself to arithmetic or to the theory of substitutions.

The only natural focus of mathematical thinking is whole numbers. It’s the external world that has made the concept of continuity necessary for us, something we definitely created, but it’s something the world has compelled us to create. Without this concept, there wouldn’t be any infinitesimal calculus; all of mathematics would simply come down to arithmetic or the theory of replacements.

On the contrary, we have devoted to the study of the continuum almost all our time and all our strength. Who will regret it; who will think that this time and this strength have been wasted? Analysis unfolds before us infinite perspectives that arithmetic never suspects; it shows us at a glance a majestic assemblage whose array is simple and symmetric; on the contrary, in the theory of numbers, where reigns the unforeseen, the view is, so to speak, arrested at every step.

On the other hand, we've dedicated almost all our time and energy to studying the continuum. Who would regret it? Who would think that this time and energy have been wasted? Analysis reveals endless perspectives that arithmetic can’t even imagine; it gives us a clear view of a grand arrangement that is simple and symmetrical. In contrast, in number theory, where the unexpected reigns, our view is, so to speak, halted at every turn.

Doubtless it will be said that outside of the whole number there is no rigor, and consequently no mathematical truth; that the whole number hides everywhere, and that we must strive to render transparent the screens which cloak it, even if to do so we must resign ourselves to interminable repetitions. Let us not be such purists and let us be grateful to the continuum, which, if all springs from the whole number, was alone capable of making so much proceed therefrom.

Doubtless it will be said that outside of whole numbers there is no rigor, and therefore no mathematical truth; that whole numbers are hidden everywhere, and we must work to make clear the barriers that conceal them, even if doing so means accepting endless repetitions. Let’s not be so strict and let’s appreciate the continuum, which, if everything comes from whole numbers, was the only thing capable of allowing so much to emerge from it.

Need I also recall that M. Hermite obtained a surprising advantage from the introduction of continuous variables into the theory of numbers? Thus the whole number's own domain is[Pg 286] itself invaded, and this invasion has established order where disorder reigned.

Need I also remind you that M. Hermite gained an unexpected benefit from introducing continuous variables into the number theory? So, the entire realm of whole numbers is[Pg 286] itself disrupted, and this disruption has created order where chaos existed.

See what we owe to the continuum and consequently to physical nature.

See what we owe to the continuous flow and, as a result, to the natural world.

Fourier's series is a precious instrument of which analysis makes continual use, it is by this means that it has been able to represent discontinuous functions; Fourier invented it to solve a problem of physics relative to the propagation of heat. If this problem had not come up naturally, we should never have dared to give discontinuity its rights; we should still long have regarded continuous functions as the only true functions.

Fourier's series is a valuable tool that analysis constantly relies on; it allows for the representation of discontinuous functions. Fourier created it to address a physics problem related to the transfer of heat. If this issue hadn't arisen, we might never have accepted discontinuity; we would likely still view continuous functions as the only real functions.

The notion of function has been thereby considerably extended and has received from some logician-analysts an unforeseen development. These analysts have thus adventured into regions where reigns the purest abstraction and have gone as far away as possible from the real world. Yet it is a problem of physics which has furnished them the occasion.

The concept of function has been significantly expanded and has experienced an unexpected evolution from some logician-analysts. These analysts have ventured into areas where the highest level of abstraction exists and have distanced themselves as much as possible from the real world. However, it is a physics problem that has provided them with the opportunity.

After Fourier's series, other analogous series have entered the domain of analysis; they have entered by the same door; they have been imagined in view of applications.

After Fourier's series, other similar series have entered the field of analysis; they have come in through the same door; they have been created with applications in mind.

The theory of partial differential equations of the second order has an analogous history. It has been developed chiefly by and for physics. But it may take many forms, because such an equation does not suffice to determine the unknown function, it is necessary to adjoin to it complementary conditions which are called conditions at the limits; whence many different problems.

The theory of second-order partial differential equations has a similar history. It has mainly been developed by and for physics. However, it can take on many different forms because a single equation isn’t enough to determine the unknown function; it's necessary to add complementary conditions, known as boundary conditions, which results in various problems.

If the analysts had abandoned themselves to their natural tendencies, they would never have known but one, that which Madame Kovalevski has treated in her celebrated memoir. But there are a multitude of others which they would have ignored. Each of the theories of physics, that of electricity, that of heat, presents us these equations under a new aspect. It may, therefore, be said that without these theories we should not know partial differential equations.

If the analysts had given in to their natural inclinations, they would only be familiar with the one that Madame Kovalevski discussed in her famous memoir. However, there are many others that they would have overlooked. Each theory of physics—like those of electricity and heat—presents us with these equations from a new perspective. So, it can be said that without these theories, we wouldn't know about partial differential equations.

It is needless to multiply examples. I have given enough to be able to conclude: when physicists ask of us the solution of a problem, it is not a duty-service they impose upon us, it is on the contrary we who owe them thanks.

It’s unnecessary to give more examples. I've provided enough to conclude: when physicists ask us to solve a problem, they’re not placing a burden on us; instead, we should be grateful to them.

IV

But this is not all; physics not only gives us the occasion to solve problems; it aids us to find the means thereto, and that in two ways. It makes us foresee the solution; it suggests arguments to us.

But that's not all; physics not only gives us the chance to solve problems, but it also helps us find the ways to do so, and in two ways. It helps us anticipate the solution; it offers us insights.

I have spoken above of Laplace's equation which is met in a multitude of diverse physical theories. It is found again in geometry, in the theory of conformal representation and in pure analysis, in that of imaginaries.

I have mentioned earlier Laplace's equation, which appears in many different physical theories. It also shows up in geometry, in the theory of conformal representation, and in pure analysis, specifically in the study of imaginary numbers.

In this way, in the study of functions of complex variables, the analyst, alongside of the geometric image, which is his usual instrument, finds many physical images which he may make use of with the same success. Thanks to these images, he can see at a glance what pure deduction would show him only successively. He masses thus the separate elements of the solution, and by a sort of intuition divines before being able to demonstrate.

In this way, when studying functions of complex variables, the analyst, along with the geometric representation that he usually relies on, discovers many physical representations that he can use just as effectively. Thanks to these representations, he can quickly grasp what pure deduction would only reveal to him step by step. He compiles the individual components of the solution and, through a kind of intuition, anticipates results before he’s able to prove them.

To divine before demonstrating! Need I recall that thus have been made all the important discoveries? How many are the truths that physical analogies permit us to present and that we are not in condition to establish by rigorous reasoning!

To predict before showing! Do I need to remind you that this is how all significant discoveries have been made? How many truths can we present through physical analogies that we can't establish with strict reasoning!

For example, mathematical physics introduces a great number of developments in series. No one doubts that these developments converge; but the mathematical certitude is lacking. These are so many conquests assured for the investigators who shall come after us.

For example, mathematical physics brings a lot of advancements in sequences. No one questions that these advancements come together; however, the mathematical certainty is missing. These are numerous achievements secured for the researchers who will follow us.

On the other hand, physics furnishes us not alone solutions; it furnishes us besides, in a certain measure, arguments. It will suffice to recall how Felix Klein, in a question relative to Riemann surfaces, has had recourse to the properties of electric currents.

On the other hand, physics provides us not just with solutions; it also gives us, to some extent, arguments. It’s enough to remember how Felix Klein, in a question related to Riemann surfaces, referred to the properties of electric currents.

It is true, the arguments of this species are not rigorous, in the sense the analyst attaches to this word. And here a question arises: How can a demonstration not sufficiently rigorous for the analyst suffice for the physicist? It seems there can not be two rigors, that rigor is or is not, and that, where it is not there can not be deduction.

It’s true that the arguments of this type aren’t strict in the way the analyst understands that term. This raises a question: How can a demonstration that isn't rigorous enough for the analyst be enough for the physicist? It seems there can't be two types of rigor; rigor either exists or it doesn't, and where it doesn't exist, there can't be any deduction.

This apparent paradox will be better understood by recalling[Pg 288] under what conditions number is applied to natural phenomena. Whence come in general the difficulties encountered in seeking rigor? We strike them almost always in seeking to establish that some quantity tends to some limit, or that some function is continuous, or that it has a derivative.

This seeming contradiction will be clearer if we remember[Pg 288] the conditions under which numbers are used to describe natural phenomena. Where do the challenges we face in pursuing precision generally arise? We almost always encounter them when we try to prove that a certain quantity approaches a limit, or that a function is continuous, or that it has a derivative.

Now the numbers the physicist measures by experiment are never known except approximately; and besides, any function always differs as little as you choose from a discontinuous function, and at the same time it differs as little as you choose from a continuous function. The physicist may, therefore, at will suppose that the function studied is continuous, or that it is discontinuous; that it has or has not a derivative; and may do so without fear of ever being contradicted, either by present experience or by any future experiment. We see that with such liberty he makes sport of difficulties which stop the analyst. He may always reason as if all the functions which occur in his calculations were entire polynomials.

Now, the numbers that physicists measure in experiments are never known exactly, just roughly; and any function can differ by as little as you want from a discontinuous function, while at the same time, it can differ by as little as you want from a continuous function. Therefore, the physicist can freely assume that the function being studied is continuous or discontinuous; that it has a derivative or doesn't; and they can do this without worrying about being contradicted by current observations or any future experiments. We can see that with such freedom, they can play around with problems that would stump the analyst. They can always approach their calculations as if all the functions involved were simple polynomials.

Thus the sketch which suffices for physics is not the deduction which analysis requires. It does not follow thence that one can not aid in finding the other. So many physical sketches have already been transformed into rigorous demonstrations that to-day this transformation is easy. There would be plenty of examples did I not fear in citing them to tire the reader.

Thus, the outline that's enough for physics isn't the same as the detailed reasoning that analysis needs. That doesn't mean one can't help in discovering the other. So many physical outlines have already been turned into strict proofs that today this transformation is straightforward. There would be plenty of examples if I weren't worried about boring the reader by mentioning them.

I hope I have said enough to show that pure analysis and mathematical physics may serve one another without making any sacrifice one to the other, and that each of these two sciences should rejoice in all which elevates its associate.

I hope I've said enough to show that pure analysis and mathematical physics can support each other without sacrificing anything, and that both of these fields should take pride in anything that uplifts the other.


CHAPTER VI

Astrophysics

Governments and parliaments must find that astronomy is one of the sciences which cost most dear: the least instrument costs hundreds of thousands of dollars, the least observatory costs millions; each eclipse carries with it supplementary appropriations. And all that for stars which are so far away, which are complete strangers to our electoral contests, and in all probability will never take any part in them. It must be that our politicians have retained a remnant of idealism, a vague instinct for what is grand; truly, I think they have been calumniated; they should be encouraged and shown that this instinct does not deceive them, that they are not dupes of that idealism.

Governments and parliaments need to realize that astronomy is one of the most expensive sciences: even the simplest instruments cost hundreds of thousands of dollars, and basic observatories can cost millions; each eclipse brings the need for extra funding. And all of this for stars that are so distant, completely irrelevant to our elections, and most likely will never play a role in them. It seems that our politicians have kept a bit of idealism, a faint sense of what is grand; honestly, I believe they have been unfairly criticized; they should be supported and shown that this instinct is valid, that they are not being fooled by their idealism.

We might indeed speak to them of navigation, of which no one can underestimate the importance, and which has need of astronomy. But this would be to take the question by its smaller side.

We could definitely talk to them about navigation, which is something nobody can overlook in terms of its importance, and which depends on astronomy. But that would be to approach the issue from a narrower perspective.

Astronomy is useful because it raises us above ourselves; it is useful because it is grand; that is what we should say. It shows us how small is man's body, how great his mind, since his intelligence can embrace the whole of this dazzling immensity, where his body is only an obscure point, and enjoy its silent harmony. Thus we attain the consciousness of our power, and this is something which can not cost too dear, since this consciousness makes us mightier.

Astronomy is valuable because it lifts us beyond our everyday lives; it is valuable because it is magnificent; that's what we should acknowledge. It reveals how small our physical presence is, yet how vast our minds can be, as our intellect is capable of grasping this incredible expanse, where our bodies are just tiny specks, and appreciating its quiet beauty. In this way, we become aware of our own strength, and this realization is priceless because it empowers us.

But what I should wish before all to show is, to what point astronomy has facilitated the work of the other sciences, more directly useful, since it has given us a soul capable of comprehending nature.

But what I really want to highlight is how much astronomy has helped the other sciences that are more directly useful, because it has given us a mind capable of understanding nature.

Think how diminished humanity would be if, under heavens constantly overclouded, as Jupiter's must be, it had forever remained ignorant of the stars. Do you think that in such a world we should be what we are? I know well that under this somber vault we should have been deprived of the light of the[Pg 290] sun, necessary to organisms like those which inhabit the earth. But if you please, we shall assume that these clouds are phosphorescent and emit a soft and constant light. Since we are making hypotheses, another will cost no more. Well! I repeat my question: Do you think that in such a world we should be what we are?

Think how diminished humanity would be if, under skies that are constantly overcast, like Jupiter's must be, we had remained forever unaware of the stars. Do you think we would be who we are in such a world? I know that under this gloomy sky we would be lacking the sunlight, essential for living beings like those that exist on Earth. But if you don’t mind, let’s assume these clouds are glowing and give off a soft, constant light. Since we’re making assumptions, adding another one doesn't cost us anything. So, I ask again: Do you think we would be who we are in such a world?

The stars send us not only that visible and gross light which strikes our bodily eyes, but from them also comes to us a light far more subtle, which illuminates our minds and whose effects I shall try to show you. You know what man was on the earth some thousands of years ago, and what he is to-day. Isolated amidst a nature where everything was a mystery to him, terrified at each unexpected manifestation of incomprehensible forces, he was incapable of seeing in the conduct of the universe anything but caprice; he attributed all phenomena to the action of a multitude of little genii, fantastic and exacting, and to act on the world he sought to conciliate them by means analogous to those employed to gain the good graces of a minister or a deputy. Even his failures did not enlighten him, any more than to-day a beggar refused is discouraged to the point of ceasing to beg.

The stars provide us not just the visible light that hits our eyes, but they also send a more subtle light that enlightens our minds, and I will attempt to demonstrate its effects. You know what humanity was like on Earth thousands of years ago and what it is today. Isolated in a world where everything was a mystery, scared by each unexpected display of incomprehensible forces, people could only see the universe's behavior as random; they attributed all occurrences to the actions of many whimsical and demanding little spirits. To influence the world, they tried to win these spirits over using methods similar to those used to curry favor with a government minister or representative. Even their failures didn’t teach them anything, just as today, a rejected beggar doesn’t get discouraged enough to stop begging.

To-day we no longer beg of nature; we command her, because we have discovered certain of her secrets and shall discover others each day. We command her in the name of laws she can not challenge, because they are hers; these laws we do not madly ask her to change, we are the first to submit to them. Nature can only be governed by obeying her.

Today, we no longer beg nature; we command her because we have uncovered some of her secrets and will discover more each day. We command her based on laws she cannot contest because they are her own; we don’t foolishly ask her to change these laws; we are the first to follow them. Nature can only be controlled by obeying her.

What a change must our souls have undergone to pass from the one state to the other! Does any one believe that, without the lessons of the stars, under the heavens perpetually overclouded that I have just supposed, they would have changed so quickly? Would the metamorphosis have been possible, or at least would it not have been much slower?

What a change our souls must have gone through to move from one state to another! Does anyone really believe that, without the teachings of the stars, in the constantly overcast sky that I've just imagined, they would have changed so quickly? Would that transformation have been possible, or at least wouldn't it have been much slower?

And first of all, astronomy it is which taught that there are laws. The Chaldeans, who were the first to observe the heavens with some attention, saw that this multitude of luminous points is not a confused crowd wandering at random, but rather a disciplined army. Doubtless the rules of this discipline escaped them, but the harmonious spectacle of the starry night sufficed to give[Pg 291] them the impression of regularity, and that was in itself already a great thing. Besides, these rules were discerned by Hipparchus, Ptolemy, Copernicus, Kepler, one after another, and finally, it is needless to recall that Newton it was who enunciated the oldest, the most precise, the most simple, the most general of all natural laws.

And first of all, astronomy taught us that there are laws. The Chaldeans, who were the first to closely observe the heavens, recognized that this multitude of shining points isn’t just a random crowd but rather a disciplined army. They may not have understood the rules of this discipline, but the beautiful sight of the starry night was enough to give[Pg 291] them a sense of order, which was already a significant achievement. Moreover, these rules were uncovered by Hipparchus, Ptolemy, Copernicus, Kepler, one after another, and, of course, it's important to mention that it was Newton who articulated the oldest, most precise, simplest, and most universal of all natural laws.

And then, taught by this example, we have seen our little terrestrial world better and, under the apparent disorder, there also we have found again the harmony that the study of the heavens had revealed to us. It also is regular, it also obeys immutable laws, but they are more complicated, in apparent conflict one with another, and an eye untrained by other sights would have seen there only chaos and the reign of chance or caprice. If we had not known the stars, some bold spirits might perhaps have sought to foresee physical phenomena; but their failures would have been frequent, and they would have excited only the derision of the vulgar; do we not see, that even in our day the meteorologists sometimes deceive themselves, and that certain persons are inclined to laugh at them.

And then, learning from this example, we have understood our little world better and, beneath the apparent chaos, we have found the harmony that studying the heavens revealed to us. It is also orderly; it also follows unchanging laws, but those laws are more complex and seemingly at odds with one another, and an untrained eye would see nothing but chaos and randomness. If we hadn't known about the stars, some adventurous thinkers might have tried to predict physical events; but their attempts would have often failed, leading to mockery from the general public. Even today, we see that meteorologists sometimes get it wrong, and some people tend to laugh at them.

How often would the physicists, disheartened by so many checks, have fallen into discouragement, if they had not had, to sustain their confidence, the brilliant example of the success of the astronomers! This success showed them that nature obeys laws; it only remained to know what laws; for that they only needed patience, and they had the right to demand that the sceptics should give them credit.

How often would the physicists, frustrated by so many obstacles, have fallen into despair if they hadn’t had the inspiring success of the astronomers to boost their confidence? This success proved that nature follows rules; they just needed to figure out what those rules were. All they needed was patience, and they had every right to expect that the skeptics would acknowledge their efforts.

This is not all: astronomy has not only taught us that there are laws, but that from these laws there is no escape, that with them there is no possible compromise. How much time should we have needed to comprehend that fact, if we had known only the terrestrial world, where each elemental force would always seem to us in conflict with other forces? Astronomy has taught us that the laws are infinitely precise, and that if those we enunciate are approximative, it is because we do not know them well. Aristotle, the most scientific mind of antiquity, still accorded a part to accident, to chance, and seemed to think that the laws of nature, at least here below, determine only the large features of phenomena. How much has the ever-increasing precision of[Pg 292] astronomical predictions contributed to correct such an error, which would have rendered nature unintelligible!

This isn't all: astronomy has not only shown us that there are laws, but that there's no way to escape them and no compromise with them. How long would it have taken us to understand that if we had only known the earthly world, where it always seems like one elemental force is in conflict with another? Astronomy has taught us that the laws are incredibly precise, and when our statements are approximate, it's because we don’t understand them fully. Aristotle, the most scientific thinker of ancient times, still allowed room for accident and chance and seemed to believe that the laws of nature, at least on Earth, only determine the broad outlines of phenomena. How much has the growing accuracy of [Pg 292] astronomical predictions helped to correct such an error, which would have made nature incomprehensible!

But are these laws not local, varying in different places, like those which men make; does not that which is truth in one corner of the universe, on our globe, for instance, or in our little solar system, become error a little farther away? And then could it not be asked whether laws depending on space do not also depend upon time, whether they are not simple habitudes, transitory, therefore, and ephemeral? Again it is astronomy that answers this question. Consider the double stars; all describe conics; thus, as far as the telescope carries, it does not reach the limits of the domain which obeys Newton's law.

But aren't these laws local, different in various places, just like those made by humans? Doesn't what is considered true in one part of the universe, such as our planet or our small solar system, become false just a bit farther away? And couldn't we ask whether laws that depend on space also depend on time, whether they are just habits that are temporary and fleeting? Again, it's astronomy that provides the answer to this question. Look at the double stars; they all follow conic shapes; thus, as far as the telescope can see, it hasn't reached the boundaries of the realm that follows Newton's law.

Even the simplicity of this law is a lesson for us; how many complicated phenomena are contained in the two lines of its enunciation; persons who do not understand celestial mechanics may form some idea of it at least from the size of the treatises devoted to this science; and then it may be hoped that the complication of physical phenomena likewise hides from us some simple cause still unknown.

Even the simplicity of this law teaches us something; countless complicated phenomena are encapsulated in just two lines of its explanation. People who don’t understand celestial mechanics can at least get an idea of it from the length of the books written on this subject. And it’s possible that the complexity of physical phenomena also conceals some simple, yet unknown, cause from us.

It is therefore astronomy which has shown us what are the general characteristics of natural laws; but among these characteristics there is one, the most subtle and the most important of all, which I shall ask leave to stress.

It is astronomy that has revealed to us the key features of natural laws; however, among these features, there is one— the most nuanced and the most significant of all— that I would like to emphasize.

How was the order of the universe understood by the ancients; for instance, by Pythagoras, Plato or Aristotle? It was either an immutable type fixed once for all, or an ideal to which the world sought to approach. Kepler himself still thought thus when, for instance, he sought whether the distances of the planets from the sun had not some relation to the five regular polyhedrons. This idea contained nothing absurd, but it was sterile, since nature is not so made. Newton has shown us that a law is only a necessary relation between the present state of the world and its immediately subsequent state. All the other laws since discovered are nothing else; they are in sum, differential equations; but it is astronomy which furnished the first model for them, without which we should doubtless long have erred.

How did the ancients, like Pythagoras, Plato, or Aristotle, understand the order of the universe? They either viewed it as a fixed, unchanging type or as an ideal that the world aimed to reach. Kepler still held this belief when he explored whether the distances of the planets from the sun related to the five regular polyhedrons. This idea wasn’t foolish, but it was unproductive, as nature doesn’t work that way. Newton showed us that a law is simply a necessary relationship between the current state of the world and its immediate future state. All the other laws discovered since then are essentially the same; they are, in essence, differential equations. However, astronomy provided the first model for them, without which we surely would have made many mistakes.

Astronomy has also taught us to set at naught appearances.[Pg 293] The day Copernicus proved that what was thought the most stable was in motion, that what was thought moving was fixed, he showed us how deceptive could be the infantile reasonings which spring directly from the immediate data of our senses. True, his ideas did not easily triumph, but since this triumph there is no longer a prejudice so inveterate that we can not shake it off. How can we estimate the value of the new weapon thus won?

Astronomy has also taught us to ignore appearances.[Pg 293] When Copernicus demonstrated that what we believed to be the most stable was actually in motion, and that what we thought was moving was actually fixed, he revealed how misleading the simplistic reasoning based solely on our immediate sensory information can be. It's true that his ideas didn’t gain acceptance easily, but since that victory, there are no longer any biases so deeply rooted that we can’t overcome them. How can we measure the value of this new tool we've gained?

The ancients thought everything was made for man, and this illusion must be very tenacious, since it must ever be combated. Yet it is necessary to divest oneself of it; or else one will be only an eternal myope, incapable of seeing the truth. To comprehend nature one must be able to get out of self, so to speak, and to contemplate her from many different points of view; otherwise we never shall know more than one side. Now, to get out of self is what he who refers everything to himself can not do. Who delivered us from this illusion? It was those who showed us that the earth is only one of the smallest planets of the solar system, and that the solar system itself is only an imperceptible point in the infinite spaces of the stellar universe.

The ancients believed everything was made for humans, and this idea is surprisingly persistent, as it needs to be constantly challenged. However, it’s essential to let go of this belief; otherwise, we remain forever shortsighted and unable to see the truth. To understand nature, we must step outside of ourselves, so to speak, and view it from various perspectives; otherwise, we’ll only ever see one side. Those who can’t detach from their self-centered perspective will never manage this. Who freed us from this misconception? It was those who revealed to us that Earth is just one of the smallest planets in the solar system, and that the solar system itself is merely a tiny dot in the vast expanses of the universe.

At the same time astronomy taught us not to be afraid of big numbers. This was needful, not only for knowing the heavens, but to know the earth itself; and was not so easy as it seems to us to-day. Let us try to go back and picture to ourselves what a Greek would have thought if told that red light vibrates four hundred millions of millions of times per second. Without any doubt, such an assertion would have appeared to him pure madness, and he never would have lowered himself to test it. To-day a hypothesis will no longer appear absurd to us because it obliges us to imagine objects much larger or smaller than those our senses are capable of showing us, and we no longer comprehend those scruples which arrested our predecessors and prevented them from discovering certain truths simply because they were afraid of them. But why? It is because we have seen the heavens enlarging and enlarging without cease; because we know that the sun is 150 millions of kilometers from the earth and that the distances of the nearest stars are hundreds of thousands of times greater yet. Habituated to the contemplation of the infinitely great, we have become apt to comprehend[Pg 294] the infinitely small. Thanks to the education it has received, our imagination, like the eagle's eye that the sun does not dazzle, can look truth in the face.

At the same time, astronomy taught us not to be afraid of big numbers. This was important not just for understanding the heavens, but also for understanding the earth itself; and it wasn't as easy as it seems to us today. Let’s try to imagine what a Greek would have thought if told that red light vibrates four hundred million million times per second. Without a doubt, such a claim would have seemed pure madness to him, and he never would have considered testing it. Today, a hypothesis doesn’t seem absurd to us just because it requires us to imagine things much larger or smaller than what our senses can show us, and we no longer understand the hesitations that held our predecessors back from discovering certain truths simply because they were afraid of them. But why? It's because we have seen the universe expanding endlessly; because we know that the sun is 150 million kilometers away from the earth and that the distances to the nearest stars are hundreds of thousands of times greater. Used to contemplating the infinitely large, we have become capable of understanding the infinitely small. Thanks to the education we’ve received, our imagination, like an eagle's eye that the sun does not blind, can face the truth directly.

Was I wrong in saying that it is astronomy which has made us a soul capable of comprehending nature; that under heavens always overcast and starless, the earth itself would have been for us eternally unintelligible; that we should there have seen only caprice and disorder; and that, not knowing the world, we should never have been able to subdue it? What science could have been more useful? And in thus speaking I put myself at the point of view of those who only value practical applications. Certainly, this point of view is not mine; as for me, on the contrary, if I admire the conquests of industry, it is above all because if they free us from material cares, they will one day give to all the leisure to contemplate nature. I do not say: Science is useful, because it teaches us to construct machines. I say: Machines are useful, because in working for us, they will some day leave us more time to make science. But finally it is worth remarking that between the two points of view there is no antagonism, and that man having pursued a disinterested aim, all else has been added unto him.

Was I wrong to say that astronomy has made us capable of understanding nature? That in a sky always cloudy and starless, the earth would have been completely unintelligible to us? That we would have seen only randomness and chaos there, and if we didn’t know the world, we would never have been able to control it? What science could be more beneficial? And in saying this, I’m speaking from the perspective of those who only appreciate practical applications. Certainly, that’s not my perspective; for me, while I admire industrial achievements, it’s mainly because they free us from material concerns, allowing everyone the time to appreciate nature. I don’t say: Science is useful because it teaches us to build machines. I say: Machines are useful because by working for us, they will eventually give us more time to pursue science. But it’s important to note that there’s no conflict between the two viewpoints, and that when humans pursue a selfless goal, everything else will fall into place for them.

Auguste Comte has said somewhere, that it would be idle to seek to know the composition of the sun, since this knowledge would be of no use to sociology. How could he be so short-sighted? Have we not just seen that it is by astronomy that, to speak his language, humanity has passed from the theological to the positive state? He found an explanation for that because it had happened. But how has he not understood that what remained to do was not less considerable and would be not less profitable? Physical astronomy, which he seems to condemn, has already begun to bear fruit, and it will give us much more, for it only dates from yesterday.

Auguste Comte once said that trying to understand the composition of the sun would be pointless since that knowledge wouldn't help sociology. How could he be so narrow-minded? Haven't we just seen that astronomy has helped humanity transition from the theological to the positive state? He found an explanation for that because it had already happened. But how did he not realize that the work ahead is just as significant and will also be valuable? Physical astronomy, which he seems to dismiss, has already started to yield results, and it has much more to offer since it's only just begun.

First was discovered the nature of the sun, what the founder of positivism wished to deny us, and there bodies were found which exist on the earth, but had here remained undiscovered; for example, helium, that gas almost as light as hydrogen. That already contradicted Comte. But to the spectroscope we owe a lesson precious in a quite different way; in the most distant stars,[Pg 295] it shows us the same substances. It might have been asked whether the terrestrial elements were not due to some chance which had brought together more tenuous atoms to construct of them the more complex edifice that the chemists call atom; whether, in other regions of the universe, other fortuitous meetings had not engendered edifices entirely different. Now we know that this is not so, that the laws of our chemistry are the general laws of nature, and that they owe nothing to the chance which caused us to be born on the earth.

First, the nature of the sun was discovered, something the founder of positivism wanted to deny us, and there were bodies found that exist on Earth but had remained undiscovered here; for example, helium, a gas almost as light as hydrogen. This already contradicted Comte. But the spectroscope taught us a lesson that was valuable in a different way; in the most distant stars,[Pg 295] it reveals the same substances. One might have questioned whether terrestrial elements were just the result of chance that brought together lighter atoms to form the more complex structure that chemists refer to as an atom; whether, in other parts of the universe, other random gatherings had produced entirely different structures. Now we know that this isn’t the case, that the laws of our chemistry are the universal laws of nature, and that they are not the product of the chance that led us to be born on Earth.

But, it will be said, astronomy has given to the other sciences all it can give them, and now that the heavens have procured for us the instruments which enable us to study terrestrial nature, they could without danger veil themselves forever. After what we have just said, is there still need to answer this objection? One could have reasoned the same in Ptolemy's time; then also men thought they knew everything, and they still had almost everything to learn.

But some might argue that astronomy has provided all it can to other sciences, and now that the heavens have given us the tools to study the natural world, they could safely hide away forever. After what we've just discussed, is there still a need to respond to this point? One could have made the same argument back in Ptolemy's time; people then also believed they knew everything, and they still had nearly everything left to learn.

The stars are majestic laboratories, gigantic crucibles, such as no chemist could dream. There reign temperatures impossible for us to realize. Their only defect is being a little far away; but the telescope will soon bring them near to us, and then we shall see how matter acts there. What good fortune for the physicist and the chemist!

The stars are incredible laboratories, massive crucibles that no chemist could ever imagine. They have temperatures beyond our comprehension. Their only downside is that they're a bit too far away, but telescopes will soon bring them closer, and then we'll be able to see how matter behaves there. What a great opportunity for physicists and chemists!

Matter will there exhibit itself to us under a thousand different states, from those rarefied gases which seem to form the nebulæ and which are luminous with I know not what glimmering of mysterious origin, even to the incandescent stars and to the planets so near and yet so different.

Matter will show itself to us in a thousand different forms, from those thin gases that appear to make up the nebulas and glow with some unknown shimmer of mysterious origin, all the way to the bright stars and the planets that are so close yet so different.

Perchance even, the stars will some day teach us something about life; that seems an insensate dream and I do not at all see how it can be realized; but, a hundred years ago, would not the chemistry of the stars have also appeared a mad dream?

Maybe someday the stars will teach us something about life; that seems like a foolish dream and I really don’t see how it could come true; but, a hundred years ago, wouldn’t the chemistry of the stars have seemed just as crazy?

But limiting our views to horizons less distant, there still will remain to us promises less contingent and yet sufficiently seductive. If the past has given us much, we may rest assured that the future will give us still more.

But if we limit our perspective to shorter horizons, there will still be promises that are less uncertain yet still appealing. If the past has given us a lot, we can be confident that the future will give us even more.

In sum, it is incredible how useful belief in astrology has been to humanity. If Kepler and Tycho Brahe made a living,[Pg 296] it was because they sold to naïve kings predictions founded on the conjunctions of the stars. If these princes had not been so credulous, we should perhaps still believe that nature obeys caprice, and we should still wallow in ignorance.

In summary, it's amazing how helpful belief in astrology has been to humanity. If Kepler and Tycho Brahe made a living,[Pg 296] it was because they sold predictions based on the alignments of the stars to gullible kings. If these rulers hadn't been so trusting, we might still think that nature follows randomness, and we would still be stuck in ignorance.


CHAPTER VII

The History of Math Physics

The Past and the Future of Physics.—What is the present state of mathematical physics? What are the problems it is led to set itself? What is its future? Is its orientation about to be modified?

The Past and the Future of Physics.—What is the current state of mathematical physics? What problems is it looking to address? What does the future hold? Is its direction about to change?

Ten years hence will the aim and the methods of this science appear to our immediate successors in the same light as to ourselves; or, on the contrary, are we about to witness a profound transformation? Such are the questions we are forced to raise in entering to-day upon our investigation.

Ten years from now, will the goals and methods of this science seem the same to our immediate successors as they do to us? Or, on the other hand, are we about to see a major transformation? These are the questions we must consider as we begin our investigation today.

If it is easy to propound them: to answer is difficult. If we felt tempted to risk a prediction, we should easily resist this temptation, by thinking of all the stupidities the most eminent savants of a hundred years ago would have uttered, if some one had asked them what the science of the nineteenth century would be. They would have thought themselves bold in their predictions, and after the event, how very timid we should have found them. Do not, therefore, expect of me any prophecy.

If it's easy to put them out there, answering is tough. If we felt tempted to make a prediction, we’d quickly resist that urge by remembering all the mistakes the top scientists from a hundred years ago would have made if someone had asked them what science would look like in the nineteenth century. They would have thought they were being bold with their predictions, and looking back, we’d see how cautious they really were. So, don’t expect any prophecies from me.

But if, like all prudent physicians, I shun giving a prognosis, yet I can not dispense with a little diagnostic; well, yes, there are indications of a serious crisis, as if we might expect an approaching transformation. Still, be not too anxious: we are sure the patient will not die of it, and we may even hope that this crisis will be salutary, for the history of the past seems to guarantee us this. This crisis, in fact, is not the first, and to understand it, it is important to recall those which have preceded. Pardon then a brief historical sketch.

But if, like all careful doctors, I avoid giving a prognosis, I can’t skip offering a little diagnosis; yes, there are signs of a serious crisis, as if we might expect a significant change soon. Still, don’t be too worried: we’re certain the patient won’t die from it, and we can even hope that this crisis will be beneficial, since history suggests this is likely. This crisis, in fact, isn’t the first, and to understand it, it’s important to remember the ones that came before. So please allow me a brief historical overview.

The Physics of Central Forces.—Mathematical physics, as we know, was born of celestial mechanics, which gave birth to it at the end of the eighteenth century, at the moment when it itself attained its complete development. During its first years especially, the infant strikingly resembled its mother.[Pg 298]

The Physics of Central Forces.—Mathematical physics, as we understand it today, originated from celestial mechanics, which emerged at the end of the eighteenth century, just as it itself reached full development. In its early years, it resembled its predecessor quite closely.[Pg 298]

The astronomic universe is formed of masses, very great, no doubt, but separated by intervals so immense that they appear to us only as material points. These points attract each other inversely as the square of the distance, and this attraction is the sole force which influences their movements. But if our senses were sufficiently keen to show us all the details of the bodies which the physicist studies, the spectacle thus disclosed would scarcely differ from the one the astronomer contemplates. There also we should see material points, separated from one another by intervals, enormous in comparison with their dimensions, and describing orbits according to regular laws. These infinitesimal stars are the atoms. Like the stars proper, they attract or repel each other, and this attraction or this repulsion, following the straight line which joins them, depends only on the distance. The law according to which this force varies as function of the distance is perhaps not the law of Newton, but it is an analogous law; in place of the exponent −2, we have probably a different exponent, and it is from this change of exponent that arises all the diversity of physical phenomena, the variety of qualities and of sensations, all the world, colored and sonorous, which surrounds us; in a word, all nature.

The astronomical universe consists of very large masses, but they are separated by such vast distances that they only seem to us like tiny points. These points attract each other inversely as the square of the distance, and this attraction is the only force that affects their movements. However, if our senses were sharp enough to reveal all the details of the bodies that physicists study, the view we would see wouldn't be much different from what astronomers observe. We would see material points, spaced apart by distances that are huge compared to their sizes, following orbits based on regular laws. These tiny stars are the atoms. Like the actual stars, they attract or repel each other, and this attraction or repulsion, along the straight line connecting them, depends solely on the distance. The law governing how this force changes with distance may not be Newton's law, but it is a similar concept; instead of the exponent -2, we likely have a different exponent, and this shift in exponent leads to the wide range of physical phenomena, the variety of qualities and sensations, and all the colorful and sound-filled world around us; in short, all of nature.

Such is the primitive conception in all its purity. It only remains to seek in the different cases what value should be given to this exponent in order to explain all the facts. It is on this model that Laplace, for example, constructed his beautiful theory of capillarity; he regards it only as a particular case of attraction, or, as he says, of universal gravitation, and no one is astonished to find it in the middle of one of the five volumes of the 'Mécanique céleste.' More recently Briot believes he penetrated the final secret of optics in demonstrating that the atoms of ether attract each other in the inverse ratio of the sixth power of the distance; and Maxwell himself, does he not say somewhere that the atoms of gases repel each other in the inverse ratio of the fifth power of the distance? We have the exponent −6, or −5, in place of the exponent −2, but it is always an exponent.

This is the basic idea in all its simplicity. We just need to look at different cases to determine what value should be assigned to this exponent in order to explain all the facts. It's on this basis that Laplace, for instance, developed his elegant theory of capillarity; he views it merely as a specific case of attraction, or, as he puts it, of universal gravitation, and no one is surprised to find it in the middle of one of the five volumes of the 'Mécanique céleste.' More recently, Briot believes he has uncovered the ultimate secret of optics by showing that the atoms of ether attract each other according to the inverse of the sixth power of the distance; and Maxwell himself, doesn’t he mention somewhere that the atoms of gases repel each other according to the inverse of the fifth power of the distance? We have the exponent −6, or −5, instead of the exponent −2, but it’s still an exponent.

Among the theories of this epoch, one alone is an exception, that of Fourier; in it are indeed atoms acting at a distance one upon the other; they mutually transmit heat, but they do not[Pg 299] attract, they never budge. From this point of view, Fourier's theory must have appeared to the eyes of his contemporaries, to those of Fourier himself, as imperfect and provisional.

Among the theories of this time, one stands out as an exception: Fourier's. In it, there are indeed atoms that act on each other from a distance; they transfer heat to one another, but they don’t attract or move at all. From this perspective, Fourier's theory must have seemed to his contemporaries, and even to Fourier himself, as flawed and temporary.

This conception was not without grandeur; it was seductive, and many among us have not finally renounced it; they know that one will attain the ultimate elements of things only by patiently disentangling the complicated skein that our senses give us; that it is necessary to advance step by step, neglecting no intermediary; that our fathers were wrong in wishing to skip stations; but they believe that when one shall have arrived at these ultimate elements, there again will be found the majestic simplicity of celestial mechanics.

This idea was not lacking in grandeur; it was appealing, and many of us haven't completely let it go; we understand that we can reach the fundamental elements of things only by patiently untangling the complex web our senses provide us; that we need to move forward step by step, not overlooking any stages in the process; that our ancestors were mistaken in their desire to bypass these stages; but they believe that once we reach these fundamental elements, we will again encounter the majestic simplicity of celestial mechanics.

Neither has this conception been useless; it has rendered us an inestimable service, since it has contributed to make precise the fundamental notion of the physical law.

Neither has this idea been useless; it has provided us with invaluable service, as it has helped clarify the fundamental concept of physical law.

I will explain myself; how did the ancients understand law? It was for them an internal harmony, static, so to say, and immutable; or else it was like a model that nature tried to imitate. For us a law is something quite different; it is a constant relation between the phenomenon of to-day and that of to-morrow; in a word, it is a differential equation.

I will clarify my point; how did ancient people perceive law? For them, it was an internal harmony, stable and unchanging, so to speak; or it resembled a model that nature attempted to replicate. For us, law is something entirely different; it represents a continuous relationship between today's events and those of tomorrow; in short, it is a differential equation.

Behold the ideal form of physical law; well, it is Newton's law which first clothed it forth. If then one has acclimated this form in physics, it is precisely by copying as far as possible this law of Newton, that is by imitating celestial mechanics. This is, moreover, the idea I have tried to bring out in Chapter VI.

Behold the perfect example of physical law; it is Newton's law that first presented it. If someone has adapted this form in physics, it's specifically by closely modeling this law of Newton, meaning by mimicking celestial mechanics. This is, by the way, the concept I have attempted to emphasize in Chapter VI.

The Physics of the Principles.—Nevertheless, a day arrived when the conception of central forces no longer appeared sufficient, and this is the first of those crises of which I just now spoke.

The Physics of the Principles.—However, a day came when the idea of central forces didn’t seem adequate anymore, and this marks the first of those crises I just mentioned.

What was done then? The attempt to penetrate into the detail of the structure of the universe, to isolate the pieces of this vast mechanism, to analyze one by one the forces which put them in motion, was abandoned, and we were content to take as guides certain general principles the express object of which is to spare us this minute study. How so? Suppose we have before us any machine; the initial wheel work and the final wheel work alone[Pg 300] are visible, but the transmission, the intermediary machinery by which the movement is communicated from one to the other, is hidden in the interior and escapes our view; we do not know whether the communication is made by gearing or by belts, by connecting-rods or by other contrivances. Do we say that it is impossible for us to understand anything about this machine so long as we are not permitted to take it to pieces? You know well we do not, and that the principle of the conservation of energy suffices to determine for us the most interesting point. We easily ascertain that the final wheel turns ten times less quickly than the initial wheel, since these two wheels are visible; we are able thence to conclude that a couple applied to the one will be balanced by a couple ten times greater applied to the other. For that there is no need to penetrate the mechanism of this equilibrium and to know how the forces compensate each other in the interior of the machine; it suffices to be assured that this compensation can not fail to occur.

What was done then? The effort to delve into the details of the universe’s structure, to isolate the parts of this vast mechanism, and to analyze the forces that move them one by one, was set aside. Instead, we accepted certain general principles as guides to avoid this detailed study. How so? Imagine we have a machine in front of us; only the initial and final gears are visible[Pg 300], while the transmission and the intermediary mechanisms that communicate the movement from one to the other are hidden inside and out of sight. We don’t know whether the connection is made by gears, belts, connecting rods, or other devices. Do we claim that it’s impossible to understand anything about this machine as long as we can’t take it apart? You know well we don’t, and that the principle of conservation of energy is enough for us to identify the most interesting point. We can easily determine that the final gear turns ten times more slowly than the initial gear, since these two are visible; we can then conclude that a force applied to one will be balanced by a force ten times greater applied to the other. To do this, we don’t need to explore the inner workings of this balance or understand how the forces compensate each other within the machine; it’s enough to know that this compensation must occur.

Well, in regard to the universe, the principle of the conservation of energy is able to render us the same service. The universe is also a machine, much more complicated than all those of industry, of which almost all the parts are profoundly hidden from us; but in observing the motion of those that we can see, we are able, by the aid of this principle, to draw conclusions which remain true whatever may be the details of the invisible mechanism which animates them.

Well, when it comes to the universe, the principle of energy conservation can serve us in the same way. The universe is also a machine, much more complex than any industrial machine, with most of its parts being deeply hidden from us; however, by observing the motion of the parts we can see, we can use this principle to make conclusions that hold true no matter what the details of the invisible mechanisms are that drive them.

The principle of the conservation of energy, or Mayer's principle, is certainly the most important, but it is not the only one; there are others from which we can derive the same advantage. These are:

The principle of the conservation of energy, or Mayer's principle, is definitely the most significant, but it's not the only one; there are others from which we can gain the same benefit. These are:

Carnot's principle, or the principle of the degradation of energy.

Carnot's principle, or the principle of energy degradation.

Newton's principle, or the principle of the equality of action and reaction.

Newton's principle, also known as the principle of action and reaction being equal.

The principle of relativity, according to which the laws of physical phenomena must be the same for a stationary observer as for an observer carried along in a uniform motion of translation; so that we have not and can not have any means of discerning whether or not we are carried along in such a motion.[Pg 301]

The principle of relativity states that the laws of physical phenomena must be the same for a stationary observer as for someone moving at a constant speed; therefore, we have no way of knowing whether we are in motion or not.[Pg 301]

The principle of the conservation of mass, or Lavoisier's principle.

The principle of the conservation of mass, or Lavoisier's principle.

I will add the principle of least action.

I will include the principle of least action.

The application of these five or six general principles to the different physical phenomena is sufficient for our learning of them all that we could reasonably hope to know of them. The most remarkable example of this new mathematical physics is, beyond question, Maxwell's electromagnetic theory of light.

The use of these five or six general principles in relation to various physical phenomena is enough for us to learn all we can reasonably expect to know about them. The most notable example of this new mathematical physics is definitely Maxwell's electromagnetic theory of light.

We know nothing as to what the ether is, how its molecules are disposed, whether they attract or repel each other; but we know that this medium transmits at the same time the optical perturbations and the electrical perturbations; we know that this transmission must take place in conformity with the general principles of mechanics, and that suffices us for the establishment of the equations of the electromagnetic field.

We don't know what ether is, how its molecules are arranged, or whether they attract or repel each other; but we do know that this medium transmits both optical and electrical disturbances at the same time. We understand that this transmission must follow the general principles of mechanics, and that's enough for us to establish the equations of the electromagnetic field.

These principles are results of experiments boldly generalized; but they seem to derive from their very generality a high degree of certainty. In fact, the more general they are, the more frequent are the opportunities to check them, and the verifications multiplying, taking the most varied, the most unexpected forms, end by no longer leaving place for doubt.

These principles come from experiments that have been confidently generalized; however, their broad nature gives them a high level of certainty. In fact, the more general they are, the more often we have the chance to test them, and as these tests increase, taking many different and unexpected forms, there’s no room left for doubt.

Utility of the Old Physics.—Such is the second phase of the history of mathematical physics and we have not yet emerged from it. Shall we say that the first has been useless? that during fifty years science went the wrong way, and that there is nothing left but to forget so many accumulated efforts that a vicious conception condemned in advance to failure? Not the least in the world. Do you think the second phase could have come into existence without the first? The hypothesis of central forces contained all the principles; it involved them as necessary consequences; it involved both the conservation of energy and that of masses, and the equality of action and reaction, and the law of least action, which appeared, it is true, not as experimental truths, but as theorems; the enunciation of which had at the same time something more precise and less general than under their present form.

Utility of the Old Physics.—This is the second phase of the history of mathematical physics, and we are still in it. Can we say that the first phase was useless? That for fifty years, science went in the wrong direction, leaving us with nothing but to ignore the many efforts that were doomed from the start? Not at all. Do you really believe the second phase could have existed without the first? The idea of central forces included all the principles; it brought them in as necessary outcomes. It encompassed both the conservation of energy and mass, as well as the equality of action and reaction, and the principle of least action, which, it's true, appeared not as experimental facts, but as theorems; their statements at that time had something that was more specific and less general than they do now.

It is the mathematical physics of our fathers which has familiarized us little by little with these various principles; which has[Pg 302] habituated us to recognize them under the different vestments in which they disguise themselves. They have been compared with the data of experience, it has been seen how it was necessary to modify their enunciation to adapt them to these data; thereby they have been extended and consolidated. Thus they came to be regarded as experimental truths; the conception of central forces became then a useless support, or rather an embarrassment, since it made the principles partake of its hypothetical character.

It’s the mathematical physics of our predecessors that has gradually made us familiar with these various principles; it has[Pg 302] trained us to recognize them in the different forms they take. They’ve been compared with experiential data, and it was necessary to adjust their wording to fit these observations; as a result, they have been expanded and solidified. This is how they came to be seen as experimental truths; the idea of central forces then became an unnecessary crutch, or rather a complication, because it made the principles seem hypothetical.

The frames then have not broken, because they are elastic; but they have enlarged; our fathers, who established them, did not labor in vain, and we recognize in the science of to-day the general traits of the sketch which they traced.

The frames haven't broken, because they're flexible; however, they have grown; our ancestors, who created them, didn’t work in vain, and we see in today's science the broad features of the outline they drew.


CHAPTER VIII

The Current Crisis in Mathematical Physics

The New Crisis.—Are we now about to enter upon a third period? Are we on the eve of a second crisis? These principles on which we have built all, are they about to crumble away in their turn? This has been for some time a pertinent question.

The New Crisis.—Are we about to enter a third period? Are we on the brink of a second crisis? Are the principles we've built everything on about to fall apart as well? This has been a relevant question for quite some time.

When I speak thus, you no doubt think of radium, that grand revolutionist of the present time, and in fact I shall come back to it presently; but there is something else. It is not alone the conservation of energy which is in question; all the other principles are equally in danger, as we shall see in passing them successively in review.

When I say this, you probably think of radium, the amazing game-changer of our time, and I will get back to that shortly; but there’s more. It’s not just about energy conservation that’s at stake; all the other principles are at risk as well, as we’ll see when we go through them one by one.

Carnot's Principle.—Let us commence with the principle of Carnot. This is the only one which does not present itself as an immediate consequence of the hypothesis of central forces; more than that, it seems, if not to directly contradict that hypothesis, at least not to be reconciled with it without a certain effort. If physical phenomena were due exclusively to the movements of atoms whose mutual attraction depended only on the distance, it seems that all these phenomena should be reversible; if all the initial velocities were reversed, these atoms, always subjected to the same forces, ought to go over their trajectories in the contrary sense, just as the earth would describe in the retrograde sense this same elliptic orbit which it describes in the direct sense, if the initial conditions of its motion had been reversed. On this account, if a physical phenomenon is possible, the inverse phenomenon should be equally so, and one should be able to reascend the course of time. Now, it is not so in nature, and this is precisely what the principle of Carnot teaches us; heat can pass from the warm body to the cold body; it is impossible afterward to make it take the inverse route and to reestablish differences of temperature which have been effaced. Motion can be wholly dissipated and transformed into heat by friction; the contrary transformation can never be made except partially.[Pg 304]

Carnot's Principle.—Let’s start with Carnot's principle. This is the only one that doesn’t seem to follow directly from the idea of central forces; in fact, it appears to not only challenge that idea but also requires some effort to reconcile with it. If physical phenomena were solely the result of atoms moving, with their attraction based only on distance, it seems that all these phenomena should be reversible. If you reversed all the initial velocities, these atoms, still experiencing the same forces, should retrace their paths in the opposite direction, just like the Earth would retrace its elliptical orbit in reverse if the initial conditions of its motion were flipped. Therefore, if a physical phenomenon is possible, its reverse should also be possible, and one should be able to move backwards through time. However, that’s not the case in nature, and this is exactly what Carnot's principle teaches us; heat can move from a warm body to a cold body, but you can’t make it retrace that path to restore erased temperature differences. Motion can be completely dissipated and turned into heat through friction; the reverse transformation can only happen partially.[Pg 304]

We have striven to reconcile this apparent contradiction. If the world tends toward uniformity, this is not because its ultimate parts, at first unlike, tend to become less and less different; it is because, shifting at random, they end by blending. For an eye which should distinguish all the elements, the variety would remain always as great; each grain of this dust preserves its originality and does not model itself on its neighbors; but as the blend becomes more and more intimate, our gross senses perceive only the uniformity. This is why, for example, temperatures tend to a level, without the possibility of going backwards.

We have worked hard to resolve this seeming contradiction. If the world is moving towards uniformity, it's not because its individual parts, initially different, gradually become less distinct; rather, it's because they randomly shift and end up merging. If there were an eye that could identify all the elements, the variety would always remain vast; each grain of this dust maintains its uniqueness and doesn't conform to its neighbors. However, as the blend becomes tighter, our basic senses only perceive the uniformity. This is why, for instance, temperatures tend to equalize, with no chance of going back.

A drop of wine falls into a glass of water; whatever may be the law of the internal motion of the liquid, we shall soon see it colored of a uniform rosy tint, and however much from this moment one may shake it afterwards, the wine and the water do not seem capable of again separating. Here we have the type of the irreversible physical phenomenon: to hide a grain of barley in a heap of wheat, this is easy; afterwards to find it again and get it out, this is practically impossible. All this Maxwell and Boltzmann have explained; but the one who has seen it most clearly, in a book too little read because it is a little difficult to read, is Gibbs, in his `Elementary Principles of Statistical Mechanics.'

A drop of wine falls into a glass of water; no matter what the rules of internal motion in the liquid are, we will soon see it turn a uniform rosy color, and no matter how much you shake it afterwards, the wine and water don’t seem to be able to separate again. This is a perfect example of an irreversible physical phenomenon: it’s easy to hide a grain of barley in a pile of wheat, but finding it again is practically impossible. Maxwell and Boltzmann have explained all this; however, the one who understood it best, in a book that isn’t read enough because it’s a bit hard to get through, is Gibbs, in his 'Elementary Principles of Statistical Mechanics.'

For those who take this point of view, Carnot's principle is only an imperfect principle, a sort of concession to the infirmity of our senses; it is because our eyes are too gross that we do not distinguish the elements of the blend; it is because our hands are too gross that we can not force them to separate; the imaginary demon of Maxwell, who is able to sort the molecules one by one, could well constrain the world to return backward. Can it return of itself? That is not impossible; that is only infinitely improbable. The chances are that we should wait a long time for the concourse of circumstances which would permit a retrogradation; but sooner or later they will occur, after years whose number it would take millions of figures to write. These reservations, however, all remained theoretic; they were not very disquieting, and Carnot's principle retained all its practical value. But here the scene changes. The biologist, armed with his microscope, long ago noticed in his preparations irregular movements[Pg 305] of little particles in suspension; this is the Brownian movement. He first thought this was a vital phenomenon, but soon he saw that the inanimate bodies danced with no less ardor than the others; then he turned the matter over to the physicists. Unhappily, the physicists remained long uninterested in this question; one concentrates the light to illuminate the microscopic preparation, thought they; with light goes heat; thence inequalities of temperature and in the liquid interior currents which produce the movements referred to. It occurred to M. Gouy to look more closely, and he saw, or thought he saw, that this explanation is untenable, that the movements become brisker as the particles are smaller, but that they are not influenced by the mode of illumination. If then these movements never cease, or rather are reborn without cease, without borrowing anything from an external source of energy, what ought we to believe? To be sure, we should not on this account renounce our belief in the conservation of energy, but we see under our eyes now motion transformed into heat by friction, now inversely heat changed into motion, and that without loss since the movement lasts forever. This is the contrary of Carnot's principle. If this be so, to see the world return backward, we no longer have need of the infinitely keen eye of Maxwell's demon; our microscope suffices. Bodies too large, those, for example, which are a tenth of a millimeter, are hit from all sides by moving atoms, but they do not budge, because these shocks are very numerous and the law of chance makes them compensate each other; but the smaller particles receive too few shocks for this compensation to take place with certainty and are incessantly knocked about. And behold already one of our principles in peril.

For those who hold this view, Carnot's principle is just an imperfect rule, a kind of compromise due to the limitations of our senses; it’s because our eyes are too coarse that we can’t distinguish the components of the mixture; it’s because our hands are too clumsy that we can’t separate them; the imaginary demon of Maxwell, who can sort molecules one by one, could indeed force the world to reverse. Can it reverse on its own? That’s not impossible; it’s just immensely unlikely. The odds are that we would wait a long time for the right circumstances to allow for a reversal; but eventually, they will happen, after countless years that would take millions of numbers to express. However, these qualifications remained theoretical; they weren’t very alarming, and Carnot's principle kept all its practical value. But then the situation changes. The biologist, equipped with his microscope, noticed irregular movements in his preparations long ago—this is the Brownian movement. At first, he thought it was a sign of life, but soon he realized that even inanimate particles danced just as vigorously; he then handed it over to the physicists. Unfortunately, the physicists showed little interest in this issue for a long time; they thought, “One concentrates light to illuminate the microscopic sample; with light comes heat; this leads to temperature differences and internal currents in the liquid that cause the movements.” M. Gouy had the idea to look more closely, and he saw, or believed he saw, that this explanation doesn’t hold up, as the movements get more vigorous as the particles get smaller, but they aren’t affected by how they’re illuminated. If these movements never stop, or rather are constantly being renewed without drawing from an external energy source, what should we believe? Certainly, we shouldn’t abandon our belief in the conservation of energy, but we’re witnessing motion being converted into heat through friction, and inversely, heat turning into motion, all without loss since the movement continues indefinitely. This contradicts Carnot's principle. If this is true, to witness the world reversing, we no longer need the infinitely sharp eye of Maxwell's demon; our microscope is enough. Larger bodies, like those that are a tenth of a millimeter, are bombarded from all sides by moving atoms, but they don’t move because these impacts are so numerous that chance allows them to cancel each other out; however, the smaller particles receive too few impacts for this compensation to reliably occur and are constantly jostled around. And thus, one of our principles is already at risk.

The Principle of Relativity.—Let us pass to the principle of relativity; this not only is confirmed by daily experience, not only is it a necessary consequence of the hypothesis of central forces, but it is irresistibly imposed upon our good sense, and yet it also is assailed. Consider two electrified bodies; though they seem to us at rest, they are both carried along by the motion of the earth; an electric charge in motion, Rowland has taught us, is equivalent to a current; these two charged bodies are, therefore, equivalent to two parallel currents of the same[Pg 306] sense and these two currents should attract each other. In measuring this attraction, we shall measure the velocity of the earth; not its velocity in relation to the sun or the fixed stars, but its absolute velocity.

The Principle of Relativity.—Let's move on to the principle of relativity; this is not only confirmed by our everyday experiences, but it’s also a necessary result of the idea of central forces, and it’s something that we can’t ignore. However, it’s also challenged. Think about two charged objects; even though they appear to be still, they are both in motion along with the earth. Rowland has taught us that a moving electric charge is the same as a current; therefore, these two charged objects are equivalent to two parallel currents flowing in the same[Pg 306] direction, and these two currents should attract each other. When we measure this attraction, we are essentially measuring the earth’s velocity; not in relation to the sun or the fixed stars, but its absolute velocity.

I well know what will be said: It is not its absolute velocity that is measured, it is its velocity in relation to the ether. How unsatisfactory that is! Is it not evident that from the principle so understood we could no longer infer anything? It could no longer tell us anything just because it would no longer fear any contradiction. If we succeed in measuring anything, we shall always be free to say that this is not the absolute velocity, and if it is not the velocity in relation to the ether, it might always be the velocity in relation to some new unknown fluid with which we might fill space.

I fully understand what people will say: it's not its absolute speed being measured, it's its speed compared to the ether. How unsatisfactory is that! Isn't it clear that with such a principle, we couldn't draw any conclusions? It wouldn't tell us anything because it wouldn't be afraid of contradiction. If we manage to measure anything, we can always say that it's not the absolute speed, and if it’s not the speed compared to the ether, it could always be the speed in relation to some new unknown substance we might fill space with.

Indeed, experiment has taken upon itself to ruin this interpretation of the principle of relativity; all attempts to measure the velocity of the earth in relation to the ether have led to negative results. This time experimental physics has been more faithful to the principle than mathematical physics; the theorists, to put in accord their other general views, would not have spared it; but experiment has been stubborn in confirming it. The means have been varied; finally Michelson pushed precision to its last limits; nothing came of it. It is precisely to explain this obstinacy that the mathematicians are forced to-day to employ all their ingenuity.

Indeed, experiments have undermined this interpretation of the principle of relativity; all attempts to measure the Earth's velocity in relation to the ether have yielded negative results. This time, experimental physics has been more faithful to the principle than mathematical physics; the theorists, in trying to align their broader ideas, would not have been kind to it; but experiments have stubbornly confirmed it. The methods have varied; ultimately, Michelson pushed precision to its limits; nothing came of it. It is exactly to explain this stubbornness that mathematicians are now forced to use all their creativity.

Their task was not easy, and if Lorentz has got through it, it is only by accumulating hypotheses.

Their task wasn't easy, and if Lorentz managed to get through it, it's only because he piled up hypotheses.

The most ingenious idea was that of local time. Imagine two observers who wish to adjust their timepieces by optical signals; they exchange signals, but as they know that the transmission of light is not instantaneous, they are careful to cross them. When station B perceives the signal from station A, its clock should not mark the same hour as that of station A at the moment of sending the signal, but this hour augmented by a constant representing the duration of the transmission. Suppose, for example, that station A sends its signal when its clock marks the hour O, and that station B perceives it when its clock marks the hour t. The clocks are adjusted if the slowness equal[Pg 307] to t represents the duration of the transmission, and to verify it, station B sends in its turn a signal when its clock marks O; then station A should perceive it when its clock marks t. The timepieces are then adjusted.

The smartest idea was local time. Imagine two observers who want to sync their clocks using light signals. They send signals back and forth, but knowing that light takes time to travel, they take care to cross their signals. When station B receives the signal from station A, its clock won’t show the same time as station A when the signal was sent, but rather that time plus a constant that represents how long the transmission took. For example, if station A sends its signal when its clock shows O, and station B receives it when its clock shows t, the clocks will be synced if the time delay from A to t equals the transmission duration. To confirm this, station B then sends a signal when its own clock shows O; station A should receive it when its clock shows t. The clocks are then synchronized.

And in fact they mark the same hour at the same physical instant, but on the one condition, that the two stations are fixed. Otherwise the duration of the transmission will not be the same in the two senses, since the station A, for example, moves forward to meet the optical perturbation emanating from B, whereas the station B flees before the perturbation emanating from A. The watches adjusted in that way will not mark, therefore, the true time; they will mark what may be called the local time, so that one of them will be slow of the other. It matters little, since we have no means of perceiving it. All the phenomena which happen at A, for example, will be late, but all will be equally so, and the observer will not perceive it, since his watch is slow; so, as the principle of relativity requires, he will have no means of knowing whether he is at rest or in absolute motion.

And actually, they indicate the same time at the same physical moment, but only if both stations are stationary. If not, the duration of the transmission will differ in both directions, since station A, for example, moves forward to meet the optical disruption coming from B, while station B moves away from the disruption coming from A. Therefore, the clocks set up this way won’t show the actual time; they will show what might be called local time, meaning one will be behind the other. This doesn’t matter much since we have no way of noticing it. All the events at A, for instance, will be delayed, but they will all be delayed equally, and the observer won’t realize it since their clock is behind; thus, as the principle of relativity suggests, they won’t have any way of knowing whether they are at rest or in absolute motion.

Unhappily, that does not suffice, and complementary hypotheses are necessary; it is necessary to admit that bodies in motion undergo a uniform contraction in the sense of the motion. One of the diameters of the earth, for example, is shrunk by one two-hundred-millionth in consequence of our planet's motion, while the other diameter retains its normal length. Thus the last little differences are compensated. And then, there is still the hypothesis about forces. Forces, whatever be their origin, gravity as well as elasticity, would be reduced in a certain proportion in a world animated by a uniform translation; or, rather, this would happen for the components perpendicular to the translation; the components parallel would not change. Resume, then, our example of two electrified bodies; these bodies repel each other, but at the same time if all is carried along in a uniform translation, they are equivalent to two parallel currents of the same sense which attract each other. This electrodynamic attraction diminishes, therefore, the electrostatic repulsion, and the total repulsion is feebler than if the two bodies were at rest. But since to measure this repulsion we must balance it by another force, and all these other forces are reduced in the same[Pg 308] proportion, we perceive nothing. Thus all seems arranged, but are all the doubts dissipated? What would happen if one could communicate by non-luminous signals whose velocity of propagation differed from that of light? If, after having adjusted the watches by the optical procedure, we wished to verify the adjustment by the aid of these new signals, we should observe discrepancies which would render evident the common translation of the two stations. And are such signals inconceivable, if we admit with Laplace that universal gravitation is transmitted a million times more rapidly than light?

Unfortunately, that isn't enough, and we need additional hypotheses; we have to accept that moving bodies experience a uniform contraction in the direction of their motion. For instance, one of the Earth's diameters shrinks by one two-hundred-millionth due to our planet's motion, while the other diameter stays the same. This balances out the minor differences. Then, there's also the hypothesis about forces. Forces, regardless of their source—gravity or elasticity—would be reduced to some degree in a world moving uniformly; or rather, this would only be true for the components perpendicular to the motion; the parallel components would remain unchanged. Let's revisit our example of two charged bodies; these bodies repel each other, but if everything is moving together at a constant speed, they essentially act like two parallel currents in the same direction that attract each other. This electrodynamic attraction reduces the electrostatic repulsion, making the overall repulsion weaker than if the two bodies were stationary. However, since we need to measure this repulsion by balancing it against another force, and all these other forces are reduced in the same proportion, we notice nothing. So everything seems settled, but are all the doubts resolved? What would happen if we could communicate using non-luminous signals that traveled at a different speed than light? If, after syncing the clocks using light, we tried to check the synchronization with these new signals, we would see discrepancies that would reveal the common motion of the two locations. And are such signals really unthinkable if we accept, like Laplace, that universal gravitation travels a million times faster than light?

Thus, the principle of relativity has been valiantly defended in these latter times, but the very energy of the defense proves how serious was the attack.

Thus, the principle of relativity has been strongly defended in recent times, but the intensity of the defense shows just how serious the attack was.

Newton's Principle.—Let us speak now of the principle of Newton, on the equality of action and reaction. This is intimately bound up with the preceding, and it seems indeed that the fall of the one would involve that of the other. Thus we must not be astonished to find here the same difficulties.

Newton's Principle.—Now let's talk about Newton's principle regarding the equality of action and reaction. This is closely related to what we've discussed before, and it seems that if one falls, the other will too. So, we shouldn't be surprised to encounter the same challenges here.

Electrical phenomena, according to the theory of Lorentz, are due to the displacements of little charged particles, called electrons, immersed in the medium we call ether. The movements of these electrons produce perturbations in the neighboring ether; these perturbations propagate themselves in every direction with the velocity of light, and in turn other electrons, originally at rest, are made to vibrate when the perturbation reaches the parts of the ether which touch them. The electrons, therefore, act on one another, but this action is not direct, it is accomplished through the ether as intermediary. Under these conditions can there be compensation between action and reaction, at least for an observer who should take account only of the movements of matter, that is, of the electrons, and who should be ignorant of those of the ether that he could not see? Evidently not. Even if the compensation should be exact, it could not be simultaneous. The perturbation is propagated with a finite velocity; it, therefore, reaches the second electron only when the first has long ago entered upon its rest. This second electron, therefore, will undergo, after a delay, the action of the first, but will certainly not at that moment react upon it, since around this first electron nothing any longer budges.[Pg 309]

Electrical phenomena, according to Lorentz's theory, result from the movement of tiny charged particles known as electrons, which are found in the medium called ether. The movement of these electrons creates disturbances in the surrounding ether; these disturbances spread out in all directions at the speed of light, causing other initially stationary electrons to start vibrating when the disturbance reaches the ether surrounding them. Therefore, electrons interact with each other, but this interaction isn't direct; it's mediated through the ether. In this situation, can there be a balance between action and reaction, at least for an observer who only considers the movements of matter, meaning the electrons, and who is unaware of the movements of the ether that they cannot see? Clearly not. Even if the balance were perfect, it couldn't be simultaneous. The disturbance travels at a finite speed, so it only reaches the second electron after the first has already settled down. Consequently, the second electron will experience the influence of the first after a delay but won't react to it at that moment since the first electron is no longer in motion.[Pg 309]

The analysis of the facts permits us to be still more precise. Imagine, for example, a Hertzian oscillator, like those used in wireless telegraphy; it sends out energy in every direction; but we can provide it with a parabolic mirror, as Hertz did with his smallest oscillators, so as to send all the energy produced in a single direction. What happens then according to the theory? The apparatus recoils, as if it were a cannon and the projected energy a ball; and that is contrary to the principle of Newton, since our projectile here has no mass, it is not matter, it is energy. The case is still the same, moreover, with a beacon light provided with a reflector, since light is nothing but a perturbation of the electromagnetic field. This beacon light should recoil as if the light it sends out were a projectile. What is the force that should produce this recoil? It is what is called the Maxwell-Bartholi pressure. It is very minute, and it has been difficult to put it in evidence even with the most sensitive radiometers; but it suffices that it exists.

The analysis of the facts allows us to be even more precise. Imagine, for instance, a Hertzian oscillator, like the ones used in wireless telegraphy; it emits energy in all directions. However, we can equip it with a parabolic mirror, as Hertz did with his smallest oscillators, to focus all the energy in one direction. What happens according to the theory? The device recoils, as if it were a cannon and the emitted energy were a bullet; this contradicts Newton's principle because our projectile doesn't have mass; it isn't matter, it's energy. The same principle applies to a beacon light with a reflector, since light is merely a disturbance in the electromagnetic field. This beacon light should recoil as though the light it emits were a projectile. What force should cause this recoil? It's what we call the Maxwell-Bartholi pressure. It's very small, and it's been hard to demonstrate even with the most sensitive radiometers; but the fact that it exists is enough.

If all the energy issuing from our oscillator falls on a receiver, this will act as if it had received a mechanical shock, which will represent in a sense the compensation of the oscillator's recoil; the reaction will be equal to the action, but it will not be simultaneous; the receiver will move on, but not at the moment when the oscillator recoils. If the energy propagates itself indefinitely without encountering a receiver, the compensation will never occur.

If all the energy from our oscillator hits a receiver, it will behave as though it experienced a mechanical shock, which reflects the recoil of the oscillator; the reaction will match the action, but it won't happen at the same time; the receiver will move forward, but not at the instant the oscillator recoils. If the energy spreads out endlessly without hitting a receiver, the compensation will never happen.

Shall we say that the space which separates the oscillator from the receiver and which the perturbation must pass over in going from the one to the other is not void, that it is full not only of ether, but of air, or even in the interplanetary spaces of some fluid subtile but still ponderable; that this matter undergoes the shock like the receiver at the moment when the energy reaches it, and recoils in its turn when the perturbation quits it? That would save Newton's principle, but that is not true. If energy in its diffusion remained always attached to some material substratum, then matter in motion would carry along light with it, and Fizeau has demonstrated that it does nothing of the sort, at least for air. Michelson and Morley have since confirmed this. It might be supposed also that the movements of matter[Pg 310] proper are exactly compensated by those of the ether; but that would lead us to the same reflections as before now. The principle so understood will explain everything, since, whatever might be the visible movements, we always could imagine hypothetical movements which compensate them. But if it is able to explain everything, this is because it does not enable us to foresee anything; it does not enable us to decide between the different possible hypotheses, since it explains everything beforehand. It therefore becomes useless.

Shall we say that the space between the oscillator and the receiver, which the disturbance must travel to get from one to the other, isn't empty? That it’s filled not just with ether, but also with air, or even in the vastness of space, some subtle yet still measurable fluid? That this matter reacts to the disturbance like the receiver does when the energy hits it and then pushes back when the disturbance leaves? That would uphold Newton's principle, but that's not accurate. If energy during its spread was always tied to some material basis, then moving matter would drag light along with it, and Fizeau has shown that it doesn't happen that way, at least not with air. Michelson and Morley confirmed this later. It might also be assumed that the movements of matter itself are perfectly balanced by those of the ether, but that would lead us to the same thoughts as before. This principle, as understood, could explain everything, since no matter the visible movements, we could always imagine hypothetical movements that balance them out. But if it can explain everything, it does so at the expense of being able to predict anything; it doesn’t help us choose between different possible theories since it preemptively explains everything. So, it becomes pointless.

And then the suppositions that it would be necessary to make on the movements of the ether are not very satisfactory. If the electric charges double, it would be natural to imagine that the velocities of the diverse atoms of ether double also; but, for the compensation, it would be necessary that the mean velocity of the ether quadruple.

And then the assumptions we need to make about how the ether behaves aren't very satisfying. If the electric charges double, it seems reasonable to think that the speeds of the different ether atoms would also double; however, to balance this out, the average speed of the ether would have to quadruple.

This is why I have long thought that these consequences of theory, contrary to Newton's principle, would end some day by being abandoned, and yet the recent experiments on the movements of the electrons issuing from radium seem rather to confirm them.

This is why I have long believed that these consequences of theory, contrary to Newton's principle, would eventually be discarded, yet the recent experiments on the movements of electrons coming from radium seem to confirm them instead.

Lavoisier's Principle.—I arrive at the principle of Lavoisier on the conservation of mass. Certainly, this is one not to be touched without unsettling all mechanics. And now certain persons think that it seems true to us only because in mechanics merely moderate velocities are considered, but that it would cease to be true for bodies animated by velocities comparable to that of light. Now these velocities are believed at present to have been realized; the cathode rays and those of radium may be formed of very minute particles or of electrons which are displaced with velocities smaller no doubt than that of light, but which might be its one tenth or one third.

Lavoisier's Principle.—I’m getting to Lavoisier's principle about the conservation of mass. This principle is definitely something that, if disturbed, could throw all mechanics into chaos. Now, some people think it only seems true to us because we’re only looking at mechanics with moderately fast speeds, but that it wouldn’t hold if we considered objects moving at speeds close to that of light. Right now, it’s believed that such speeds have been achieved; cathode rays and those from radium might be made up of very tiny particles or electrons that are moving at speeds that are indeed less than light, but possibly one-tenth or one-third of it.

These rays can be deflected, whether by an electric field, or by a magnetic field, and we are able, by comparing these deflections, to measure at the same time the velocity of the electrons and their mass (or rather the relation of their mass to their charge). But when it was seen that these velocities approached that of light, it was decided that a correction was necessary. These molecules, being electrified, can not be displaced without[Pg 311] agitating the ether; to put them in motion it is necessary to overcome a double inertia, that of the molecule itself and that of the ether. The total or apparent mass that one measures is composed, therefore, of two parts: the real or mechanical mass of the molecule and the electrodynamic mass representing the inertia of the ether.

These rays can be redirected by either an electric field or a magnetic field, and by comparing these deflections, we can simultaneously measure the speed of the electrons and their mass (or more accurately, the relationship between their mass and their charge). However, when it became evident that these speeds were nearing that of light, it was determined that a correction was needed. These electrified molecules cannot be moved without[Pg 311] disturbing the ether; to set them in motion, you need to overcome two types of inertia: that of the molecule itself and that of the ether. Therefore, the total or apparent mass that is measured is made up of two components: the actual or mechanical mass of the molecule and the electrodynamic mass representing the inertia of the ether.

The calculations of Abraham and the experiments of Kaufmann have then shown that the mechanical mass, properly so called, is null, and that the mass of the electrons, or, at least, of the negative electrons, is of exclusively electrodynamic origin. This is what forces us to change the definition of mass; we can not any longer distinguish mechanical mass and electrodynamic mass, since then the first would vanish; there is no mass other than electrodynamic inertia. But in this case the mass can no longer be constant; it augments with the velocity, and it even depends on the direction, and a body animated by a notable velocity will not oppose the same inertia to the forces which tend to deflect it from its route, as to those which tend to accelerate or to retard its progress.

The calculations of Abraham and the experiments of Kaufmann have shown that mechanical mass, in the true sense of the term, is zero, and that the mass of electrons, or at least negative electrons, comes solely from electrodynamic effects. This forces us to redefine mass; we can no longer separate mechanical mass from electrodynamic mass because the former would disappear; there is no mass apart from electrodynamic inertia. However, in this case, mass can no longer be constant; it increases with velocity and even depends on direction. A body moving at significant velocity does not exert the same inertia against forces that try to change its direction as it does against those that aim to speed it up or slow it down.

There is still a resource; the ultimate elements of bodies are electrons, some charged negatively, the others charged positively. The negative electrons have no mass, this is understood; but the positive electrons, from the little we know of them, seem much greater. Perhaps they have, besides their electrodynamic mass, a true mechanical mass. The real mass of a body would, then, be the sum of the mechanical masses of its positive electrons, the negative electrons not counting; mass so defined might still be constant.

There is still a resource; the basic components of matter are electrons—some have a negative charge, while others have a positive charge. The negative electrons are understood to have no mass; however, based on what we know, the positive electrons seem to be much larger. They may have, in addition to their electrodynamic mass, a real mechanical mass. Therefore, the actual mass of an object would be the total of the mechanical masses of its positive electrons, with the negative electrons not being included; this defined mass could still remain constant.

Alas! this resource also evades us. Recall what we have said of the principle of relativity and of the efforts made to save it. And it is not merely a principle which it is a question of saving, it is the indubitable results of the experiments of Michelson.

Alas! this resource also escapes us. Remember what we discussed about the principle of relativity and the attempts made to preserve it. And it’s not just a principle that we’re trying to save; it’s the undeniable results of Michelson's experiments.

Well, as was above seen, Lorentz, to account for these results, was obliged to suppose that all forces, whatever their origin, were reduced in the same proportion in a medium animated by a uniform translation; this is not sufficient; it is not enough that this take place for the real forces, it must also be the same for the forces of inertia; it is therefore necessary, he says, that the[Pg 312] masses of all the particles be influenced by a translation to the same degree as the electromagnetic masses of the electrons.

Well, as we saw earlier, Lorentz had to assume that all forces, regardless of their source, were reduced by the same proportion in a medium moving uniformly. This isn't enough; it's not just the actual forces that need to be affected, but the same must apply to inertial forces as well. Therefore, he states that the[Pg 312] masses of all the particles must be influenced by a translation to the same degree as the electromagnetic masses of the electrons.

So the mechanical masses must vary in accordance with the same laws as the electrodynamic masses; they can not, therefore, be constant.

So the mechanical masses have to change according to the same laws as the electrodynamic masses; they can't, therefore, be constant.

Need I point out that the fall of Lavoisier's principle involves that of Newton's? This latter signifies that the center of gravity of an isolated system moves in a straight line; but if there is no longer a constant mass, there is no longer a center of gravity, we no longer know even what this is. This is why I said above that the experiments on the cathode rays appeared to justify the doubts of Lorentz concerning Newton's principle.

Need I mention that if Lavoisier's principle falls, so does Newton's? The latter states that the center of gravity of an isolated system moves in a straight line; however, if there’s no constant mass, there’s no center of gravity, and we don’t even know what that means anymore. That’s why I mentioned earlier that the experiments on cathode rays seemed to support Lorentz's doubts about Newton's principle.

From all these results, if they were confirmed, would arise an entirely new mechanics, which would be, above all, characterized by this fact, that no velocity could surpass that of light,[9] any more than any temperature can fall below absolute zero.

From all these results, if they're confirmed, an entirely new mechanics would emerge, which would be mainly characterized by the fact that no speed could exceed that of light,[9] any more than any temperature can drop below absolute zero.

No more for an observer, carried along himself in a translation he does not suspect, could any apparent velocity surpass that of light; and this would be then a contradiction, if we did not recall that this observer would not use the same clocks as a fixed observer, but, indeed, clocks marking 'local time.'

No observer, who is also caught up in a translation he doesn't realize, could ever see a speed greater than that of light. This would be a contradiction, unless we remember that this observer wouldn’t be using the same clocks as a stationary observer, but instead, clocks that measure 'local time.'

Here we are then facing a question I content myself with stating. If there is no longer any mass, what becomes of Newton's law? Mass has two aspects: it is at the same time a coefficient of inertia and an attracting mass entering as factor into Newtonian attraction. If the coefficient of inertia is not constant, can the attracting mass be? That is the question.

Here we are facing a question I’m okay with just stating. If there’s no longer any mass, what happens to Newton's law? Mass has two aspects: it is both a measure of inertia and an attracting mass that factors into Newtonian attraction. If the measure of inertia isn’t constant, can the attracting mass be? That’s the question.

Mayer's Principle.—At least, the principle of the conservation of energy yet remained to us, and this seemed more solid. Shall I recall to you how it was in its turn thrown into discredit? This event has made more noise than the preceding, and it is in all the memoirs. From the first words of Becquerel, and, above all, when the Curies had discovered radium, it was seen that every radioactive body was an inexhaustible source of radiation. Its activity seemed to subsist without alteration throughout the months and the years. This was in itself a strain on the[Pg 313] principles; these radiations were in fact energy, and from the same morsel of radium this issued and forever issued. But these quantities of energy were too slight to be measured; at least that was the belief and we were not much disquieted.

Mayer's Principle.—At least we still had the principle of energy conservation, and that felt more reliable. Should I remind you how it too fell out of favor? This event caused more of a stir than the last one, and it's documented in all the memoirs. From Becquerel's initial remarks, especially when the Curies discovered radium, it became clear that every radioactive substance was an endless source of radiation. Its activity appeared to remain constant over months and years. This alone was a challenge to the[Pg 313] principles; these radiations were indeed energy, and from that same piece of radium, energy was constantly being emitted. But the amounts of energy were too small to measure; at least that was the consensus, and we weren't too troubled by it.

The scene changed when Curie bethought himself to put radium in a calorimeter; it was then seen that the quantity of heat incessantly created was very notable.

The scene shifted when Curie realized he could put radium in a calorimeter; it then became clear that the amount of heat continuously produced was quite significant.

The explanations proposed were numerous; but in such case we can not say, the more the better. In so far as no one of them has prevailed over the others, we can not be sure there is a good one among them. Since some time, however, one of these explanations seems to be getting the upper hand and we may reasonably hope that we hold the key to the mystery.

The explanations offered were many; however, in this case, we can't say that more is better. Since none of them has stood out above the others, we can't be sure that there's a good one among them. For a while now, though, one of these explanations seems to be gaining traction, and we can reasonably hope that we have the key to the mystery.

Sir W. Ramsay has striven to show that radium is in process of transformation, that it contains a store of energy enormous but not inexhaustible. The transformation of radium then would produce a million times more heat than all known transformations; radium would wear itself out in 1,250 years; this is quite short, and you see that we are at least certain to have this point settled some hundreds of years from now. While waiting, our doubts remain.

Sir W. Ramsay has worked to demonstrate that radium is undergoing a transformation, containing an enormous but not unlimited amount of energy. The transformation of radium would then generate a million times more heat than all known transformations combined; radium would exhaust itself in 1,250 years. This is relatively short, and it means we can be sure this point will be clarified in a few hundred years. In the meantime, our uncertainties persist.


CHAPTER IX

The Future of Math Physics

The Principles and Experiment.—In the midst of so much ruin, what remains standing? The principle of least action is hitherto intact, and Larmor appears to believe that it will long survive the others; in reality, it is still more vague and more general.

The Principles and Experiment.—Amid all this destruction, what still stands? The principle of least action is still intact, and Larmor seems to think it will endure longer than the others; in truth, it is even more vague and general.

In presence of this general collapse of the principles, what attitude will mathematical physics take? And first, before too much excitement, it is proper to ask if all that is really true. All these derogations to the principles are encountered only among infinitesimals; the microscope is necessary to see the Brownian movement; electrons are very light; radium is very rare, and one never has more than some milligrams of it at a time. And, then, it may be asked whether, besides the infinitesimal seen, there was not another infinitesimal unseen counterpoise to the first.

In light of this overall collapse of principles, what stance will mathematical physics adopt? First, before getting too worked up, it's important to question whether all of this is actually true. These deviations from the principles only appear with infinitesimals; you need a microscope to observe Brownian motion; electrons are very light; radium is extremely rare, and one typically only has a few milligrams of it at a time. Additionally, it raises the question of whether, aside from the observed infinitesimal, there might not be another unseen infinitesimal that balances the first.

So there is an interlocutory question, and, as it seems, only experiment can solve it. We shall, therefore, only have to hand over the matter to the experimenters, and, while waiting for them to finally decide the debate, not to preoccupy ourselves with these disquieting problems, and to tranquilly continue our work as if the principles were still uncontested. Certes, we have much to do without leaving the domain where they may be applied in all security; we have enough to employ our activity during this period of doubts.

So there's a temporary question, and it looks like only experimentation can answer it. Therefore, we just need to leave it to the experimenters, and while we wait for them to settle the debate, we shouldn't worry too much about these troubling issues. Instead, we should calmly continue our work as if the principles are still up for debate. Indeed, we have plenty to accomplish without straying from the area where we can apply them safely; we have enough to keep us busy during this uncertain time.

The Rôle of the Analyst.—And as to these doubts, is it indeed true that we can do nothing to disembarrass science of them? It must indeed be said, it is not alone experimental physics that has given birth to them; mathematical physics has well contributed. It is the experimenters who have seen radium throw out energy, but it is the theorists who have put in evidence all the difficulties raised by the propagation of light across a medium in motion; but for these it is probable we should not have become[Pg 315] conscious of them. Well, then, if they have done their best to put us into this embarrassment, it is proper also that they help us to get out of it.

The Role of the Analyst.—So regarding these doubts, is it really true that we can’t do anything to remove them from science? It should be noted that it isn’t just experimental physics that has led to these issues; mathematical physics has played a significant role too. The experimenters have observed radium emitting energy, but it’s the theorists who have highlighted all the challenges posed by light traveling through a moving medium; without their input, we probably wouldn’t have even recognized these challenges. Well, if they’ve done their part to create this confusion, then it’s only fair that they also help us find a way out of it.

They must subject to critical examination all these new views I have just outlined before you, and abandon the principles only after having made a loyal effort to save them. What can they do in this sense? That is what I will try to explain.

They need to closely examine all these new ideas I've just presented to you and only let go of the principles after honestly trying to preserve them. What can they do in this situation? That's what I'll attempt to explain.

It is a question before all of endeavoring to obtain a more satisfactory theory of the electrodynamics of bodies in motion. It is there especially, as I have sufficiently shown above, that difficulties accumulate. It is useless to heap up hypotheses, we can not satisfy all the principles at once; so far, one has succeeded in safeguarding some only on condition of sacrificing the others; but all hope of obtaining better results is not yet lost. Let us take, then, the theory of Lorentz, turn it in all senses, modify it little by little, and perhaps everything will arrange itself.

It’s a question we all need to consider when trying to develop a better understanding of the electrodynamics of moving bodies. This is where the challenges really pile up, as I've shown earlier. It's pointless to keep adding hypotheses; we can’t meet all the principles simultaneously. So far, we’ve managed to protect some only by giving up others, but there's still hope for achieving better outcomes. Let’s take Lorentz’s theory, examine it from all angles, tweak it step by step, and maybe it will all come together.

Thus in place of supposing that bodies in motion undergo a contraction in the sense of the motion, and that this contraction is the same whatever be the nature of these bodies and the forces to which they are otherwise subjected, could we not make a more simple and natural hypothesis? We might imagine, for example, that it is the ether which is modified when it is in relative motion in reference to the material medium which penetrates it, that, when it is thus modified, it no longer transmits perturbations with the same velocity in every direction. It might transmit more rapidly those which are propagated parallel to the motion of the medium, whether in the same sense or in the opposite sense, and less rapidly those which are propagated perpendicularly. The wave surfaces would no longer be spheres, but ellipsoids, and we could dispense with that extraordinary contraction of all bodies.

So instead of assuming that moving bodies shrink in the direction of motion, and that this shrinkage is the same regardless of the type of body and the forces acting on it, can we suggest a simpler and more natural idea? For instance, we could envision that the ether is altered when it moves relative to the material medium that permeates it; when it's modified this way, it stops transmitting disturbances at the same speed in all directions. It might transmit disturbances that move parallel to the medium’s motion, whether in the same direction or the opposite one, more quickly, and those that move perpendicularly more slowly. The wave surfaces would no longer be spheres, but ellipsoids, and we could eliminate that unusual contraction of all bodies.

I cite this only as an example, since the modifications that might be essayed would be evidently susceptible of infinite variation.

I mention this just as an example because the changes that could be attempted would clearly allow for countless variations.

Aberration and Astronomy.—It is possible also that astronomy may some day furnish us data on this point; she it was in the main who raised the question in making us acquainted with the[Pg 316] phenomenon of the aberration of light. If we make crudely the theory of aberration, we reach a very curious result. The apparent positions of the stars differ from their real positions because of the earth's motion, and as this motion is variable, these apparent positions vary. The real position we can not ascertain, but we can observe the variations of the apparent position. The observations of the aberration show us, therefore, not the earth's motion, but the variations of this motion; they can not, therefore, give us information about the absolute motion of the earth.

Aberration and Astronomy.—It's possible that astronomy might one day provide us with data on this subject; it was primarily astronomy that raised the question by introducing us to the [Pg 316] phenomenon of light aberration. If we roughly outline the theory of aberration, we arrive at a very interesting conclusion. The apparent positions of the stars differ from their actual positions because of the earth's movement, and since this movement is not constant, these apparent positions change. We cannot determine the real position, but we can observe the variations in the apparent position. Therefore, observations of aberration show us not the earth's movement itself, but its variations; they cannot provide us with information regarding the absolute motion of the earth.

At least this is true in first approximation, but the case would be no longer the same if we could appreciate the thousandths of a second. Then it would be seen that the amplitude of the oscillation depends not alone on the variation of the motion, a variation which is well known, since it is the motion of our globe on its elliptic orbit, but on the mean value of this motion, so that the constant of aberration would not be quite the same for all the stars, and the differences would tell us the absolute motion of the earth in space.

At least this is true at first glance, but that wouldn’t be the case if we could measure in thousandths of a second. Then we would see that the amplitude of the oscillation depends not only on the change in motion, which is well understood since it’s the movement of our planet along its elliptical orbit, but also on the average value of this motion. This means that the constant of aberration wouldn’t be exactly the same for all the stars, and the differences would reveal the Earth's absolute motion in space.

This, then, would be, under another form, the ruin of the principle of relativity. We are far, it is true, from appreciating the thousandth of a second, but, after all, say some, the earth's total absolute velocity is perhaps much greater than its relative velocity with respect to the sun. If, for example, it were 300 kilometers per second in place of 30, this would suffice to make the phenomenon observable.

This would essentially be a different way of describing the collapse of the principle of relativity. It's true that we're not yet able to appreciate a thousandth of a second, but some argue that the Earth's total absolute speed might actually be much higher than its relative speed compared to the sun. For instance, if it were 300 kilometers per second instead of 30, that would be enough to make the phenomenon noticeable.

I believe that in reasoning thus one admits a too simple theory of aberration. Michelson has shown us, I have told you, that the physical procedures are powerless to put in evidence absolute motion; I am persuaded that the same will be true of the astronomic procedures, however far precision be carried.

I think that by reasoning this way, one accepts a theory of distortion that’s too simplistic. Michelson has demonstrated, as I mentioned, that physical methods can’t reveal absolute motion; I’m convinced that the same will hold true for astronomical methods, no matter how precise they become.

However that may be, the data astronomy will furnish us in this regard will some day be precious to the physicist. Meanwhile, I believe that the theorists, recalling the experience of Michelson, may anticipate a negative result, and that they would accomplish a useful work in constructing a theory of aberration which would explain this in advance.

However that may be, the data from astronomy will eventually be valuable to physicists. In the meantime, I believe that theorists, remembering Michelson's experience, might predict a negative result and that they would do a useful job by developing a theory of aberration that explains this beforehand.

Electrons and Spectra.—This dynamics of electrons can be approached from many sides, but among the ways leading thither is[Pg 317] one which has been somewhat neglected, and yet this is one of those which promise us the most surprises. It is movements of electrons which produce the lines of the emission spectra; this is proved by the Zeeman effect; in an incandescent body what vibrates is sensitive to the magnet, therefore electrified. This is a very important first point, but no one has gone farther. Why are the lines of the spectrum distributed in accordance with a regular law? These laws have been studied by the experimenters in their least details; they are very precise and comparatively simple. A first study of these distributions recalls the harmonics encountered in acoustics; but the difference is great. Not only are the numbers of vibrations not the successive multiples of a single number, but we do not even find anything analogous to the roots of those transcendental equations to which we are led by so many problems of mathematical physics: that of the vibrations of an elastic body of any form, that of the Hertzian oscillations in a generator of any form, the problem of Fourier for the cooling of a solid body.

Electrons and Spectra.—We can approach the dynamics of electrons from various angles, but one method that has been somewhat overlooked is[Pg 317] actually one of the most promising and surprising. It's the movement of electrons that creates the lines in the emission spectra; this is demonstrated by the Zeeman effect. In an incandescent material, what vibrates is responsive to magnets, meaning it's electrified. This is a crucial first point, but no one has explored further. Why are the lines of the spectrum arranged according to a regular pattern? Experimenters have studied these laws in great detail; they are very precise and relatively simple. A first look at these distributions resembles the harmonics found in acoustics, but the differences are significant. Not only are the vibration numbers not simple multiples of a single number, but we also don't find anything similar to the roots of those transcendental equations that arise in many mathematical physics problems: like the vibrations of an elastic body in any shape, the Hertzian oscillations in any type of generator, or Fourier's problem concerning the cooling of a solid body.

The laws are simpler, but they are of wholly other nature, and to cite only one of these differences, for the harmonics of high order, the number of vibrations tends toward a finite limit, instead of increasing indefinitely.

The laws are simpler, but they are of a completely different nature, and to mention just one of these differences, for the harmonics of higher order, the number of vibrations approaches a finite limit, instead of increasing endlessly.

That has not yet been accounted for, and I believe that there we have one of the most important secrets of nature. A Japanese physicist, M. Nagaoka, has recently proposed an explanation; according to him, atoms are composed of a large positive electron surrounded by a ring formed of a great number of very small negative electrons. Such is the planet Saturn with its rings. This is a very interesting attempt, but not yet wholly satisfactory; this attempt should be renewed. We will penetrate, so to speak, into the inmost recess of matter. And from the particular point of view which we to-day occupy, when we know why the vibrations of incandescent bodies differ thus from ordinary elastic vibrations, why the electrons do not behave like the matter which is familiar to us, we shall better comprehend the dynamics of electrons and it will be perhaps more easy for us to reconcile it with the principles.

That hasn't been figured out yet, and I believe that's where we find one of nature's biggest secrets. A Japanese physicist, M. Nagaoka, recently proposed an explanation; he claims that atoms consist of a large positive electron surrounded by a ring made up of many tiny negative electrons. This is similar to the planet Saturn and its rings. It's a fascinating attempt, but it's not entirely convincing yet; we need to revisit this idea. We will delve, so to speak, into the deepest parts of matter. From the perspective we have today, when we understand why the vibrations of glowing bodies differ from ordinary elastic vibrations, and why electrons don't behave like the matter we're used to, we'll have a better grasp of electron dynamics, making it perhaps easier to align it with established principles.

Conventions Preceding Experiment.—Suppose, now, that all[Pg 318] these efforts fail, and, after all, I do not believe they will, what must be done? Will it be necessary to seek to mend the broken principles by giving what we French call a coup de pouce? That evidently is always possible, and I retract nothing of what I have said above.

Conventions Preceding Experiment.—Now, let's assume that all[Pg 318] these efforts fail, and honestly, I don't think they will. What should we do then? Will we need to try to fix the flawed principles by giving what we French call a coup de pouce? That clearly is always an option, and I stand by everything I’ve said above.

Have you not written, you might say if you wished to seek a quarrel with me—have you not written that the principles, though of experimental origin, are now unassailable by experiment because they have become conventions? And now you have just told us that the most recent conquests of experiment put these principles in danger.

Have you not written, you might say if you wanted to argue with me—have you not written that the principles, even though they stem from experiments, are now beyond challenge by experiments because they have become accepted norms? And now you just told us that the latest achievements in experiments threaten these principles.

Well, formerly I was right and to-day I am not wrong. Formerly I was right, and what is now happening is a new proof of it. Take, for example, the calorimetric experiment of Curie on radium. Is it possible to reconcile it with the principle of the conservation of energy? This has been attempted in many ways. But there is among them one I should like you to notice; this is not the explanation which tends to-day to prevail, but it is one of those which have been proposed. It has been conjectured that radium was only an intermediary, that it only stored radiations of unknown nature which flashed through space in every direction, traversing all bodies, save radium, without being altered by this passage and without exercising any action upon them. Radium alone took from them a little of their energy and afterward gave it out to us in various forms.

Well, I used to be right, and today I'm not wrong. I was right back then, and what's happening now is new evidence of that. Take, for instance, Curie's calorimetric experiment with radium. Can it be reconciled with the principle of conservation of energy? People have tried to do this in many ways. But there's one explanation I want you to pay attention to; it's not the dominant one today, but it's one of the proposed ideas. Some have suggested that radium is just an intermediary, storing radiations of unknown nature that pass through space in all directions, going through all bodies except for radium, without changing them or having any effect on them. Only radium absorbs a bit of their energy and later releases it to us in different forms.

What an advantageous explanation, and how convenient! First, it is unverifiable and thus irrefutable. Then again it will serve to account for any derogation whatever to Mayer's principle; it answers in advance not only the objection of Curie, but all the objections that future experimenters might accumulate. This new and unknown energy would serve for everything.

What a great explanation, and how convenient! First, it's impossible to verify, so it can't be disproven. Plus, it will also justify any exceptions to Mayer's principle; it preempts not just Curie's objection but all the objections that future researchers might come up with. This new and unknown energy could be used for everything.

This is just what I said, and therewith we are shown that our principle is unassailable by experiment.

This is exactly what I said, and this shows us that our principle is beyond question through experimentation.

But then, what have we gained by this stroke? The principle is intact, but thenceforth of what use is it? It enabled us to foresee that in such or such circumstance we could count on such a total quantity of energy; it limited us; but now that this indefinite provision of new energy is placed at our disposal, we are no[Pg 319] longer limited by anything; and, as I have written in 'Science and Hypothesis,' if a principle ceases to be fecund, experiment without contradicting it directly will nevertheless have condemned it.

But then, what have we gained from this? The principle is still there, but what good is it now? It allowed us to predict that in certain situations we could rely on a certain amount of energy; it restricted us. But now that we have this endless supply of new energy available to us, we're no longer limited by anything. And as I wrote in 'Science and Hypothesis,' if a principle stops being useful, experiments that don’t directly contradict it will still end up proving its failure.

Future Mathematical Physics.—This, therefore, is not what would have to be done; it would be necessary to rebuild anew. If we were reduced to this necessity; we could moreover console ourselves. It would not be necessary thence to conclude that science can weave only a Penelope's web, that it can raise only ephemeral structures, which it is soon forced to demolish from top to bottom with its own hands.

Future Mathematical Physics.—So, this isn’t what needs to be done; we would have to start over from scratch. If we found ourselves in this situation, we could at least find some comfort in it. It wouldn’t have to lead us to think that science can only create something like Penelope's web, constructing only temporary structures that it is quickly forced to tear down completely with its own hands.

As I have said, we have already passed through a like crisis. I have shown you that in the second mathematical physics, that of the principles, we find traces of the first, that of central forces; it will be just the same if we must know a third. Just so with the animal that exuviates, that breaks its too narrow carapace and makes itself a fresh one; under the new envelope one will recognize the essential traits of the organism which have persisted.

As I mentioned, we have already gone through a similar crisis. I have shown you that in the second mathematical physics, that of the principles, we find remnants of the first, that of central forces; it will be the same if we need to understand a third. Just like the animal that sheds its skin, breaking out of its restrictive shell and creating a new one; under the new covering, you'll still recognize the key characteristics of the organism that have remained.

We can not foresee in what way we are about to expand; perhaps it is the kinetic theory of gases which is about to undergo development and serve as model to the others. Then the facts which first appeared to us as simple thereafter would be merely resultants of a very great number of elementary facts which only the laws of chance would make cooperate for a common end. Physical law would then assume an entirely new aspect; it would no longer be solely a differential equation, it would take the character of a statistical law.

We can't predict how we’re going to grow; maybe it’s the kinetic theory of gases that’s about to advance and set an example for others. Then, the facts that initially seemed straightforward would just be the result of a huge number of basic facts that chance would somehow align for a common purpose. Physical law would then look completely different; it wouldn’t just be a differential equation, but would reflect a statistical law.

Perhaps, too, we shall have to construct an entirely new mechanics that we only succeed in catching a glimpse of, where, inertia increasing with the velocity, the velocity of light would become an impassable limit. The ordinary mechanics, more simple, would remain a first approximation, since it would be true for velocities not too great, so that the old dynamics would still be found under the new. We should not have to regret having believed in the principles, and even, since velocities too great for the old formulas would always be only exceptional, the surest way in practise would be still to act as if we continued to[Pg 320] believe in them. They are so useful, it would be necessary to keep a place for them. To determine to exclude them altogether would be to deprive oneself of a precious weapon. I hasten to say in conclusion that we are not yet there, and as yet nothing proves that the principles will not come forth from out the fray victorious and intact.[10]

Maybe we'll need to build a completely new mechanics that we can only just catch a glimpse of, where inertia increases with speed, making the speed of light an insurmountable limit. The ordinary mechanics, which are simpler, would still hold as a first approximation because they apply to speeds that aren't too high, so the old dynamics would still be present in the new framework. We shouldn't regret having believed in the principles, and since speeds that are too high for the old formulas will always be rare, the safest approach in practice would be to act as if we still believed in them. They're so useful that we need to keep a place for them. Deciding to exclude them completely would mean giving up a valuable tool. I want to emphasize that we're not there yet, and nothing currently proves that the principles won't emerge from the conflict victorious and intact.[Pg 320][10]


PART III

THE OBJECTIVE VALUE OF SCIENCE


CHAPTER X

Is Science Man-Made?

1. The Philosophy of M. LeRoy

There are many reasons for being sceptics; should we push this scepticism to the very end or stop on the way? To go to the end is the most tempting solution, the easiest and that which many have adopted, despairing of saving anything from the shipwreck.

There are many reasons to be skeptical; should we take this skepticism all the way or pause along the way? Going all the way is the most tempting solution, the easiest one, and it’s what many have chosen, giving up on saving anything from the wreck.

Among the writings inspired by this tendency it is proper to place in the first rank those of M. LeRoy. This thinker is not only a philosopher and a writer of the greatest merit, but he has acquired a deep knowledge of the exact and physical sciences, and even has shown rare powers of mathematical invention. Let us recapitulate in a few words his doctrine, which has given rise to numerous discussions.

Among the writings inspired by this trend, it's appropriate to rank M. LeRoy's works at the top. This thinker is not only a philosopher and an excellent writer, but he has also gained a profound understanding of the exact and physical sciences, and he has even demonstrated exceptional skills in mathematical invention. Let's summarize his doctrine in a few words, which has sparked numerous discussions.

Science consists only of conventions, and to this circumstance solely does it owe its apparent certitude; the facts of science and, a fortiori, its laws are the artificial work of the scientist; science therefore can teach us nothing of the truth; it can only serve us as rule of action.

Science is just made up of conventions, and that's the only reason it seems so certain; the facts of science and, even more so, its laws are created by scientists. Therefore, science can't teach us anything about the truth; it can only guide our actions.

Here we recognize the philosophic theory known under the name of nominalism; all is not false in this theory; its legitimate domain must be left it, but out of this it should not be allowed to go.

Here we acknowledge the philosophical theory known as nominalism; not everything about this theory is wrong; its rightful scope should be respected, but it shouldn't be allowed to extend beyond that.

This is not all; M. LeRoy's doctrine is not only nominalistic; it has besides another characteristic which it doubtless owes to M. Bergson, it is anti-intellectualistic. According to M. LeRoy, the[Pg 322] intellect deforms all it touches, and that is still more true of its necessary instrument 'discourse.' There is reality only in our fugitive and changing impressions, and even this reality, when touched, vanishes.

This is not everything; M. LeRoy's theory is not just nominalistic; it also has another trait that he likely borrowed from M. Bergson: it is anti-intellectual. According to M. LeRoy, the[Pg 322] intellect distorts everything it engages with, and this is even more accurate for its essential tool, 'discourse.' There is only reality in our fleeting and changing impressions, and even this reality disappears once it is touched.

And yet M. LeRoy is not a sceptic; if he regards the intellect as incurably powerless, it is only to give more scope to other sources of knowledge, to the heart, for instance, to sentiment, to instinct or to faith.

And yet M. LeRoy isn’t a skeptic; if he sees the intellect as hopelessly limited, it’s only to allow more room for other sources of knowledge, like the heart, sentiment, instinct, or faith.

However great my esteem for M. LeRoy's talent, whatever the ingenuity of this thesis, I can not wholly accept it. Certes, I am in accord on many points with M. LeRoy, and he has even cited, in support of his view, various passages of my writings which I am by no means disposed to reject. I think myself only the more bound to explain why I can not go with him all the way.

However much I admire M. LeRoy's talent and no matter how clever his thesis is, I can't completely agree with it. Of course, I agree with M. LeRoy on many points, and he has even referenced some parts of my writings that I definitely don't dismiss. I feel it's important to explain why I can't fully support his position.

M. LeRoy often complains of being accused of scepticism. He could not help being, though this accusation is probably unjust. Are not appearances against him? Nominalist in doctrine, but realist at heart, he seems to escape absolute nominalism only by a desperate act of faith.

M. LeRoy often complains about being called a skeptic. He can't help it, although this accusation is likely unfair. Don't appearances work against him? He’s nominalist in theory, but realist at heart. It seems he only avoids being a strict nominalist through a desperate leap of faith.

The fact is that anti-intellectualistic philosophy in rejecting analysis and 'discourse,' just by that condemns itself to being intransmissible; it is a philosophy essentially internal, or, at the very least, only its negations can be transmitted; what wonder then that for an external observer it takes the shape of scepticism?

The truth is that anti-intellectual philosophy, by rejecting analysis and 'discourse,' basically makes itself impossible to communicate; it's a philosophy that's fundamentally internal, or at best, only its rejections can be shared. So, it's no surprise that to an outside observer, it appears as skepticism.

Therein lies the weak point of this philosophy; if it strives to remain faithful to itself, its energy is spent in a negation and a cry of enthusiasm. Each author may repeat this negation and this cry, may vary their form, but without adding anything.

Therein lies the weak point of this philosophy; if it aims to stay true to itself, its energy is used up in denial and a shout of enthusiasm. Each author can repeat this denial and this shout, can change their form, but without adding anything.

And, yet, would it not be more logical in remaining silent? See, you have written long articles; for that, it was necessary to use words. And therein have you not been much more 'discursive' and consequently much farther from life and truth than the animal who simply lives without philosophizing? Would not this animal be the true philosopher?

And yet, wouldn’t it make more sense to stay quiet? Look, you’ve written lengthy articles; for that, you had to use words. And haven’t you been much more 'wordy' and therefore much further from real life and truth than the animal that just lives without overthinking? Couldn’t this animal be the real philosopher?

However, because no painter has made a perfect portrait, should we conclude that the best painting is not to paint? When a zoologist dissects an animal, certainly he 'alters it.' Yes, in dissecting it, he condemns himself to never know all of it; but in[Pg 323] not dissecting it, he would condemn himself to never know anything of it and consequently to never see anything of it.

However, just because no artist has created a perfect portrait, should we think that the best choice is not to paint? When a zoologist dissects an animal, it definitely changes it. Yes, by dissecting it, they limit their understanding of the whole creature; but if they choose not to dissect it, they would be limiting themselves to knowing nothing about it and, therefore, never seeing anything of it.

Certes, in man are other forces besides his intellect; no one has ever been mad enough to deny that. The first comer makes these blind forces act or lets them act; the philosopher must speak of them; to speak of them, he must know of them the little that can be known, he should therefore see them act. How? With what eyes, if not with his intellect? Heart, instinct, may guide it, but not render it useless; they may direct the look, but not replace the eye. It may be granted that the heart is the workman, and the intellect only the instrument. Yet is it an instrument not to be done without, if not for action, at least for philosophizing? Therefore a philosopher really anti-intellectualistic is impossible. Perhaps we shall have to declare for the supremacy of action; always it is our intellect which will thus conclude; in allowing precedence to action it will thus retain the superiority of the thinking reed. This also is a supremacy not to be disdained.

Sure, there are other forces in humans besides just their intellect; no one has ever been foolish enough to deny that. Anyone can make these blind forces work or let them work; the philosopher has to talk about them; to talk about them, he needs to know the little that can be known, so he should witness them in action. How? With what eyes, if not with his intellect? The heart and instinct may guide it, but they don't make it useless; they may steer the focus, but they can't replace the eye. It's true that the heart is the one doing the work, and the intellect is just the tool. Still, it’s a tool that can’t be overlooked, at least not for action, and definitely not for thinking deeply. So a truly anti-intellectual philosopher is impossible. Maybe we need to side with the importance of action; still, it is our intellect that will reach that conclusion; in putting action first, it will still hold onto the superiority of the thinking being. This is also a form of superiority that deserves respect.

Pardon these brief reflections and pardon also their brevity, scarcely skimming the question. The process of intellectualism is not the subject I wish to treat: I wish to speak of science, and about it there is no doubt; by definition, so to speak, it will be intellectualistic or it will not be at all. Precisely the question is, whether it will be.

Pardon these quick thoughts and also their shortness, barely touching on the topic. The process of intellectual thinking isn’t what I want to discuss: I want to talk about science, and there’s no doubt about it; by definition, it will either be intellectual or it won’t exist at all. The real question is whether it will.

2. Science, Rule of Action

For M. LeRoy, science is only a rule of action. We are powerless to know anything and yet we are launched, we must act, and at all hazards we have established rules. It is the aggregate of these rules that is called science.

For M. LeRoy, science is just a set of actions. We can’t know anything for sure, but we’re pushed to take action anyway, and we have created rules despite the risks. The collection of these rules is what we refer to as science.

It is thus that men, desirous of diversion, have instituted rules of play, like those of tric-trac for instance, which, better than science itself, could rely upon the proof by universal consent. It is thus likewise that, unable to choose, but forced to choose, we toss up a coin, head or tail to win.

It is for this reason that people, seeking entertainment, have created rules for games, like tric-trac for example, which can depend on universal agreement even more than science itself. Similarly, when we have to make a choice but can't decide, we flip a coin, heads or tails to win.

The rule of tric-trac is indeed a rule of action like science, but does any one think the comparison just and not see the difference? The rules of the game are arbitrary conventions[Pg 324] and the contrary convention might have been adopted, which would have been none the less good. On the contrary, science is a rule of action which is successful, generally at least, and I add, while the contrary rule would not have succeeded.

The rules of tric-trac are definitely a set of actions like science, but does anyone really believe that comparison is fair and not notice the difference? The game rules are just arbitrary conventions[Pg 324] and a different set of conventions could have been chosen, which would have been equally valid. In contrast, science is a guiding principle of action that is generally successful, and I would argue that an alternative principle would not have worked.

If I say, to make hydrogen cause an acid to act on zinc, I formulate a rule which succeeds; I could have said, make distilled water act on gold; that also would have been a rule, only it would not have succeeded. If, therefore, scientific 'recipes' have a value, as rule of action, it is because we know they succeed, generally at least. But to know this is to know something and then why tell us we can know nothing?

If I say that making hydrogen react with acid on zinc works, I’m stating a rule that actually succeeds; I could have said to make distilled water react with gold; that too would have been a rule, but it wouldn't have worked. So, if scientific 'recipes' are valuable as a guideline for action, it's because we know they generally succeed. But to know this means we actually know something, so why tell us we can know nothing?

Science foresees, and it is because it foresees that it can be useful and serve as rule of action. I well know that its previsions are often contradicted by the event; that shows that science is imperfect, and if I add that it will always remain so, I am certain that this is a prevision which, at least, will never be contradicted. Always the scientist is less often mistaken than a prophet who should predict at random. Besides the progress though slow is continuous, so that scientists, though more and more bold, are less and less misled. This is little, but it is enough.

Science predicts, and because it predicts, it can be useful and serve as a guide for action. I know well that its predictions are often contradicted by actual events; this shows that science is imperfect, and if I add that it will always be imperfect, I am sure that this is a prediction that, at least, will never be proven wrong. In general, scientists make mistakes less often than a prophet who guesses at random. Moreover, progress, though slow, is continuous, so scientists, while becoming bolder, are less and less misled. This may seem small, but it is enough.

I well know that M. LeRoy has somewhere said that science was mistaken oftener than one thought, that comets sometimes played tricks on astronomers, that scientists, who apparently are men, did not willingly speak of their failures, and that, if they should speak of them, they would have to count more defeats than victories.

I know that M. LeRoy has mentioned somewhere that science is often wrong more than you think, that comets sometimes fool astronomers, that scientists, who seem like regular people, don’t like to talk about their failures, and that if they did, they would have to acknowledge more defeats than wins.

That day, M. LeRoy evidently overreached himself. If science did not succeed, it could not serve as rule of action; whence would it get its value? Because it is 'lived,' that is, because we love it and believe in it? The alchemists had recipes for making gold, they loved them and had faith in them, and yet our recipes are the good ones, although our faith be less lively, because they succeed.

That day, M. LeRoy clearly pushed his limits. If science doesn't succeed, it can't be a guide for action; where would its value come from? Just because it's 'experienced,' meaning we love it and believe in it? The alchemists had formulas for making gold, they cherished them and had faith in them, yet our methods are the right ones, even if our belief is less intense, because they actually work.

There is no escape from this dilemma; either science does not enable us to foresee, and then it is valueless as rule of action; or else it enables us to foresee, in a fashion more or less imperfect, and then it is not without value as means of knowledge.[Pg 325]

There’s no way out of this dilemma: either science doesn’t allow us to predict outcomes, making it useless as a guide for action, or it does help us predict, albeit imperfectly, and in that case, it has some value as a means of understanding.[Pg 325]

It should not even be said that action is the goal of science; should we condemn studies of the star Sirius, under pretext that we shall probably never exercise any influence on that star? To my eyes, on the contrary, it is the knowledge which is the end, and the action which is the means. If I felicitate myself on the industrial development, it is not alone because it furnishes a facile argument to the advocates of science; it is above all because it gives to the scientist faith in himself and also because it offers him an immense field of experience where he clashes against forces too colossal to be tampered with. Without this ballast, who knows whether he would not quit solid ground, seduced by the mirage of some scholastic novelty, or whether he would not despair, believing he had fashioned only a dream?

It shouldn’t even need to be said that action is the goal of science. Should we dismiss studies of the star Sirius just because we’ll probably never have any influence over it? To me, knowledge is the goal, and action is just the means to get there. When I celebrate industrial development, it’s not only because it provides an easy argument for the supporters of science; it’s mainly because it gives scientists confidence in themselves and opens up an enormous field of experience where they encounter forces too great to be manipulated. Without this foundation, who knows if they would abandon solid ground, tempted by the illusion of some academic novelty, or if they would become disheartened, thinking they had only created a fantasy?

3. The Crude Fact and the Scientific Fact

What was most paradoxical in M. LeRoy's thesis was that affirmation that the scientist creates the fact; this was at the same time its essential point and it is one of those which have been most discussed.

What was most paradoxical in M. LeRoy's thesis was the assertion that the scientist creates the fact; this was both its key point and one of the most debated aspects.

Perhaps, says he (I well believe that this was a concession), it is not the scientist that creates the fact in the rough; it is at least he who creates the scientific fact.

Maybe, he says (I truly believe this was a concession), it's not the scientist who creates the raw fact; at the very least, he's the one who creates the scientific fact.

This distinction between the fact in the rough and the scientific fact does not by itself appear to me illegitimate. But I complain first that the boundary has not been traced either exactly or precisely; and then that the author has seemed to suppose that the crude fact, not being scientific, is outside of science.

This difference between raw facts and scientific facts doesn’t seem illegitimate to me on its own. However, I first point out that the boundary hasn’t been defined clearly or accurately; and second, that the author seems to assume that the crude fact, since it's not scientific, is outside the realm of science.

Finally, I can not admit that the scientist creates without restraint the scientific fact, since it is the crude fact which imposes it upon him.

Finally, I can't accept that the scientist creates the scientific fact without limits, because it's the raw fact that demands it from him.

The examples given by M. LeRoy have greatly astonished me. The first is taken from the notion of atom. The atom chosen as example of fact! I avow that this choice has so disconcerted me that I prefer to say nothing about it. I have evidently misunderstood the author's thought and I could not fruitfully discuss it.

The examples provided by M. LeRoy have really surprised me. The first one is based on the concept of an atom. The atom selected as an example of fact! I must admit that this choice has thrown me off so much that I’d rather not say anything about it. Clearly, I’ve misunderstood the author’s intention, and I can't engage with it productively.

The second case taken as example is that of an eclipse where the crude phenomenon is a play of light and shadow, but where[Pg 326] the astronomer can not intervene without introducing two foreign elements, to wit, a clock and Newton's law.

The second case used as an example is an eclipse, where the basic phenomenon is a show of light and shadow. However, the astronomer cannot get involved without adding two external elements: a clock and Newton's law.[Pg 326]

Finally, M. LeRoy cites the rotation of the earth; it has been answered: but this is not a fact, and he has replied: it was one for Galileo, who affirmed it, as for the inquisitor, who denied it. It always remains that this is not a fact in the same sense as those just spoken of and that to give them the same name is to expose one's self to many confusions.

Finally, M. LeRoy mentions the rotation of the earth; it has been addressed: but this is not an established fact, and he responded: it was one for Galileo, who confirmed it, just as it was for the inquisitor, who rejected it. It still stands that this is not a fact in the same way as the ones previously mentioned, and using the same term for both leads to a lot of misunderstandings.

Here then are four degrees:

Here are four degrees:

1º. It grows dark, says the clown.

1º. It’s getting dark, says the clown.

2º. The eclipse happened at nine o'clock, says the astronomer.

2º. The eclipse occurred at nine o'clock, the astronomer states.

3º. The eclipse happened at the time deducible from the tables constructed according to Newton's law, says he again.

3º. The eclipse occurred at the time predicted by the tables created based on Newton's law, he states again.

4º. That results from the earth's turning around the sun, says Galileo finally.

4º. That comes from the earth orbiting around the sun, Galileo finally says.

Where then is the boundary between the fact in the rough and the scientific fact? To read M. LeRoy one would believe that it is between the first and the second stage, but who does not see that there is a greater distance from the second to the third, and still more from the third to the fourth.

Where is the line between raw facts and scientific facts? Reading M. LeRoy, you'd think it's between the first and second stages, but it's clear that there's a bigger gap between the second and third, and an even larger one from the third to the fourth.

Allow me to cite two examples which perhaps will enlighten us a little.

Allow me to share two examples that might help clarify things for us a bit.

I observe the deviation of a galvanometer by the aid of a movable mirror which projects a luminous image or spot on a divided scale. The crude fact is this: I see the spot displace itself on the scale, and the scientific fact is this: a current passes in the circuit.

I watch the change in a galvanometer using a movable mirror that casts a bright image or spot on a marked scale. The simple truth is this: I see the spot move along the scale, and the scientific fact is this: a current flows through the circuit.

Or again: when I make an experiment I should subject the result to certain corrections, because I know I must have made errors. These errors are of two kinds, some are accidental and these I shall correct by taking the mean; the others are systematic and I shall be able to correct those only by a thorough study of their causes. The first result obtained is then the fact in the rough, while the scientific fact is the final result after the finished corrections.

Or again: when I conduct an experiment, I need to apply certain corrections to the results since I know I must have made mistakes. There are two types of errors: some are accidental, and I can correct those by calculating the average; the others are systematic, and I can only correct them through a detailed study of their causes. The initial result obtained is just a rough fact, while the scientific fact is the final result after all the necessary corrections are made.

Reflecting on this latter example, we are led to subdivide our second stage, and in place of saying:

Reflecting on this latter example, we are led to break down our second stage, and instead of saying:

2. The eclipse happened at nine o'clock, we shall say:

2. The eclipse happened at nine o'clock, let's say:

2a. The eclipse happened when my clock pointed to nine, and[Pg 327]

2a. The eclipse occurred when my clock showed nine, and[Pg 327]

2b. My clock being ten minutes slow, the eclipse happened at ten minutes past nine.

2b. My clock was ten minutes slow, so the eclipse occurred at ten minutes after nine.

And this is not all: the first stage also should be subdivided, and not between these two subdivisions will be the least distance; it is necessary to distinguish between the impression of obscurity felt by one witnessing an eclipse, and the affirmation: It grows dark, which this impression extorts from him. In a sense it is the first which is the only true fact in the rough, and the second is already a sort of scientific fact.

And that’s not everything: the first stage should also be divided into sections, and the distance between these two sections isn’t the smallest; it’s important to differentiate between the feeling of confusion experienced by someone watching an eclipse and the statement: It’s getting dark, which this feeling compels them to make. In a way, the first is the only true fact in its raw form, while the second is already a kind of scientific fact.

Now then our scale has six stages, and even though there is no reason for halting at this figure, there we shall stop.

Now our scale has six stages, and although there's no reason to stop at this number, we will.

What strikes me at the start is this. At the first of our six stages, the fact, still completely in the rough, is, so to speak, individual, it is completely distinct from all other possible facts. From the second stage, already it is no longer the same. The enunciation of the fact would suit an infinity of other facts. So soon as language intervenes, I have at my command only a finite number of terms to express the shades, in number infinite, that my impressions might cover. When I say: It grows dark, that well expresses the impressions I feel in being present at an eclipse; but even in obscurity a multitude of shades could be imagined, and if, instead of that actually realized, had happened a slightly different shade, yet I should still have enunciated this other fact by saying: It grows dark.

What strikes me at the beginning is this. In the first of our six stages, the fact, still completely raw, is, so to speak, individual; it is entirely distinct from all other possible facts. By the second stage, it’s not the same anymore. The way I state the fact could apply to countless other facts. As soon as language comes into play, I have only a limited number of words to capture the countless nuances that my impressions might encompass. When I say: It’s getting dark, that accurately conveys the feelings I have while witnessing an eclipse; but even in darkness, a multitude of shades could be imagined, and if a slightly different shade had occurred instead of what actually happened, I would still express this other fact by saying: It’s getting dark.

Second remark: even at the second stage, the enunciation of a fact can only be true or false. This is not so of any proposition; if this proposition is the enunciation of a convention, it can not be said that this enunciation is true, in the proper sense of the word, since it could not be true apart from me and is true only because I wish it to be.

Second remark: even at the second stage, stating a fact can only be true or false. This isn't the case for any proposition; if this proposition is a statement of a convention, it can't be considered true, in the strict sense of the word, since it wouldn't be true without me and is only true because I choose for it to be.

When, for instance, I say the unit for length is the meter, this is a decree that I promulgate, it is not something ascertained which forces itself upon me. It is the same, as I think I have elsewhere shown, when it is a question, for example, of Euclid's postulate.

When I say that the unit for length is the meter, I’m making that decision; it’s not an obvious fact that I have to accept. It’s similar, as I believe I’ve demonstrated before, when discussing Euclid's postulate.

When I am asked: Is it growing dark? I always know whether I ought to reply yes or no. Although an infinity of possible facts may be susceptible of this same enunciation, it grows dark,[Pg 328] I shall always know whether the fact realized belongs or does not belong among those which answer to this enunciation. Facts are classed in categories, and if I am asked whether the fact that I ascertain belongs or does not belong in such a category, I shall not hesitate.

When someone asks me, "Is it getting dark?" I always know if I should say yes or no. Even though countless possible situations could fit this same statement, "it’s getting dark," I will always know if the situation I see belongs or doesn’t belong to that statement. Facts are sorted into categories, and if I’m asked whether the situation I notice fits or doesn’t fit in a certain category, I won’t hesitate.

Doubtless this classification is sufficiently arbitrary to leave a large part to man's freedom or caprice. In a word, this classification is a convention. This convention being given, if I am asked: Is such a fact true? I shall always know what to answer, and my reply will be imposed upon me by the witness of my senses.

Sure, here’s the modernized text: This classification is clearly arbitrary enough to allow for a lot of individual interpretation or whim. In short, this classification is a convention. Given this convention, if someone asks me: Is this fact true? I will always know how to respond, and my answer will be guided by what I perceive through my senses.

If therefore, during an eclipse, it is asked: Is it growing dark? all the world will answer yes. Doubtless those speaking a language where bright was called dark, and dark bright, would answer no. But of what importance is that?

If during an eclipse someone asks, "Is it getting dark?" everyone will say yes. Sure, if people spoke a language where "bright" meant "dark" and "dark" meant "bright," they would say no. But really, what does that matter?

In the same way, in mathematics, when I have laid down the definitions, and the postulates which are conventions, a theorem henceforth can only be true or false. But to answer the question: Is this theorem true? it is no longer to the witness of my senses that I shall have recourse, but to reasoning.

In the same way, in mathematics, once I have established the definitions and the postulates which are conventions, a theorem can only be true or false from that point on. But to answer the question: Is this theorem true? I will no longer rely on my senses, but on reasoning.

A statement of fact is always verifiable, and for the verification we have recourse either to the witness of our senses, or to the memory of this witness. This is properly what characterizes a fact. If you put the question to me: Is such a fact true? I shall begin by asking you, if there is occasion, to state precisely the conventions, by asking you, in other words, what language you have spoken; then once settled on this point, I shall interrogate my senses and shall answer yes or no. But it will be my senses that will have made answer, it will not be you when you say to me: I have spoken to you in English or in French.

A statement of fact is always something we can verify, and for that verification, we rely either on our senses or on our memories of those senses. This is what defines a fact. If you ask me, "Is this fact true?" I will first ask you to clarify the conventions, essentially asking what language you used. Once we've established that, I'll check my senses and respond with either yes or no. But it's my senses that provide the answer, not you when you say, "I spoke to you in English or French."

Is there something to change in all that when we pass to the following stages? When I observe a galvanometer, as I have just said, if I ask an ignorant visitor: Is the current passing? he looks at the wire to try to see something pass; but if I put the same question to my assistant who understands my language, he will know I mean: Does the spot move? and he will look at the scale.

Is there anything that should change in all of this as we move on to the next stages? When I look at a galvanometer, as I've just mentioned, if I ask someone who doesn't know anything: Is the current flowing? they will look at the wire to see if anything is passing through; but if I ask the same question to my assistant who understands what I mean, he will know I’m asking: Does the spot move? and he will check the scale.

What difference is there then between the statement of a fact[Pg 329] in the rough and the statement of a scientific fact? The same difference as between the statement of the same crude fact in French and in German. The scientific statement is the translation of the crude statement into a language which is distinguished above all from the common German or French, because it is spoken by a very much smaller number of people.

What’s the difference between stating a fact[Pg 329] in simple terms and stating a scientific fact? It’s the same as the difference between expressing the same basic fact in French and in German. The scientific statement is like translating the basic statement into a specialized language that's mainly different from ordinary German or French because it’s used by a much smaller group of people.

Yet let us not go too fast. To measure a current I may use a very great number of types of galvanometers or besides an electrodynamometer. And then when I shall say there is running in this circuit a current of so many amperes, that will mean: if I adapt to this circuit such a galvanometer I shall see the spot come to the division a; but that will mean equally: if I adapt to this circuit such an electrodynamometer, I shall see the spot go to the division b. And that will mean still many other things, because the current can manifest itself not only by mechanical effects, but by effects chemical, thermal, luminous, etc.

Yet let’s not rush. To measure a current, I can use a wide variety of galvanometers or even an electrodynamometer. When I say there’s a current of so many amperes flowing through this circuit, it means that if I connect a specific galvanometer to this circuit, I’ll see the needle point to division a; but it also means that if I connect a specific electrodynamometer, I’ll see the needle move to division b. Additionally, there are many other implications, since the current can show itself not only through mechanical effects but also through chemical, thermal, luminous effects, and more.

Here then is one same statement which suits a very great number of facts absolutely different. Why? It is because I assume a law according to which, whenever such a mechanical effect shall happen, such a chemical effect will happen also. Previous experiments, very numerous, have never shown this law to fail, and then I have understood that I could express by the same statement two facts so invariably bound one to the other.

Here is one statement that applies to a wide variety of completely different facts. Why? It’s because I assume a law that whenever a certain mechanical effect occurs, a specific chemical effect will also take place. Numerous previous experiments have consistently demonstrated that this law holds true, and I realized that I could describe two facts that are so consistently linked in the same way.

When I am asked: Is the current passing? I can understand that that means: Will such a mechanical effect happen? But I can understand also: Will such a chemical effect happen? I shall then verify either the existence of the mechanical effect, or that of the chemical effect; that will be indifferent, since in both cases the answer must be the same.

When I'm asked: Is the current passing? I understand that it means: Will this mechanical effect occur? But I can also understand it as: Will this chemical effect occur? I'll then check whether the mechanical effect exists or the chemical effect exists; it doesn't matter, because in both cases the answer will be the same.

And if the law should one day be found false? If it was perceived that the concordance of the two effects, mechanical and chemical, is not constant? That day it would be necessary to change the scientific language to free it from a grave ambiguity.

And what if the law were to be proven wrong one day? What if it turned out that the relationship between the mechanical and chemical effects isn’t consistent? On that day, it would be essential to revise the scientific terminology to eliminate a serious ambiguity.

And after that? Is it thought that ordinary language by aid of which are expressed the facts of daily life is exempt from ambiguity?

And after that? Is it believed that everyday language, which expresses the facts of daily life, is free from ambiguity?

Shall we thence conclude that the facts of daily life are the work of the grammarians?[Pg 330]

Can we then conclude that the details of everyday life are the work of the grammarians?[Pg 330]

You ask me: Is there a current? I try whether the mechanical effect exists, I ascertain it and I answer: Yes, there is a current. You understand at once that that means that the mechanical effect exists, and that the chemical effect, that I have not investigated, exists likewise. Imagine now, supposing an impossibility, the law we believe true, not to be, and the chemical effect not to exist. Under this hypothesis there will be two distinct facts, the one directly observed and which is true, the other inferred and which is false. It may strictly be said that we have created the second. So that error is the part of man's personal collaboration in the creation of the scientific fact.

You ask me: Is there a current? I check to see if the mechanical effect is present, confirm that it is, and respond: Yes, there is a current. You immediately understand that this means the mechanical effect exists, and that the chemical effect, which I haven’t looked into, also likely exists. Now, imagine, just for the sake of argument, that the law we believe to be true isn't, and that the chemical effect doesn’t exist. In this scenario, there would be two different facts: one that we can observe directly and that is true, and another that is inferred and false. It can be said that we essentially created the second fact. So, error reflects humanity's personal role in the formulation of scientific fact.

But if we can say that the fact in question is false, is this not just because it is not a free and arbitrary creation of our mind, a disguised convention, in which case it would be neither true nor false. And in fact it was verifiable; I had not made the verification, but I could have made it. If I answered amiss, it was because I chose to reply too quickly, without having asked nature, who alone knew the secret.

But if we can say that the fact in question is false, isn't it only because it isn't a free and arbitrary creation of our mind, a hidden agreement, in which case it wouldn't be true or false? And the fact is that it was something I could verify; I hadn't done the verification, but I could have. If I answered incorrectly, it was because I chose to respond too quickly, without asking nature, who alone knew the secret.

When, after an experiment, I correct the accidental and systematic errors to bring out the scientific fact, the case is the same; the scientific fact will never be anything but the crude fact translated into another language. When I shall say: It is such an hour, that will be a short way of saying: There is such a relation between the hour indicated by my clock, and the hour it marked at the moment of the passing of such a star and such another star across the meridian. And this convention of language once adopted, when I shall be asked: Is it such an hour? it will not depend upon me to answer yes or no.

When I adjust the random and systematic errors after an experiment to highlight the scientific fact, it’s still the same situation; the scientific fact will always just be the raw fact expressed in a different way. When I say: It is a certain hour, I'm really just saying: There is a specific relationship between the time shown on my clock and the time it recorded when certain stars crossed the meridian. Once this language convention is established, if someone asks me: Is it a certain hour? it won’t be up to me to say yes or no.

Let us pass to the stage before the last: the eclipse happened at the hour given by the tables deduced from Newton's laws. This is still a convention of language which is perfectly clear for those who know celestial mechanics or simply for those who have the tables calculated by the astronomers. I am asked: Did the eclipse happen at the hour predicted? I look in the nautical almanac, I see that the eclipse was announced for nine o'clock and I understand that the question means: Did the eclipse happen at nine o'clock? There still we have nothing to change in our conclusions. The scientific fact is only the crude fact translated into a convenient language.[Pg 331]

Let’s move on to the second-to-last stage: the eclipse occurred at the time listed in the tables based on Newton's laws. This is still a linguistic convention that is completely clear to those familiar with celestial mechanics or simply to those who have the tables prepared by astronomers. Someone asks me: Did the eclipse happen at the predicted time? I check the nautical almanac, I see that the eclipse was scheduled for nine o'clock, and I understand that the question means: Did the eclipse happen at nine o'clock? There’s nothing to adjust in our conclusions. The scientific fact is simply the raw fact expressed in convenient language.[Pg 331]

It is true that at the last stage things change. Does the earth rotate? Is this a verifiable fact? Could Galileo and the Grand Inquisitor, to settle the matter, appeal to the witness of their senses? On the contrary, they were in accord about the appearances, and whatever had been the accumulated experiences, they would have remained in accord with regard to the appearances without ever agreeing on their interpretation. It is just on that account that they were obliged to have recourse to procedures of discussion so unscientific.

It’s true that in the final stage, things shift. Does the earth spin? Is that something we can prove? Could Galileo and the Grand Inquisitor, to resolve the issue, rely on what they perceived? On the contrary, they agreed about what they saw, and no matter how much experience they had, they would have continued to see things the same way without ever agreeing on what it meant. That’s precisely why they had to resort to such unscientific methods of discussion.

This is why I think they did not disagree about a fact: we have not the right to give the same name to the rotation of the earth, which was the object of their discussion, and to the facts crude or scientific we have hitherto passed in review.

This is why I think they didn't disagree about a fact: we don't have the right to give the same name to the rotation of the earth, which was the focus of their discussion, and to the crude or scientific facts we've reviewed so far.

After what precedes, it seems superfluous to investigate whether the fact in the rough is outside of science, because there can neither be science without scientific fact, nor scientific fact without fact in the rough, since the first is only the translation of the second.

After what came before, it seems unnecessary to explore whether raw facts are outside the realm of science, because there can’t be science without scientific facts, nor scientific facts without raw facts, since the former is just a translation of the latter.

And then, has one the right to say that the scientist creates the scientific fact? First of all, he does not create it from nothing, since he makes it with the fact in the rough. Consequently he does not make it freely and as he chooses. However able the worker may be, his freedom is always limited by the properties of the raw material on which he works.

And then, can we really say that the scientist creates the scientific fact? First of all, they don’t create it from scratch, since they work with the raw facts. So, they don’t create it freely and according to their own choice. No matter how skilled the worker is, their freedom is always restricted by the properties of the raw material they’re working with.

After all, what do you mean when you speak of this free creation of the scientific fact and when you take as example the astronomer who intervenes actively in the phenomenon of the eclipse by bringing his clock? Do you mean: The eclipse happened at nine o'clock; but if the astronomer had wished it to happen at ten, that depended only on him, he had only to advance his clock an hour?

After all, what do you mean when you talk about the free creation of a scientific fact and when you cite the astronomer who actively intervenes in the eclipse by using his clock? Do you mean: The eclipse occurred at nine o'clock, but if the astronomer had wanted it to happen at ten, that was entirely up to him—he just had to move his clock ahead by an hour?

But the astronomer, in perpetrating that bad joke, would evidently have been guilty of an equivocation. When he tells me: The eclipse happened at nine, I understand that nine is the hour deduced from the crude indication of the pendulum by the usual series of corrections. If he has given me solely that crude indication, or if he has made corrections contrary to the habitual rules, he has changed the language agreed upon without forewarning[Pg 332] me. If, on the contrary, he took care to forewarn me, I have nothing to complain of, but then it is always the same fact expressed in another language.

But the astronomer, by making that bad joke, would clearly be guilty of a misunderstanding. When he tells me, "The eclipse happened at nine," I understand that nine is the time figured out from the basic reading of the pendulum using the usual corrections. If he only gave me that basic reading, or if he made corrections that go against the standard rules, he has changed the agreed-upon terms without letting me know[Pg 332]. If, on the other hand, he made sure to inform me, then I have no reason to complain, but it’s still the same fact expressed in different words.

In sum, all the scientist creates in a fact is the language in which he enunciates it. If he predicts a fact, he will employ this language, and for all those who can speak and understand it, his prediction is free from ambiguity. Moreover, this prediction once made, it evidently does not depend upon him whether it is fulfilled or not.

In summary, all the scientist creates in a fact is the language in which he states it. If he predicts a fact, he uses this language, and for everyone who can speak and understand it, his prediction is clear. Furthermore, once this prediction is made, it clearly does not depend on him whether it comes true or not.

What then remains of M. LeRoy's thesis? This remains: the scientist intervenes actively in choosing the facts worth observing. An isolated fact has by itself no interest; it becomes interesting if one has reason to think that it may aid in the prediction of other facts; or better, if, having been predicted, its verification is the confirmation of a law. Who shall choose the facts which, corresponding to these conditions, are worthy the freedom of the city in science? This is the free activity of the scientist.

What then is left of M. LeRoy's thesis? This: the scientist actively chooses which facts are worth observing. An isolated fact has no inherent interest; it becomes interesting if there’s a reason to believe it can help predict other facts; or better yet, if it has been predicted and its verification confirms a law. Who will choose the facts that meet these criteria and deserve the freedom of the city in science? This is the scientist's free activity.

And that is not all. I have said that the scientific fact is the translation of a crude fact into a certain language; I should add that every scientific fact is formed of many crude facts. This is sufficiently shown by the examples cited above. For instance, for the hour of the eclipse my clock marked the hour α at the instant of the eclipse; it marked the hour β at the moment of the last transit of the meridian of a certain star that we take as origin of right ascensions; it marked the hour γ at the moment of the preceding transit of this same star. There are three distinct facts (still it will be noticed that each of them results itself from two simultaneous facts in the rough; but let us pass this over). In place of that I say: The eclipse happened at the hour 24 (α−β) / (β−γ), and the three facts are combined in a single scientific fact. I have concluded that the three readings, α, β, γ made on my clock at three different moments lacked interest and that the only thing interesting was the combination (α−β) / (β−γ) of the three. In this conclusion is found the free activity of my mind.

And that’s not all. I've mentioned that a scientific fact is the translation of a basic fact into a specific language; I should add that every scientific fact consists of multiple basic facts. This is clearly illustrated by the examples I provided earlier. For example, my clock showed the time α at the moment of the eclipse; it showed the time β at the last transit of the meridian of a particular star that we use as the origin for right ascensions; it showed the time γ at the moment of the previous transit of this same star. There are three distinct facts (though it’s worth noting that each of them actually results from two simultaneous basic facts, but we can overlook that). Instead, I say: The eclipse occurred at the time 24 (α−β) / (β−γ), and the three facts are merged into a single scientific fact. I’ve determined that the three readings, α, β, γ recorded on my clock at three different times were not particularly interesting and that the only thing of interest was the combination (α−β) / (β−γ) of the three. In this conclusion lies the active engagement of my mind.

But I have thus used up my power; I can not make this combination (α−β) / (β−γ) have such a value and not such another, since I can not influence either the value of α, or that of β, or that of γ, which are imposed upon me as crude facts.[Pg 333]

But I have used up my ability; I can't make this combination (α−β) / (β−γ) have one value and not another, since I can't change the value of α, β, or γ, which are presented to me as basic facts.[Pg 333]

In sum, facts are facts, and if it happens that they satisfy a prediction, this is not an effect of our free activity. There is no precise frontier between the fact in the rough and the scientific fact; it can only be said that such an enunciation of fact is more crude or, on the contrary, more scientific than such another.

In short, facts are facts, and if they happen to agree with a prediction, that doesn’t come from our free will. There isn't a clear line between raw facts and scientific facts; we can only say that one statement of fact is more basic or, conversely, more scientific than another.

4. 'Nominalism' and 'the Universal Invariant'

If from facts we pass to laws, it is clear that the part of the free activity of the scientist will become much greater. But did not M. LeRoy make it still too great? This is what we are about to examine.

If we move from facts to laws, it's clear that the scientist's free activity will increase significantly. But did M. LeRoy make it too much? This is what we are going to look into.

Recall first the examples he has given. When I say: Phosphorus melts at 44°, I think I am enunciating a law; in reality it is just the definition of phosphorus; if one should discover a body which, possessing otherwise all the properties of phosphorus, did not melt at 44°, we should give it another name, that is all, and the law would remain true.

Recall first the examples he has given. When I say: Phosphorus melts at 44°, I think I am stating a law; in reality, it is just the definition of phosphorus. If someone were to find a substance that, having all the other properties of phosphorus, did not melt at 44°, we would just give it a different name, and the law would still hold true.

Just so when I say: Heavy bodies falling freely pass over spaces proportional to the squares of the times, I only give the definition of free fall. Whenever the condition shall not be fulfilled, I shall say that the fall is not free, so that the law will never be wrong. It is clear that if laws were reduced to that, they could not serve in prediction; then they would be good for nothing, either as means of knowledge or as principle of action.

Just to clarify, when I say that heavy objects falling freely cover distances proportional to the squares of the time they fall, I'm simply defining free fall. If that condition isn't met, I'll say the fall isn't free, so the law will always be accurate. It's obvious that if laws were only described that way, they wouldn't be useful for making predictions; then they wouldn't be valuable at all, either for gaining knowledge or as a basis for actions.

When I say: Phosphorus melts at 44°, I mean by that: All bodies possessing such or such a property (to wit, all the properties of phosphorus, save fusing-point) fuse at 44°. So understood, my proposition is indeed a law, and this law may be useful to me, because if I meet a body possessing these properties I shall be able to predict that it will fuse at 44°.

When I say, "Phosphorus melts at 44°," I mean that all substances with certain properties (specifically, all the properties of phosphorus, except for the melting point) melt at 44°. Understood this way, my statement is actually a law, and this law can be helpful to me because if I encounter a substance with these properties, I can predict that it will melt at 44°.

Doubtless the law may be found to be false. Then we shall read in the treatises on chemistry: "There are two bodies which chemists long confounded under the name of phosphorus; these two bodies differ only by their points of fusion." That would evidently not be the first time for chemists to attain to the separation of two bodies they were at first not able to distinguish; such, for example, are neodymium and praseodymium, long confounded under the name of didymium.[Pg 334]

No doubt the law might be proven wrong. Then we would read in chemistry books: "There are two substances that chemists have long confused as one, calling it phosphorus; these two substances only differ in their melting points." Clearly, this wouldn't be the first time chemists have succeeded in separating two substances they initially couldn’t tell apart; a good example is neodymium and praseodymium, which were long mistaken for didymium.[Pg 334]

I do not think the chemists much fear that a like mischance will ever happen to phosphorus. And if, to suppose the impossible, it should happen, the two bodies would probably not have identically the same density, identically the same specific heat, etc., so that after having determined with care the density, for instance, one could still foresee the fusion point.

I don’t think chemists really worry that a similar accident will ever happen to phosphorus. And if, just for the sake of argument, it did happen, the two substances probably wouldn’t have exactly the same density, exactly the same specific heat, etc., so after carefully determining the density, for example, one could still predict the melting point.

It is, moreover, unimportant; it suffices to remark that there is a law, and that this law, true or false, does not reduce to a tautology.

It’s also not that significant; it’s enough to point out that there’s a law, and that this law, whether true or false, doesn’t boil down to a tautology.

Will it be said that if we do not know on the earth a body which does not fuse at 44° while having all the other properties of phosphorus, we can not know whether it does not exist on other planets? Doubtless that may be maintained, and it would then be inferred that the law in question, which may serve as a rule of action to us who inhabit the earth, has yet no general value from the point of view of knowledge, and owes its interest only to the chance which has placed us on this globe. This is possible, but, if it were so, the law would be valueless, not because it reduced to a convention, but because it would be false.

Is it really possible to say that just because we haven't found a substance on Earth that doesn't melt at 44° and has all the other qualities of phosphorus, we can't know whether it exists on other planets? Sure, that could be argued, and it might suggest that the law in question, which guides those of us living on Earth, doesn't have any universal significance in terms of knowledge, and is only interesting due to the randomness of our existence on this planet. That’s a possibility, but if that were the case, the law would be worthless—not because it’s merely a convention, but because it would be incorrect.

The same is true in what concerns the fall of bodies. It would do me no good to have given the name of free fall to falls which happen in conformity with Galileo's law, if I did not know that elsewhere, in such circumstances, the fall will be probably free or approximately free. That then is a law which may be true or false, but which does not reduce to a convention.

The same applies to the fall of objects. It wouldn’t make sense for me to call it free fall based on Galileo's law if I didn’t know that under those conditions, the fall will be probably free or approximately free elsewhere. So, this is a law that could be true or false, but it’s not just a matter of convention.

Suppose the astronomers discover that the stars do not exactly obey Newton's law. They will have the choice between two attitudes; they may say that gravitation does not vary exactly as the inverse of the square of the distance, or else they may say that gravitation is not the only force which acts on the stars and that there is in addition a different sort of force.

Suppose the astronomers find out that the stars don't perfectly follow Newton's law. They will have two options; they can either say that gravity doesn't change exactly as the inverse of the square of the distance, or they could argue that gravity isn't the only force acting on the stars and that there's another type of force at play.

In the second case, Newton's law will be considered as the definition of gravitation. This will be the nominalist attitude. The choice between the two attitudes is free, and is made from considerations of convenience, though these considerations are most often so strong that there remains practically little of this freedom.

In the second case, Newton's law will be viewed as the definition of gravitation. This represents the nominalist perspective. The choice between the two perspectives is free and based on convenience, although these considerations are usually so compelling that there's hardly any real freedom left.

We can break up this proposition: (1) The stars obey Newton's[Pg 335] law, into two others; (2) gravitation obeys Newton's law; (3) gravitation is the only force acting on the stars. In this case proposition (2) is no longer anything but a definition and is beyond the test of experiment; but then it will be on proposition (3) that this check can be exercised. This is indeed necessary, since the resulting proposition (1) predicts verifiable facts in the rough.

We can break down this statement: (1) The stars follow Newton's[Pg 335] law, into two parts; (2) gravity follows Newton's law; (3) gravity is the only force acting on the stars. In this case, statement (2) is just a definition and can't be tested through experimentation; however, we can test statement (3). This is really important since the resulting statement (1) predicts observable facts in general.

It is thanks to these artifices that by an unconscious nominalism the scientists have elevated above the laws what they call principles. When a law has received a sufficient confirmation from experiment, we may adopt two attitudes: either we may leave this law in the fray; it will then remain subjected to an incessant revision, which without any doubt will end by demonstrating that it is only approximative. Or else we may elevate it into a principle by adopting conventions such that the proposition may be certainly true. For that the procedure is always the same. The primitive law enunciated a relation between two facts in the rough, A and B; between these two crude facts is introduced an abstract intermediary C, more or less fictitious (such was in the preceding example the impalpable entity, gravitation). And then we have a relation between A and C that we may suppose rigorous and which is the principle; and another between C and B which remains a law subject to revision.

It’s because of these tricks that, through an unconscious nominalism, scientists have put what they call principles above the laws. When a law has been sufficiently confirmed by experiments, we can take one of two approaches: we can either leave this law as it is, which means it will be continuously revised and will likely prove to be just an approximation. Or we can elevate it to a principle by adopting conventions so that the proposition can be considered definitely true. The process is always the same. The original law states a relationship between two rough facts, A and B; between these two basic facts lies an abstract intermediary C, which may be somewhat fictional (like the elusive entity of gravitation in the previous example). Then we establish a relationship between A and C that we can assume is rigorous and that becomes the principle; and another relationship between C and B that remains a law and is open to revision.

The principle, henceforth crystallized, so to speak, is no longer subject to the test of experiment. It is not true or false, it is convenient.

The principle, now clearly defined, is no longer up for experimental testing. It isn't about being true or false; it's about being practical.

Great advantages have often been found in proceeding in that way, but it is clear that if all the laws had been transformed into principles nothing would be left of science. Every law may be broken up into a principle and a law, but thereby it is very clear that, however far this partition be pushed, there will always remain laws.

Great benefits have often been discovered in following that approach, but it's obvious that if all the laws were turned into principles nothing would remain of science. Every law can be broken down into a principle and a law, but it's clear that no matter how far this division is taken, there will always be laws.

Nominalism has therefore limits, and this is what one might fail to recognize if one took to the very letter M. LeRoy's assertions.

Nominalism has its limits, and this is something one might overlook if they took M. LeRoy's statements too literally.

A rapid review of the sciences will make us comprehend better what are these limits. The nominalist attitude is justified only when it is convenient; when is it so?[Pg 336]

A quick look at the sciences will help us understand these limits better. The nominalist viewpoint only makes sense when it’s convenient; when is that?[Pg 336]

Experiment teaches us relations between bodies; this is the fact in the rough; these relations are extremely complicated. Instead of envisaging directly the relation of the body A and the body B, we introduce between them an intermediary, which is space, and we envisage three distinct relations: that of the body A with the figure of space, that of the body B with the figure of space, that of the two figures and to each other. Why is this detour advantageous? Because the relation of A and B was complicated, but differed little from that of and , which is simple; so that this complicated relation may be replaced by the simple relation between and and by two other relations which tell us that the differences between A and , on the one hand, between B and , on the other hand, are very small. For example, if A and B are two natural solid bodies which are displaced with slight deformation, we envisage two movable rigid figures and . The laws of the relative displacement of these figures and will be very simple; they will be those of geometry. And we shall afterward add that the body A, which always differs very little from , dilates from the effect of heat and bends from the effect of elasticity. These dilatations and flexions, just because they are very small, will be for our mind relatively easy to study. Just imagine to what complexities of language it would have been necessary to be resigned if we had wished to comprehend in the same enunciation the displacement of the solid, its dilatation and its flexure?

Experiment teaches us the relationships between objects; this is the basic fact. These relationships are incredibly complex. Instead of directly observing the relationship between object A and object B, we introduce an intermediary, which is space, and we look at three distinct relationships: the relationship of object A with the figure in space, the relationship of object B with the figure in space, and the relationship between the two figures and themselves. Why is this approach useful? Because the relationship between A and B was complex, but it closely resembles that of and , which is simpler; thus, this complicated relationship can be simplified by focusing on the straightforward relationship between and and by observing two other relationships that show that the differences between A and , on one hand, and between B and , on the other hand, are very small. For instance, if A and B are two natural solid objects that are moved with slight deformation, we consider two movable rigid figures and . The laws governing the relative movement of these figures and will be very straightforward; they will follow the principles of geometry. Later, we can add that object A, which always differs very little from , expands due to heat and bends due to elasticity. These expansions and bends, precisely because they are very small, will be relatively easy for our minds to study. Just think about how complicated the language would have to be if we wanted to describe the movement of the solid, its expansion, and its bending all in the same statement!

The relation between A and B was a rough law, and was broken up; we now have two laws which express the relations of A and , of B and , and a principle which expresses that of with . It is the aggregate of these principles that is called geometry.

The relationship between A and B was a basic law and has been separated; we now have two laws that describe the relationships of A and , of B and , along with a principle that describes how relates to . The collection of these principles is what we refer to as geometry.

Two other remarks. We have a relation between two bodies A and B, which we have replaced by a relation between two figures and ; but this same relation between the same two figures and could just as well have replaced advantageously a relation between two other bodies A´´ and B´´, entirely different from A and B. And that in many ways. If the principles of geometry had not been invented, after having studied the relation of A and B, it would be necessary to begin again ab ovo the study of the relation of A´´ and B´´. That is why geometry is so[Pg 337] precious. A geometrical relation can advantageously replace a relation which, considered in the rough state, should be regarded as mechanical, it can replace another which should be regarded as optical, etc.

Two other points. We have a relationship between two bodies A and B, which we've swapped for a relationship between two figures and ; but this same relationship between these two figures and could just as easily have replaced a relationship between two entirely different bodies A´´ and B´´, separate from A and B. And in many ways. If the principles of geometry hadn't been developed, after studying the relationship of A and B, we would need to start over completely from scratch in studying the relationship of A´´ and B´´. That’s why geometry is so[Pg 337]valuable. A geometrical relationship can effectively replace a relationship that, when seen in its basic form, should be considered mechanical, or it can replace another that should be viewed as optical, and so on.

Yet let no one say: But that proves geometry an experimental science; in separating its principles from laws whence they have been drawn, you artificially separate it itself from the sciences which have given birth to it. The other sciences have likewise principles, but that does not preclude our having to call them experimental.

Yet let no one say: But that proves geometry is an experimental science; in separating its principles from the laws from which they have been derived, you artificially separate it from the sciences that gave rise to it. Other sciences also have principles, but that doesn't mean we shouldn't call them experimental.

It must be recognized that it would have been difficult not to make this separation that is pretended to be artificial. We know the rôle that the kinematics of solid bodies has played in the genesis of geometry; should it then be said that geometry is only a branch of experimental kinematics? But the laws of the rectilinear propagation of light have also contributed to the formation of its principles. Must geometry be regarded both as a branch of kinematics and as a branch of optics? I recall besides that our Euclidean space which is the proper object of geometry has been chosen, for reasons of convenience, from among a certain number of types which preexist in our mind and which are called groups.

It’s important to acknowledge that it would have been tough not to make this separation that is claimed to be artificial. We understand the role that the kinematics of solid bodies has played in the development of geometry; should we then say that geometry is just a part of experimental kinematics? But the laws of straight-line propagation of light have also helped shape its principles. Should geometry be seen as both a part of kinematics and a part of optics? I also remember that our Euclidean space, which is the main focus of geometry, has been chosen for convenience from among several types that exist in our minds and are called groups.

If we pass to mechanics, we still see great principles whose origin is analogous, and, as their 'radius of action,' so to speak, is smaller, there is no longer reason to separate them from mechanics proper and to regard this science as deductive.

If we move on to mechanics, we still see important principles with a similar origin, and since their 'radius of action,' so to speak, is smaller, there's no longer a reason to separate them from mechanics itself or to view this science as purely deductive.

In physics, finally, the rôle of the principles is still more diminished. And in fact they are only introduced when it is of advantage. Now they are advantageous precisely because they are few, since each of them very nearly replaces a great number of laws. Therefore it is not of interest to multiply them. Besides an outcome is necessary, and for that it is needful to end by leaving abstraction to take hold of reality.

In physics, the role of principles is even more reduced. They are only brought in when it’s beneficial. They’re advantageous because there aren’t many of them, and each one nearly substitutes for a large number of laws. So, it doesn’t make sense to increase their number. Additionally, a result is essential, and to achieve that, we must ultimately allow abstraction to connect with reality.

Such are the limits of nominalism, and they are narrow.

Such are the limits of nominalism, and they are tight.

M. LeRoy has insisted, however, and he has put the question under another form.

M. LeRoy has insisted, though, and he has rephrased the question.

Since the enunciation of our laws may vary with the conventions that we adopt, since these conventions may modify even the[Pg 338] natural relations of these laws, is there in the manifold of these laws something independent of these conventions and which may, so to speak, play the rôle of universal invariant? For instance, the fiction has been introduced of beings who, having been educated in a world different from ours, would have been led to create a non-Euclidean geometry. If these beings were afterward suddenly transported into our world, they would observe the same laws as we, but they would enunciate them in an entirely different way. In truth there would still be something in common between the two enunciations, but this is because these beings do not yet differ enough from us. Beings still more strange may be imagined, and the part common to the two systems of enunciations will shrink more and more. Will it thus shrink in convergence toward zero, or will there remain an irreducible residue which will then be the universal invariant sought?

Since the way we state our laws can change depending on the conventions we follow, and since these conventions might even alter the natural relationships of these laws, is there something within this variety of laws that is independent of these conventions and could serve as a kind of universal invariant? For example, consider the idea of beings who, having been raised in a world different from ours, would create a non-Euclidean geometry. If these beings were suddenly brought into our world, they would observe the same laws we do, but they would express them in a completely different manner. In truth, there would still be some commonality between the two expressions, but that's because these beings don't differ enough from us yet. We can imagine even more alien beings, and the shared aspects of the two systems of expressions would diminish even further. Will it continue to shrink down toward zero, or will there be a remaining core that serves as the universal invariant we’re looking for?

The question calls for precise statement. Is it desired that this common part of the enunciations be expressible in words? It is clear, then, that there are not words common to all languages, and we can not pretend to construct I know not what universal invariant which should be understood both by us and by the fictitious non-Euclidean geometers of whom I have just spoken; no more than we can construct a phrase which can be understood both by Germans who do not understand French and by French who do not understand German. But we have fixed rules which permit us to translate the French enunciations into German, and inversely. It is for that that grammars and dictionaries have been made. There are also fixed rules for translating the Euclidean language into the non-Euclidean language, or, if there are not, they could be made.

The question requires a clear answer. Do we want to express this common part of the statements in words? It's clear that there aren't words that are universal to all languages, and we can't expect to create some sort of universal language that would be understood both by us and by the imaginary non-Euclidean geometers I just mentioned; just like we can't create a sentence that would be understood by Germans who don't know French and by French speakers who don't know German. However, we have established rules that allow us to translate French statements into German, and vice versa. That’s why grammars and dictionaries exist. There are also established rules for translating Euclidean language into non-Euclidean language, or if they don’t exist, we could certainly create them.

And even if there were neither interpreter nor dictionary, if the Germans and the French, after having lived centuries in separate worlds, found themselves all at once in contact, do you think there would be nothing in common between the science of the German books and that of the French books? The French and the Germans would certainly end by understanding each other, as the American Indians ended by understanding the language of their conquerors after the arrival of the Spanish.

And even if there were no interpreter or dictionary, if the Germans and the French, after having lived for centuries in separate worlds, suddenly found themselves in contact, do you think there would be nothing in common between the science in German books and that in French books? The French and Germans would definitely figure out how to understand each other, just as the American Indians eventually understood the language of their conquerors after the arrival of the Spanish.

But, it will be said, doubtless the French would be capable of[Pg 339] understanding the Germans even without having learned German, but this is because there remains between the French and the Germans something in common, since both are men. We should still attain to an understanding with our hypothetical non-Euclideans, though they be not men, because they would still retain something human. But in any case a minimum of humanity is necessary.

But, it's likely that the French could understand the Germans even without learning German, because there’s still something in common between the French and the Germans since they are both human. We should also be able to connect with our hypothetical non-Euclideans, even though they aren’t human, because they would still possess some human qualities. However, in any case, a minimum level of humanity is essential.

This is possible, but I shall observe first that this little humanness which would remain in the non-Euclideans would suffice not only to make possible the translation of a little of their language, but to make possible the translation of all their language.

This is possible, but I want to point out first that this small bit of humanity that would still exist in the non-Euclideans would be enough not only to enable the translation of a little of their language but also to allow the translation of all of their language.

Now, that there must be a minimum is what I concede; suppose there exists I know not what fluid which penetrates between the molecules of our matter, without having any action on it and without being subject to any action coming from it. Suppose beings sensible to the influence of this fluid and insensible to that of our matter. It is clear that the science of these beings would differ absolutely from ours and that it would be idle to seek an 'invariant' common to these two sciences. Or again, if these beings rejected our logic and did not admit, for instance, the principle of contradiction.

Now, I agree that there must be a minimum; let's imagine there’s some kind of fluid that seeps between the molecules of our material world, without affecting it and not being affected by it. Imagine beings who can sense the influence of this fluid but are unaware of our material world. Clearly, the science of these beings would be completely different from ours, and it would be pointless to look for an 'invariant' that connects the two sciences. Or, if these beings dismissed our logic and didn’t accept, for example, the principle of contradiction.

But truly I think it without interest to examine such hypotheses.

But honestly, I find it uninteresting to look into such theories.

And then, if we do not push whimsicality so far, if we introduce only fictitious beings having senses analogous to ours and sensible to the same impressions, and moreover admitting the principles of our logic, we shall then be able to conclude that their language, however different from ours it may be, would always be capable of translation. Now the possibility of translation implies the existence of an invariant. To translate is precisely to disengage this invariant. Thus, to decipher a cryptogram is to seek what in this document remains invariant, when the letters are permuted.

And then, if we don't take whimsicality too far, if we introduce only fictional beings with senses similar to ours and responsive to the same stimuli, and also following the principles of our logic, we can conclude that their language, no matter how different from ours, would always be translatable. Now, the possibility of translation implies that there’s something constant. To translate is specifically to identify this constant. So, deciphering a cryptogram is about finding what stays the same in this document when the letters are rearranged.

What now is the nature of this invariant it is easy to understand, and a word will suffice us. The invariant laws are the relations between the crude facts, while the relations between the 'scientific facts' remain always dependent on certain conventions.

What is the nature of this invariant? It’s easy to grasp, and a single word will do. The invariant laws are the connections between the basic facts, while the connections between the 'scientific facts' are always dependent on specific conventions.


CHAPTER XI

Science and Reality

5. Contingence and Determinism

I do not intend to treat here the question of the contingence of the laws of nature, which is evidently insoluble, and on which so much has already been written. I only wish to call attention to what different meanings have been given to this word, contingence, and how advantageous it would be to distinguish them.

I don’t plan to address the issue of the contingency of natural laws here, which is clearly unsolvable and has already been discussed extensively. I just want to highlight the different meanings that have been attached to the term "contingency" and how helpful it would be to differentiate between them.

If we look at any particular law, we may be certain in advance that it can only be approximate. It is, in fact, deduced from experimental verifications, and these verifications were and could be only approximate. We should always expect that more precise measurements will oblige us to add new terms to our formulas; this is what has happened, for instance, in the case of Mariotte's law.

If we examine any specific law, we can be sure right away that it will only be an approximation. It's actually based on experimental confirmations, which were and can only be approximate. We should always anticipate that more accurate measurements will force us to add new elements to our formulas; this is what happened, for example, with Mariotte's law.

Moreover the statement of any law is necessarily incomplete. This enunciation should comprise the enumeration of all the antecedents in virtue of which a given consequent can happen. I should first describe all the conditions of the experiment to be made and the law would then be stated: If all the conditions are fulfilled, the phenomenon will happen.

Moreover, any statement of a law is inevitably incomplete. This explanation should include the listing of all the factors that can lead to a specific outcome. I should first outline all the conditions for the experiment to be conducted, and then the law would be stated: If all the conditions are met, the phenomenon will occur.

But we shall be sure of not having forgotten any of these conditions only when we shall have described the state of the entire universe at the instant t; all the parts of this universe may, in fact, exercise an influence more or less great on the phenomenon which must happen at the instant t + dt.

But we can be certain that we haven't overlooked any of these conditions only when we've described the state of the entire universe at the moment t; all the parts of this universe can actually have varying degrees of influence on the event that will occur at the moment t + dt.

Now it is clear that such a description could not be found in the enunciation of the law; besides, if it were made, the law would become incapable of application; if one required so many conditions, there would be very little chance of their ever being all realized at any moment.

Now it’s clear that such a description couldn't be found in the wording of the law; furthermore, if it were provided, the law would be impossible to apply; if so many conditions were required, there would be minimal chance of all of them being met at any point in time.

Then as one can never be certain of not having forgotten some essential condition, it can not be said: If such and such conditions[Pg 341] are realized, such a phenomenon will occur; it can only be said: If such and such conditions are realized, it is probable that such a phenomenon will occur, very nearly.

Then, since you can never be completely sure that you haven't overlooked some essential condition, you can't say: If certain conditions[Pg 341] are met, a specific phenomenon will definitely happen; you can only say: If certain conditions are met, it's likely that a specific phenomenon will occur, quite closely.

Take the law of gravitation, which is the least imperfect of all known laws. It enables us to foresee the motions of the planets. When I use it, for instance, to calculate the orbit of Saturn, I neglect the action of the stars, and in doing so I am certain of not deceiving myself, because I know that these stars are too far away for their action to be sensible.

Take the law of gravitation, which is the least flawed of all known laws. It allows us to predict the movements of the planets. For example, when I use it to calculate Saturn's orbit, I ignore the influence of the stars, and I am confident that I am not misleading myself because I understand that these stars are too distant for their impact to be significant.

I announce, then, with a quasi-certitude that the coordinates of Saturn at such an hour will be comprised between such and such limits. Yet is that certitude absolute? Could there not exist in the universe some gigantic mass, much greater than that of all the known stars and whose action could make itself felt at great distances? That mass might be animated by a colossal velocity, and after having circulated from all time at such distances that its influence had remained hitherto insensible to us, it might come all at once to pass near us. Surely it would produce in our solar system enormous perturbations that we could not have foreseen. All that can be said is that such an event is wholly improbable, and then, instead of saying: Saturn will be near such a point of the heavens, we must limit ourselves to saying: Saturn will probably be near such a point of the heavens. Although this probability may be practically equivalent to certainty, it is only a probability.

I announce, then, with a sort of confidence that the coordinates of Saturn at that time will be within certain limits. But is that confidence absolute? Could there be some enormous mass in the universe, much larger than all the known stars, whose influence could be felt at great distances? That mass could be moving at an incredible speed and, having traveled at such distances for so long that its effect has gone unnoticed, it might suddenly come close to us. Surely, it would cause huge disturbances in our solar system that we wouldn't have predicted. All we can really say is that such an event is very unlikely, and instead of saying: Saturn will be near a specific point in the sky, we should say: Saturn will probably be near a specific point in the sky. While this probability may be practically the same as certainty, it is still just a probability.

For all these reasons, no particular law will ever be more than approximate and probable. Scientists have never failed to recognize this truth; only they believe, right or wrong, that every law may be replaced by another closer and more probable, that this new law will itself be only provisional, but that the same movement can continue indefinitely, so that science in progressing will possess laws more and more probable, that the approximation will end by differing as little as you choose from exactitude and the probability from certitude.

For all these reasons, no law will ever be more than approximate and likely. Scientists have always acknowledged this truth; they just believe, whether correctly or incorrectly, that every law can be replaced by another that is closer and more likely. This new law will also be temporary, but the process can go on indefinitely, so that as science progresses, it will have laws that are increasingly probable, with the approximation eventually differing as little as you want from exactness and the probability from certainty.

If the scientists who think thus are right, still could it be said that the laws of nature are contingent, even though each law, taken in particular, may be qualified as contingent? Or must one require, before concluding the contingence of the natural laws,[Pg 342] that this progress have an end, that the scientist finish some day by being arrested in his search for a closer and closer approximation, and that, beyond a certain limit, he thereafter meet in nature only caprice?

If the scientists who think this way are correct, can we still say that the laws of nature are dependent on circumstances, even though each law, when looked at individually, might be seen as conditional? Or do we have to insist, before concluding that the natural laws are contingent,[Pg 342] that this progress must come to an end, that the scientist will eventually stop his quest for a more precise understanding, and that, beyond a certain point, he will only encounter randomness in nature?

In the conception of which I have just spoken (and which I shall call the scientific conception), every law is only a statement imperfect and provisional, but it must one day be replaced by another, a superior law, of which it is only a crude image. No place therefore remains for the intervention of a free will.

In the idea I just talked about (which I’ll call the scientific idea), every law is just an incomplete and temporary statement, but it will eventually be replaced by another, a better law, of which it is just a rough representation. So, there’s no room for the influence of free will.

It seems to me that the kinetic theory of gases will furnish us a striking example.

It seems to me that the kinetic theory of gases will give us a striking example.

You know that in this theory all the properties of gases are explained by a simple hypothesis; it is supposed that all the gaseous molecules move in every direction with great velocities and that they follow rectilineal paths which are disturbed only when one molecule passes very near the sides of the vessel or another molecule. The effects our crude senses enable us to observe are the mean effects, and in these means, the great deviations compensate, or at least it is very improbable that they do not compensate; so that the observable phenomena follow simple laws such as that of Mariotte or of Gay-Lussac. But this compensation of deviations is only probable. The molecules incessantly change place and in these continual displacements the figures they form pass successively through all possible combinations. Singly these combinations are very numerous; almost all are in conformity with Mariotte's law, only a few deviate from it. These also will happen, only it would be necessary to wait a long time for them. If a gas were observed during a sufficiently long time it would certainly be finally seen to deviate, for a very short time, from Mariotte's law. How long would it be necessary to wait? If it were desired to calculate the probable number of years, it would be found that this number is so great that to write only the number of places of figures employed would still require half a score places of figures. No matter; enough that it may be done.

You know that in this theory, all the properties of gases are explained by a simple hypothesis. It's assumed that all gaseous molecules move rapidly in every direction and follow straight paths, which are only disturbed when a molecule comes very close to the walls of the container or to another molecule. The effects we can observe with our limited senses are average effects, and in these averages, the large deviations balance out, or at least it's very unlikely that they don't balance out. This means that the observable phenomena follow simple laws like those of Boyle or Gay-Lussac. However, this balancing of deviations is only probable. The molecules are constantly changing places, and in these continuous movements, the arrangements they form go through all possible combinations. Individually, these combinations are very numerous; almost all comply with Boyle's law, and only a few deviate from it. Those deviations will occur too, but it would take a long time for them to happen. If a gas were observed for a long enough period, it would eventually be seen to deviate, even if just for a brief moment, from Boyle's law. How long would one need to wait? If we wanted to calculate the probable number of years, we’d find that this number is so large that writing just the figure placeholders needed would require at least half a dozen digits. But that's fine; it's enough to know it can be done.

I do not care to discuss here the value of this theory. It is evident that if it be adopted, Mariotte's law will thereafter appear only as contingent, since a day will come when it will not[Pg 343] be true. And yet, think you the partisans of the kinetic theory are adversaries of determinism? Far from it; they are the most ultra of mechanists. Their molecules follow rigid paths, from which they depart only under the influence of forces which vary with the distance, following a perfectly determinate law. There remains in their system not the smallest place either for freedom, or for an evolutionary factor, properly so-called, or for anything whatever that could be called contingence. I add, to avoid mistake, that neither is there any evolution of Mariotte's law itself; it ceases to be true after I know not how many centuries; but at the end of a fraction of a second it again becomes true and that for an incalculable number of centuries.

I don’t want to discuss the value of this theory here. It’s clear that if it’s accepted, Mariotte’s law will only seem temporary, because eventually there will be a time when it won’t be true anymore. Do you think that the supporters of the kinetic theory are against determinism? Not at all; they are the most extreme mechanists. Their molecules follow strict paths, only deviating when influenced by forces that change with distance, adhering to a completely determined law. In their system, there’s not a hint of freedom, an evolutionary factor, or anything that could be called contingency. I want to clarify that there’s also no evolution of Mariotte’s law itself; it stops being true after an unknown number of centuries, but after just a fraction of a second, it becomes true again for an immeasurable number of centuries.

And since I have pronounced the word evolution, let us clear away another mistake. It is often said: Who knows whether the laws do not evolve and whether we shall not one day discover that they were not at the Carboniferous epoch what they are to-day? What are we to understand by that? What we think we know about the past state of our globe, we deduce from its present state. And how is this deduction made? It is by means of laws supposed known. The law, being a relation between the antecedent and the consequent, enables us equally well to deduce the consequent from the antecedent, that is, to foresee the future, and to deduce the antecedent from the consequent, that is, to conclude from the present to the past. The astronomer who knows the present situation of the stars can from it deduce their future situation by Newton's law, and this is what he does when he constructs ephemerides; and he can equally deduce from it their past situation. The calculations he thus can make can not teach him that Newton's law will cease to be true in the future, since this law is precisely his point of departure; not more can they tell him it was not true in the past. Still, in what concerns the future, his ephemerides can one day be tested and our descendants will perhaps recognize that they were false. But in what concerns the past, the geologic past which had no witnesses, the results of his calculation, like those of all speculations where we seek to deduce the past from the present, escape by their very nature every species of test. So that if the laws of nature were not the same in the Carboniferous age as at the present[Pg 344] epoch, we shall never be able to know it, since we can know nothing of this age, only what we deduce from the hypothesis of the permanence of these laws.

And since I mentioned evolution, let’s clear up another misconception. It’s often said: Who knows if the laws don't change and if we won’t discover one day that they were different in the Carboniferous period than they are today? What does that really mean? What we think we know about the past state of our planet comes from its current state. And how do we make this deduction? It’s through laws we assume to be known. The law, being a relationship between the cause and the effect, allows us to deduce the effect from the cause, meaning we can predict the future, and to deduce the cause from the effect, meaning we can infer the past from the present. An astronomer who knows the current positions of the stars can use that information to predict their future positions based on Newton's law, and that’s exactly what they do when they create ephemerides; they can also infer their past positions from that data. However, the calculations they make can’t tell them that Newton's law will no longer hold true in the future, since that law is the foundation of their work; and they can’t tell them it wasn’t true in the past either. Nonetheless, regarding the future, their ephemerides can eventually be tested, and someday, our descendants might recognize them as incorrect. But when it comes to the past, the geological past which had no witnesses, the results of their calculations, like all speculations where we try to infer the past from the present, cannot be tested at all. So if the laws of nature were not the same during the Carboniferous period as they are now[Pg 344], we will never know, as we can only know about that time based on the assumption that these laws have remained consistent.

Perhaps it will be said that this hypothesis might lead to contradictory results and that we shall be obliged to abandon it. Thus, in what concerns the origin of life, we may conclude that there have always been living beings, since the present world shows us always life springing from life; and we may also conclude that there have not always been, since the application of the existent laws of physics to the present state of our globe teaches us that there was a time when this globe was so warm that life on it was impossible. But contradictions of this sort can always be removed in two ways; it may be supposed that the actual laws of nature are not exactly what we have assumed; or else it may be supposed that the laws of nature actually are what we have assumed, but that it has not always been so.

Maybe it’ll be said that this hypothesis might lead to conflicting results, and that we’ll have to let it go. So, when it comes to the origin of life, we can conclude that there have always been living beings since the current world constantly shows us life coming from life; and we can also conclude that there hasn’t always been life, as applying the existing laws of physics to the current state of our planet indicates that there was a time when the Earth was so hot that life couldn’t exist. But contradictions like this can always be resolved in two ways: we can assume that the current laws of nature aren’t exactly what we think they are; or we can assume that the laws of nature are indeed what we’ve assumed, but that it hasn’t always been the case.

It is evident that the actual laws will never be sufficiently well known for us not to be able to adopt the first of these two solutions and for us to be constrained to infer the evolution of natural laws.

It’s clear that the actual laws will never be widely understood enough for us to adopt the first of these two solutions, which leaves us with inferring the evolution of natural laws.

On the other hand, suppose such an evolution; assume, if you wish, that humanity lasts sufficiently long for this evolution to have witnesses. The same antecedent shall produce, for instance, different consequents at the Carboniferous epoch and at the Quaternary. That evidently means that the antecedents are closely alike; if all the circumstances were identical, the Carboniferous epoch would be indistinguishable from the Quaternary. Evidently this is not what is supposed. What remains is that such antecedent, accompanied by such accessory circumstance, produces such consequent; and that the same antecedent, accompanied by such other accessory circumstance, produces such other consequent. Time does not enter into the affair.

On the other hand, imagine such an evolution; let’s assume, if you like, that humanity lasts long enough for this evolution to have witnesses. The same antecedent will produce, for example, different results in the Carboniferous period and in the Quaternary. This clearly means that the antecedents are very similar; if all the conditions were the same, the Carboniferous period would be indistinguishable from the Quaternary. Clearly, this is not what is being assumed. What remains is that a certain antecedent, along with a specific accessory circumstance, produces a certain result; and that the same antecedent, with a different accessory circumstance, produces another result. Time doesn't play a role in this situation.

The law, such as ill-informed science would have stated it, and which would have affirmed that this antecedent always produces this consequent, without taking account of the accessory circumstances, this law, which was only approximate and probable, must be replaced by another law more approximate and more probable, which brings in these accessory circumstances. We[Pg 345] always come back, therefore, to that same process which we have analyzed above, and if humanity should discover something of this sort, it would not say that it is the laws which have evoluted, but the circumstances which have changed.

The law, as misguided science might have described it, which claimed that this cause always leads to that effect without considering the additional circumstances, was merely an approximation and not entirely reliable. It needs to be replaced by a more accurate and reliable law that takes these additional circumstances into account. We[Pg 345] consistently return to the same process we analyzed earlier, and if humanity were to discover something like this, it wouldn’t claim that the laws have evolved, but rather that the circumstances have changed.

Here, therefore, are several different senses of the word contingence. M. LeRoy retains them all and he does not sufficiently distinguish them, but he introduces a new one. Experimental laws are only approximate, and if some appear to us as exact, it is because we have artificially transformed them into what I have above called a principle. We have made this transformation freely, and as the caprice which has determined us to make it is something eminently contingent, we have communicated this contingence to the law itself. It is in this sense that we have the right to say that determinism supposes freedom, since it is freely that we become determinists. Perhaps it will be found that this is to give large scope to nominalism and that the introduction of this new sense of the word contingence will not help much to solve all those questions which naturally arise and of which we have just been speaking.

Here are several different meanings of the word "contingence." M. LeRoy keeps them all but doesn't clearly differentiate between them, while also introducing a new one. Experimental laws are only approximate, and if some seem exact to us, it’s because we’ve changed them into what I previously referred to as a principle. We have made this change consciously, and since the whim that led us to do so is completely contingent, we’ve transferred this contingency to the law itself. It is in this context that we can say determinism implies freedom, as we become determinists by choice. Perhaps this broadens the concept of nominalism, and introducing this new meaning of "contingence" may not significantly clarify the questions we've just been discussing.

I do not at all wish to investigate here the foundations of the principle of induction; I know very well that I should not succeed; it is as difficult to justify this principle as to get on without it. I only wish to show how scientists apply it and are forced to apply it.

I don’t want to explore the foundations of the principle of induction here; I know I wouldn’t succeed. It’s just as hard to justify this principle as it is to manage without it. I only want to demonstrate how scientists use it and have to use it.

When the same antecedent recurs, the same consequent must likewise recur; such is the ordinary statement. But reduced to these terms this principle could be of no use. For one to be able to say that the same antecedent recurred, it would be necessary for the circumstances all to be reproduced, since no one is absolutely indifferent, and for them to be exactly reproduced. And, as that will never happen, the principle can have no application.

When the same cause happens again, the same effect should also happen again; that's the usual idea. But put this way, this principle wouldn't be helpful. For someone to claim that the same cause occurred again, all the circumstances would have to be recreated since no one is completely indifferent, and they would need to be perfectly replicated. And since that will never happen, the principle can't really be applied.

We should therefore modify the enunciation and say: If an antecedent A has once produced a consequent B, an antecedent , slightly different from A, will produce a consequent , slightly different from B. But how shall we recognize that the antecedents A and are 'slightly different'? If some one of the circumstances can be expressed by a number, and this number[Pg 346] has in the two cases values very near together, the sense of the phrase 'slightly different' is relatively clear; the principle then signifies that the consequent is a continuous function of the antecedent. And as a practical rule, we reach this conclusion that we have the right to interpolate. This is in fact what scientists do every day, and without interpolation all science would be impossible.

We should adjust our statement to say: If an antecedent A has produced a consequent B, then an antecedent , which is slightly different from A, will produce a consequent , which is slightly different from B. But how do we determine that the antecedents A and are 'slightly different'? If one of the circumstances can be represented by a number, and this number[Pg 346] has values that are very close together in both cases, then the meaning of 'slightly different' is relatively clear; this principle indicates that the consequent is a continuous function of the antecedent. As a practical guideline, we conclude that we are allowed to interpolate. This is essentially what scientists do every day, and without interpolation, all science would be impossible.

Yet observe one thing. The law sought may be represented by a curve. Experiment has taught us certain points of this curve. In virtue of the principle we have just stated, we believe these points may be connected by a continuous graph. We trace this graph with the eye. New experiments will furnish us new points of the curve. If these points are outside of the graph traced in advance, we shall have to modify our curve, but not to abandon our principle. Through any points, however numerous they may be, a continuous curve may always be passed. Doubtless, if this curve is too capricious, we shall be shocked (and we shall even suspect errors of experiment), but the principle will not be directly put at fault.

Yet notice one thing. The law we're looking for can be represented by a curve. Experiments have taught us certain points on this curve. Based on the principle we just stated, we believe these points can be connected by a continuous graph. We trace this graph visually. New experiments will give us new points on the curve. If these points fall outside of the previously drawn graph, we will need to adjust our curve, but we won't abandon our principle. For any number of points, a continuous curve can always be drawn through them. Of course, if this curve is too erratic, we might be taken aback (and suspect errors in the experiment), but the principle won’t be directly blamed.

Furthermore, among the circumstances of a phenomenon, there are some that we regard as negligible, and we shall consider A and as slightly different if they differ only by these accessory circumstances. For instance, I have ascertained that hydrogen unites with oxygen under the influence of the electric spark, and I am certain that these two gases will unite anew, although the longitude of Jupiter may have changed considerably in the interval. We assume, for instance, that the state of distant bodies can have no sensible influence on terrestrial phenomena, and that seems in fact requisite, but there are cases where the choice of these practically indifferent circumstances admits of more arbitrariness or, if you choose, requires more tact.

Furthermore, among the circumstances of a phenomenon, there are some that we consider insignificant, and we will regard A and as slightly different if they only differ by these minor circumstances. For example, I have found that hydrogen combines with oxygen when affected by an electric spark, and I'm sure these two gases will combine again, even if Jupiter's position has changed significantly in the meantime. We assume, for instance, that the state of distant bodies has no noticeable effect on events on Earth, and that seems necessary; however, there are situations where the choice of these practically irrelevant circumstances allows for more flexibility or, if you prefer, requires more sensitivity.

One more remark: The principle of induction would be inapplicable if there did not exist in nature a great quantity of bodies like one another, or almost alike, and if we could not infer, for instance, from one bit of phosphorus to another bit of phosphorus.

One more point: The principle of induction wouldn't apply if there weren't a large number of similar objects in nature, or almost similar, and if we couldn't draw conclusions, for example, from one piece of phosphorus to another piece of phosphorus.

If we reflect on these considerations, the problem of determinism and of contingence will appear to us in a new light.[Pg 347]

If we think about these ideas, the issue of determinism and chance will come to us in a new way.[Pg 347]

Suppose we were able to embrace the series of all phenomena of the universe in the whole sequence of time. We could envisage what might be called the sequences; I mean relations between antecedent and consequent. I do not wish to speak of constant relations or laws, I envisage separately (individually, so to speak) the different sequences realized.

Suppose we could take in all the events of the universe throughout time. We could imagine what could be called the sequences; I mean the connections between causes and effects. I’m not talking about constant relationships or laws; I’m picturing the distinct sequences as they occur individually.

We should then recognize that among these sequences there are no two altogether alike. But, if the principle of induction, as we have just stated it, is true, there will be those almost alike and that can be classed alongside one another. In other words, it is possible to make a classification of sequences.

We should acknowledge that among these sequences, no two are exactly the same. However, if the principle of induction, as we've just explained, is true, there will be those that are very similar and can be grouped together. In other words, it's possible to categorize sequences.

It is to the possibility and the legitimacy of such a classification that determinism, in the end, reduces. This is all that the preceding analysis leaves of it. Perhaps under this modest form it will seem less appalling to the moralist.

It is to the possibility and the legitimacy of such a classification that determinism ultimately boils down to. This is all that the previous analysis reveals. Maybe in this simpler form, it will seem less shocking to the moralist.

It will doubtless be said that this is to come back by a detour to M. LeRoy's conclusion which a moment ago we seemed to reject: we are determinists voluntarily. And in fact all classification supposes the active intervention of the classifier. I agree that this may be maintained, but it seems to me that this detour will not have been useless and will have contributed to enlighten us a little.

It will surely be said that we are taking a roundabout way back to M. LeRoy's conclusion, which we seemed to dismiss a moment ago: we are determinists by choice. And in fact, all classification assumes the active participation of the person classifying. I agree that this can be argued, but I believe this detour hasn't been wasted and has helped to shed some light on the matter.

6. Objectivity of Science

I arrive at the question set by the title of this article: What is the objective value of science? And first what should we understand by objectivity?

I come to the question posed by the title of this article: What is the actual value of science? And first, what do we mean by objectivity?

What guarantees the objectivity of the world in which we live is that this world is common to us with other thinking beings. Through the communications that we have with other men, we receive from them ready-made reasonings; we know that these reasonings do not come from us and at the same time we recognize in them the work of reasonable beings like ourselves. And as these reasonings appear to fit the world of our sensations, we think we may infer that these reasonable beings have seen the same thing as we; thus it is we know we have not been dreaming.

What ensures the objectivity of the world we live in is that it's shared with other thinking beings. Through our interactions with others, we receive pre-formed reasoning; we understand that these thoughts aren’t our own, yet we recognize them as the product of rational beings like ourselves. And since these reasonings seem to align with the world of our sensations, we believe we can conclude that these rational beings have perceived the same things we have; that’s how we know we haven’t been dreaming.

Such, therefore, is the first condition of objectivity; what is objective must be common to many minds and consequently transmissible from one to the other, and as this transmission can only[Pg 348] come about by that 'discourse' which inspires so much distrust in M. LeRoy, we are even forced to conclude: no discourse, no objectivity.

Such is the first requirement for objectivity: what is objective must be shared by many minds and, therefore, able to be communicated from one person to another. Since this communication can only[Pg 348] happen through that 'discourse' which creates so much distrust in M. LeRoy, we must conclude: no discourse, no objectivity.

The sensations of others will be for us a world eternally closed. We have no means of verifying that the sensation I call red is the same as that which my neighbor calls red.

The feelings of others will always be a world that’s completely closed off to us. We have no way of confirming that the sensation I refer to as red is the same as what my neighbor calls red.

Suppose that a cherry and a red poppy produce on me the sensation A and on him the sensation B and that, on the contrary, a leaf produces on me the sensation B and on him the sensation A. It is clear we shall never know anything about it; since I shall call red the sensation A and green the sensation B, while he will call the first green and the second red. In compensation, what we shall be able to ascertain is that, for him as for me, the cherry and the red poppy produce the same sensation, since he gives the same name to the sensations he feels and I do the same.

Suppose that a cherry and a red poppy make me feel sensation A and him sensation B, while a leaf gives me sensation B and him sensation A. It's obvious we'll never really know what's what; I'll call red the sensation A and green the sensation B, while he'll call the first green and the second red. However, what we can figure out is that, for both of us, the cherry and the red poppy create the same sensation, since he labels the sensations he experiences with the same names I use for mine.

Sensations are therefore intransmissible, or rather all that is pure quality in them is intransmissible and forever impenetrable. But it is not the same with relations between these sensations.

Sensations are therefore not transferable, or rather, all that is pure quality in them is not transferable and forever incomprehensible. However, the same does not apply to the relationships between these sensations.

From this point of view, all that is objective is devoid of all quality and is only pure relation. Certes, I shall not go so far as to say that objectivity is only pure quantity (this would be to particularize too far the nature of the relations in question), but we understand how some one could have been carried away into saying that the world is only a differential equation.

From this perspective, everything objective lacks any quality and is just pure relationship. Surely, I won’t go so far as to claim that objectivity is only pure quantity (that would be over-specifying the nature of the relationships involved), but it's easy to see how someone could have been led to say that the world is just a differential equation.

With due reserve regarding this paradoxical proposition, we must nevertheless admit that nothing is objective which is not transmissible, and consequently that the relations between the sensations can alone have an objective value.

With proper caution regarding this contradictory statement, we must still acknowledge that nothing is objective unless it can be communicated, and therefore, only the relationships between sensations can have an objective value.

Perhaps it will be said that the esthetic emotion, which is common to all mankind, is proof that the qualities of our sensations are also the same for all men and hence are objective. But if we think about this, we shall see that the proof is not complete; what is proved is that this emotion is aroused in John as in James by the sensations to which James and John give the same name or by the corresponding combinations of these sensations; either because this emotion is associated in John with the sensation A, which John calls red, while parallelly it is[Pg 349] associated in James with the sensation B, which James calls red; or better because this emotion is aroused, not by the qualities themselves of the sensations, but by the harmonious combination of their relations of which we undergo the unconscious impression.

Perhaps some might argue that the aesthetic emotion, which is shared by all humanity, proves that our sensory experiences are the same for everyone and therefore objective. However, if we reflect on this, we’ll see that the proof is incomplete; what it actually shows is that this emotion is triggered in John the same way it is in James by the sensations to which both give the same name or by the corresponding combinations of those sensations. This could be because this emotion is linked in John with the sensation A, which he calls red, while at the same time it is associated in James with the sensation B, which he also calls red. Or more accurately, this emotion is sparked not by the qualities of the sensations themselves but by the harmonious combination of their relationships that we experience unconsciously.

Such a sensation is beautiful, not because it possesses such a quality, but because it occupies such a place in the woof of our associations of ideas, so that it can not be excited without putting in motion the 'receiver' which is at the other end of the thread and which corresponds to the artistic emotion.

Such a feeling is beautiful, not because it has that quality, but because it holds a special place in the fabric of our ideas. It can't be triggered without activating the 'receiver' at the other end of the thread, which relates to the artistic emotion.

Whether we take the moral, the esthetic or the scientific point of view, it is always the same thing. Nothing is objective except what is identical for all; now we can only speak of such an identity if a comparison is possible, and can be translated into a 'money of exchange' capable of transmission from one mind to another. Nothing, therefore, will have objective value except what is transmissible by 'discourse,' that is, intelligible.

Whether we look at it from a moral, aesthetic, or scientific perspective, it all comes down to the same thing. Nothing is objective except what is the same for everyone; we can only talk about that sameness if a comparison is possible, and it can be expressed in a form that can be shared from one person to another. Therefore, nothing will have objective value except what can be conveyed through 'discourse,' meaning it is understandable.

But this is only one side of the question. An absolutely disordered aggregate could not have objective value since it would be unintelligible, but no more can a well-ordered assemblage have it, if it does not correspond to sensations really experienced. It seems to me superfluous to recall this condition, and I should not have dreamed of it, if it had not lately been maintained that physics is not an experimental science. Although this opinion has no chance of being adopted either by physicists or by philosophers, it is well to be warned so as not to let oneself slip over the declivity which would lead thither. Two conditions are therefore to be fulfilled, and if the first separates reality[11] from the dream, the second distinguishes it from the romance.

But this is just one side of the issue. An entirely disorganized mixture couldn’t have any objective value because it would be incomprehensible, but a well-organized collection also can't have it if it doesn't correspond to real sensations experienced. I find it unnecessary to bring this up, and I wouldn’t have thought to mention it if it hadn’t recently been argued that physics isn’t an experimental science. Although this view is unlikely to be accepted by either physicists or philosophers, it’s good to be cautious and not slip down the slope that could lead there. Therefore, two conditions must be met, and while the first separates reality[11] from dreams, the second differentiates it from fiction.

Now what is science? I have explained in the preceding article, it is before all a classification, a manner of bringing together facts which appearances separate, though they were bound together by some natural and hidden kinship. Science, in other words, is a system of relations. Now we have just said, it is in the relations alone that objectivity must be sought; it[Pg 350] would be vain to seek it in beings considered as isolated from one another.

Now, what is science? I explained in the previous article that, above all, it's about categorizing things, a way of connecting facts that seem separate but are actually linked by some natural and hidden relationship. In other words, science is a system of relationships. As we just mentioned, objectivity must be found solely in these relationships; it would be pointless to look for it in entities viewed as isolated from one another.

To say that science can not have objective value since it teaches us only relations, this is to reason backward, since, precisely, it is relations alone which can be regarded as objective.

To claim that science cannot have objective value because it only teaches us about relationships is to think backward, because it is precisely relationships that can be considered objective.

External objects, for instance, for which the word object was invented, are really objects and not fleeting and fugitive appearances, because they are not only groups of sensations, but groups cemented by a constant bond. It is this bond, and this bond alone, which is the object in itself, and this bond is a relation.

External objects, for example, for which the word object was created, are truly objects and not just temporary and elusive appearances because they aren't just collections of sensations, but collections connected by a constant relationship. It is this connection, and this connection alone, that is the object in itself, and this connection is a relationship.

Therefore, when we ask what is the objective value of science, that does not mean: Does science teach us the true nature of things? but it means: Does it teach us the true relations of things?

Therefore, when we ask what the objective value of science is, it doesn't mean: Does science teach us the true nature of things? but it means: Does it teach us the true relationships between things?

To the first question, no one would hesitate to reply, no; but I think we may go farther; not only science can not teach us the nature of things; but nothing is capable of teaching it to us, and if any god knew it, he could not find words to express it. Not only can we not divine the response, but if it were given to us we could understand nothing of it; I ask myself even whether we really understand the question.

To the first question, no one would hesitate to say no; but I think we can go further. Not only can science not teach us the nature of things, but nothing can truly teach it to us. Even if a god knew it, he wouldn't be able to find the words to express it. Not only can we not figure out the answer, but if it were given to us, we wouldn’t understand it at all. I even wonder if we really grasp the question itself.

When, therefore, a scientific theory pretends to teach us what heat is, or what is electricity, or life, it is condemned beforehand; all it can give us is only a crude image. It is, therefore, provisional and crumbling.

When a scientific theory claims to explain what heat is, or what electricity is, or what life is, it's already doomed to fail; all it can offer us is just a rough idea. It's, therefore, temporary and shaky.

The first question being out of reason, the second remains. Can science teach us the true relations of things? What it joins together should that be put asunder, what it puts asunder should that be joined together?

The first question being unreasonable, the second still stands. Can science teach us the true relationships between things? What it connects, should that be separated; what it separates, should that be connected?

To understand the meaning of this new question, it is needful to refer to what was said above on the conditions of objectivity. Have these relations an objective value? That means: Are these relations the same for all? Will they still be the same for those who shall come after us?

To understand the meaning of this new question, it's necessary to refer to what was mentioned earlier about the conditions of objectivity. Do these relationships have objective value? That means: Are these relationships the same for everyone? Will they still be the same for those who come after us?

It is clear that they are not the same for the scientist and the ignorant person. But that is unimportant, because if the ignorant person does not see them all at once, the scientist may succeed in making him see them by a series of experiments and[Pg 351] reasonings. The thing essential is that there are points on which all those acquainted with the experiments made can reach accord.

It’s obvious that the understanding differs between the scientist and someone who lacks knowledge. However, that doesn’t really matter because if the uninformed person can’t grasp everything at once, the scientist can help them understand through a series of experiments and[Pg 351]reasoning. What truly matters is that there are aspects where everyone familiar with the experiments can agree.

The question is to know whether this accord will be durable and whether it will persist for our successors. It may be asked whether the unions that the science of to-day makes will be confirmed by the science of to-morrow. To affirm that it will be so we can not invoke any a priori reason; but this is a question of fact, and science has already lived long enough for us to be able to find out by asking its history whether the edifices it builds stand the test of time, or whether they are only ephemeral constructions.

The question is whether this agreement will last and if it will continue for future generations. We might wonder if the connections established by today’s science will be validated by tomorrow’s science. We can't assume that will be the case based on any a priori reasoning; it’s a matter of fact. Science has already existed long enough for us to look at its history and determine whether the structures it creates endure through time or if they are just temporary constructs.

Now what do we see? At the first blush, it seems to us that the theories last only a day and that ruins upon ruins accumulate. To-day the theories are born, to-morrow they are the fashion, the day after to-morrow they are classic, the fourth day they are superannuated, and the fifth they are forgotten. But if we look more closely, we see that what thus succumb are the theories properly so called, those which pretend to teach us what things are. But there is in them something which usually survives. If one of them taught us a true relation, this relation is definitively acquired, and it will be found again under a new disguise in the other theories which will successively come to reign in place of the old.

Now what do we see? At first glance, it seems like theories last only a day and that layers upon layers of ruins accumulate. Today, theories are born; tomorrow they're in vogue, the day after tomorrow they’re considered classic, by the fourth day they’re out of date, and by the fifth they’re forgotten. But if we look more closely, we see that what fails are the theories themselves, those that claim to teach us what things are. However, there’s something in them that usually survives. If one of them reveals a true relationship, that relationship is definitively established, and it will reappear under a new guise in the other theories that will eventually take the place of the old ones.

Take only a single example: The theory of the undulations of the ether taught us that light is a motion; to-day fashion favors the electromagnetic theory which teaches us that light is a current. We do not consider whether we could reconcile them and say that light is a current, and that this current is a motion. As it is probable in any case that this motion would not be identical with that which the partisans of the old theory presume, we might think ourselves justified in saying that this old theory is dethroned. And yet something of it remains, since between the hypothetical currents which Maxwell supposes there are the same relations as between the hypothetical motions that Fresnel supposed. There is, therefore, something which remains over and this something is the essential. This it is which explains how we see the present physicists pass without any embarrassment from the language of Fresnel to that of Maxwell. Doubtless[Pg 352] many connections that were believed well established have been abandoned, but the greatest number remain and it would seem must remain.

Take just one example: The theory of ether waves taught us that light is a motion; today, the trend is leaning towards the electromagnetic theory, which teaches us that light is a current. We don’t consider whether we could combine them and say that light is a current, and that this current is a motion. It's likely that this motion wouldn't be the same as what supporters of the old theory assume, so we might feel justified in saying that the old theory is overthrown. Yet, some aspects of it remain, because between the hypothetical currents that Maxwell suggests, there are the same relationships as those between the hypothetical motions that Fresnel proposed. Therefore, something remains, and this something is essential. This is what helps explain how current physicists can smoothly shift from Fresnel’s language to Maxwell’s without any awkwardness. Certainly, many connections that were once considered well-established have been discarded, but most remain, and it seems they must stay.

And for these, then, what is the measure of their objectivity? Well, it is precisely the same as for our belief in external objects. These latter are real in this, that the sensations they make us feel appear to us as united to each other by I know not what indestructible cement and not by the hazard of a day. In the same way science reveals to us between phenomena other bonds finer but not less solid; these are threads so slender that they long remained unperceived, but once noticed there remains no way of not seeing them; they are therefore not less real than those which give their reality to external objects; small matter that they are more recently known, since neither can perish before the other.

And for these, what is the standard of their objectivity? Well, it's exactly the same as our belief in external objects. The latter are real in that the sensations they create feel connected in a way that seems unbreakable, not just a random occurrence. Similarly, science uncovers relationships between phenomena that are more subtle but equally strong; these are such fine threads that they went unnoticed for a long time, but once you see them, you can't unsee them. Therefore, they are just as real as those that give reality to external objects. It's no big deal that they were discovered more recently, since neither can disappear before the other.

It may be said, for instance, that the ether is no less real than any external body; to say this body exists is to say there is between the color of this body, its taste, its smell, an intimate bond, solid and persistent; to say the ether exists is to say there is a natural kinship between all the optical phenomena, and neither of the two propositions has less value than the other.

It can be argued, for example, that the ether is just as real as any physical object; to say that this object exists is to acknowledge the close and lasting connection between its color, taste, and smell; to say the ether exists is to affirm the natural relationship among all optical phenomena, and neither statement is any less valuable than the other.

And the scientific syntheses have in a sense even more reality than those of the ordinary senses, since they embrace more terms and tend to absorb in them the partial syntheses.

And scientific syntheses have an even greater sense of reality than those from the ordinary senses because they encompass more factors and tend to incorporate the partial syntheses.

It will be said that science is only a classification and that a classification can not be true, but convenient. But it is true that it is convenient, it is true that it is so not only for me, but for all men; it is true that it will remain convenient for our descendants; it is true finally that this can not be by chance.

It might be argued that science is just a way of organizing things and that such organization can't be truly accurate, only useful. But it is indeed useful, and not just for me, but for everyone; it will continue to be helpful for future generations as well; and ultimately, it’s clear that this isn't just a coincidence.

In sum, the sole objective reality consists in the relations of things whence results the universal harmony. Doubtless these relations, this harmony, could not be conceived outside of a mind which conceives them. But they are nevertheless objective because they are, will become, or will remain, common to all thinking beings.

In short, the only objective reality is the relationships between things that lead to universal harmony. Of course, these relationships and this harmony can’t be imagined without a mind that understands them. However, they are still objective because they are, will be, or have always been common to all thinking beings.

This will permit us to revert to the question of the rotation of the earth which will give us at the same time a chance to make clear what precedes by an example.

This will allow us to return to the question of the Earth's rotation, which will also give us an opportunity to clarify what came before with an example.

7. The Rotation of the Earth

"... Therefore," have I said in Science and Hypothesis, "this affirmation, the earth turns round, has no meaning ... or rather these two propositions, the earth turns round, and, it is more convenient to suppose that the earth turns round, have one and the same meaning."

"... Therefore," I said in Science and Hypothesis, "this statement, that the earth rotates, has no real meaning ... or rather, these two ideas, the earth rotates, and it’s easier to assume that the earth rotates, have the same meaning."

These words have given rise to the strangest interpretations. Some have thought they saw in them the rehabilitation of Ptolemy's system, and perhaps the justification of Galileo's condemnation.

These words have led to the oddest interpretations. Some have believed they found in them the revival of Ptolemy's system, and maybe even the justification for Galileo's condemnation.

Those who had read attentively the whole volume could not, however, delude themselves. This truth, the earth turns round, was put on the same footing as Euclid's postulate, for example. Was that to reject it? But better; in the same language it may very well be said: These two propositions, the external world exists, or, it is more convenient to suppose that it exists, have one and the same meaning. So the hypothesis of the rotation of the earth would have the same degree of certitude as the very existence of external objects.

Those who carefully read the entire book couldn't fool themselves. This truth, that the earth revolves, was treated the same way as Euclid's postulate, for example. Was that meant to dismiss it? But more so; in the same way, it could be argued: These two statements, that the external world exists, or that it's easier to assume it exists, mean the same thing. So the idea of the earth's rotation would have the same level of certainty as the existence of external objects.

But after what we have just explained in the fourth part, we may go farther. A physical theory, we have said, is by so much the more true as it puts in evidence more true relations. In the light of this new principle, let us examine the question which occupies us.

But after what we just explained in the fourth part, we can go further. A physical theory, as we've mentioned, is more accurate the more true relationships it highlights. In light of this new principle, let's take a closer look at the question we're discussing.

No, there is no absolute space; these two contradictory propositions: 'The earth turns round' and 'The earth does not turn round' are, therefore, neither of them more true than the other. To affirm one while denying the other, in the kinematic sense, would be to admit the existence of absolute space.

No, there isn’t any absolute space; these two conflicting statements: 'The earth rotates' and 'The earth does not rotate' are, therefore, neither one more true than the other. To assert one while rejecting the other, in the kinematic sense, would mean accepting the existence of absolute space.

But if the one reveals true relations that the other hides from us, we can nevertheless regard it as physically more true than the other, since it has a richer content. Now in this regard no doubt is possible.

But if one reveals the true connections that the other keeps hidden from us, we can still consider it to be more accurate than the other, since it has a richer meaning. In this respect, there is no doubt.

Behold the apparent diurnal motion of the stars, and the diurnal motion of the other heavenly bodies, and besides, the flattening of the earth, the rotation of Foucault's pendulum, the gyration of cyclones, the trade-winds, what not else? For the[Pg 354] Ptolemaist all these phenomena have no bond between them; for the Copernican they are produced by the one same cause. In saying, the earth turns round, I affirm that all these phenomena have an intimate relation, and that is true, and that remains true, although there is not and can not be absolute space.

Look at the obvious daily movement of the stars and the daily motion of other celestial bodies. Also, consider the earth's flattening, the rotation of Foucault's pendulum, the spinning of cyclones, the trade winds, and so on. For the[Pg 354] Ptolemaists, all these events are unrelated; for the Copernicans, they all come from the same cause. By saying that the earth rotates, I assert that all these phenomena are closely connected, and that is true, and that remains true, even though there is no such thing as absolute space.

So much for the rotation of the earth upon itself; what shall we say of its revolution around the sun? Here again, we have three phenomena which for the Ptolemaist are absolutely independent and which for the Copernican are referred back to the same origin; they are the apparent displacements of the planets on the celestial sphere, the aberration of the fixed stars, the parallax of these same stars. Is it by chance that all the planets admit an inequality whose period is a year, and that this period is precisely equal to that of aberration, precisely equal besides to that of parallax? To adopt Ptolemy's system is to answer, yes; to adopt that of Copernicus is to answer, no; this is to affirm that there is a bond between the three phenomena, and that also is true, although there is no absolute space.

So much for the Earth's rotation on its axis; what about its orbit around the sun? Again, we have three phenomena that, for the Ptolemaic view, are completely separate, while for the Copernican view, they all come from the same source. These are the apparent movements of the planets in the sky, the aberration of the fixed stars, and the parallax of those same stars. Is it just a coincidence that all the planets have an irregularity with a cycle of one year, and that this cycle is exactly the same as the aberration cycle, which is also the same as the parallax cycle? To choose Ptolemy's system is to say yes; to choose Copernicus's is to say no; this affirms that there is a connection between the three phenomena, and that is also true, even though there is no absolute space.

In Ptolemy's system, the motions of the heavenly bodies can not be explained by the action of central forces, celestial mechanics is impossible. The intimate relations that celestial mechanics reveals to us between all the celestial phenomena are true relations; to affirm the immobility of the earth would be to deny these relations, that would be to fool ourselves.

In Ptolemy's system, the movements of the heavenly bodies can't be explained by central forces; celestial mechanics just doesn't work. The close connections that celestial mechanics shows us between all celestial phenomena are real connections; claiming that the Earth is stationary would mean denying these connections, which would be self-deception.

The truth for which Galileo suffered remains, therefore, the truth, although it has not altogether the same meaning as for the vulgar, and its true meaning is much more subtle, more profound and more rich.

The truth that Galileo endured for still stands, but it doesn't quite have the same meaning for everyone, and its real significance is much more nuanced, deeper, and richer.

8. Science for Its Own Sake

Not against M. LeRoy do I wish to defend science for its own sake; maybe this is what he condemns, but this is what he cultivates, since he loves and seeks truth and could not live without it. But I have some thoughts to express.

Not against M. LeRoy do I wish to defend science for its own sake; maybe this is what he criticizes, but this is what he nurtures, since he loves and seeks truth and couldn't live without it. But I have some thoughts to share.

We can not know all facts and it is necessary to choose those which are worthy of being known. According to Tolstoi, scientists make this choice at random, instead of making it, which would be reasonable, with a view to practical applications. On[Pg 355] the contrary, scientists think that certain facts are more interesting than others, because they complete an unfinished harmony, or because they make one foresee a great number of other facts. If they are wrong, if this hierarchy of facts that they implicitly postulate is only an idle illusion, there could be no science for its own sake, and consequently there could be no science. As for me, I believe they are right, and, for example, I have shown above what is the high value of astronomical facts, not because they are capable of practical applications, but because they are the most instructive of all.

We can't know every fact, so we have to choose the ones worth knowing. According to Tolstoy, scientists make this choice randomly instead of doing it logically for practical purposes. On[Pg 355] the other hand, scientists believe some facts are more interesting than others because they help complete an unfinished picture or allow us to predict a lot of other facts. If they’re wrong, and if this hierarchy of facts they assume is just a pointless illusion, then science wouldn't exist for its own sake, and as a result, there wouldn't be science at all. Personally, I believe they’re right, and for instance, I've shown above the great value of astronomical facts—not because they can be practically applied, but because they are the most enlightening of all.

It is only through science and art that civilization is of value. Some have wondered at the formula: science for its own sake; and yet it is as good as life for its own sake, if life is only misery; and even as happiness for its own sake, if we do not believe that all pleasures are of the same quality, if we do not wish to admit that the goal of civilization is to furnish alcohol to people who love to drink.

It’s only through science and art that civilization has any value. Some have questioned the idea of pursuing science just for its own sake; however, it’s just as meaningful as living for its own sake if life is just suffering; and just as valid as seeking happiness for its own sake, if we don’t believe that all pleasures are equal, or if we don’t want to accept that the aim of civilization is to provide alcohol to those who enjoy drinking.

Every act should have an aim. We must suffer, we must work, we must pay for our place at the game, but this is for seeing's sake; or at the very least that others may one day see.

Every action should have a purpose. We have to endure, we have to put in the effort, we have to earn our spot in the game, but this is so that we can witness it; or at the very least, that others might see it one day.

All that is not thought is pure nothingness; since we can think only thoughts and all the words we use to speak of things can express only thoughts, to say there is something other than thought, is therefore an affirmation which can have no meaning.

All that isn't thought is pure nothingness; since we can only think thoughts and all the words we use to talk about things can only express thoughts, claiming there’s something other than thought is therefore a statement that makes no sense.

And yet—strange contradiction for those who believe in time—geologic history shows us that life is only a short episode between two eternities of death, and that, even in this episode, conscious thought has lasted and will last only a moment. Thought is only a gleam in the midst of a long night.

And yet—strange contradiction for those who believe in time—geologic history shows that life is just a brief moment between two eternities of death, and that, even during this moment, conscious thought has lasted and will last only for a fleeting time. Thought is just a flash in the midst of a long darkness.

But it is this gleam which is everything.

But this shimmer is everything.


 

SCIENCE AND METHOD

 


INTRODUCTION

I bring together here different studies relating more or less directly to questions of scientific methodology. The scientific method consists in observing and experimenting; if the scientist had at his disposal infinite time, it would only be necessary to say to him: 'Look and notice well'; but, as there is not time to see everything, and as it is better not to see than to see wrongly, it is necessary for him to make choice. The first question, therefore, is how he should make this choice. This question presents itself as well to the physicist as to the historian; it presents itself equally to the mathematician, and the principles which should guide each are not without analogy. The scientist conforms to them instinctively, and one can, reflecting on these principles, foretell the future of mathematics.

I’ve gathered various studies here that relate, more or less directly, to questions about scientific methodology. The scientific method involves observing and experimenting; if scientists had unlimited time, we could simply tell them: 'Look carefully.' But since we don’t have time to see everything and it's often better not to see than to see incorrectly, they have to make choices. So, the first question is how they should make those choices. This question applies to both physicists and historians; it’s equally relevant for mathematicians, and the principles guiding each are somewhat similar. Scientists naturally follow these principles, and by reflecting on them, we can predict the future of mathematics.

We shall understand them better yet if we observe the scientist at work, and first of all it is necessary to know the psychologic mechanism of invention and, in particular, that of mathematical creation. Observation of the processes of the work of the mathematician is particularly instructive for the psychologist.

We will understand them even better if we watch the scientist at work, and first, it’s important to understand the psychological process of invention, especially in terms of mathematical creation. Observing how a mathematician works is especially enlightening for psychologists.

In all the sciences of observation account must be taken of the errors due to the imperfections of our senses and our instruments. Luckily, we may assume that, under certain conditions, these errors are in part self-compensating, so as to disappear in the average; this compensation is due to chance. But what is chance? This idea is difficult to justify or even to define; and yet what I have just said about the errors of observation, shows that the scientist can not neglect it. It therefore is necessary to give a definition as precise as possible of this concept, so indispensable yet so illusive.

In all observational sciences, we need to consider the mistakes that come from the limitations of our senses and tools. Fortunately, we can assume that, under certain conditions, these mistakes tend to balance each other out and cancel out in the average; this balance occurs by chance. But what does chance really mean? This concept is hard to explain or even define; however, my earlier statement about observational errors shows that scientists cannot ignore it. Therefore, it’s essential to provide the most precise definition possible for this concept, which is crucial yet so elusive.

These are generalities applicable in sum to all the sciences; and for example the mechanism of mathematical invention does not differ sensibly from the mechanism of invention in general. Later I attack questions relating more particularly to certain special sciences and first to pure mathematics.[Pg 360]

These are broad principles that apply to all sciences; for instance, the process of mathematical invention is not significantly different from the process of invention in general. Later, I will address questions that are more specifically related to certain specialized sciences, starting with pure mathematics.[Pg 360]

In the chapters devoted to these, I have to treat subjects a little more abstract. I have first to speak of the notion of space; every one knows space is relative, or rather every one says so, but many think still as if they believed it absolute; it suffices to reflect a little however to perceive to what contradictions they are exposed.

In the chapters focused on these topics, I need to discuss some concepts that are a bit more abstract. First, I’ll talk about the idea of space; everyone says space is relative, but many still think of it as if it were absolute. However, if you think about it for a moment, you can see the contradictions they face.

The questions of teaching have their importance, first in themselves, then because reflecting on the best way to make new ideas penetrate virgin minds is at the same time reflecting on how these notions were acquired by our ancestors, and consequently on their true origin, that is to say, in reality on their true nature. Why do children usually understand nothing of the definitions which satisfy scientists? Why is it necessary to give them others? This is the question I set myself in the succeeding chapter and whose solution should, I think, suggest useful reflections to the philosophers occupied with the logic of the sciences.

The questions about teaching are important, both on their own and because considering the best ways to introduce new ideas to untainted minds also means reflecting on how our ancestors learned these concepts, and ultimately, what their true origins and nature are. Why do kids often not grasp the definitions that satisfy scientists? Why do we need to provide them with different ones? This is the question I will explore in the next chapter, and I believe its solution should prompt valuable insights for philosophers focused on the logic of the sciences.

On the other hand, many geometers believe we can reduce mathematics to the rules of formal logic. Unheard-of efforts have been made to do this; to accomplish it, some have not hesitated, for example, to reverse the historic order of the genesis of our conceptions and to try to explain the finite by the infinite. I believe I have succeeded in showing, for all those who attack the problem unprejudiced, that here there is a fallacious illusion. I hope the reader will understand the importance of the question and pardon me the aridity of the pages devoted to it.

On the flip side, many geometers think we can simplify mathematics down to the rules of formal logic. Unimaginable efforts have been made to achieve this; to succeed, some have not hesitated, for instance, to flip the historical order of how our concepts developed and to attempt to explain the finite through the infinite. I believe I have shown, for anyone tackling the problem with an open mind, that there is a misleading illusion at play here. I hope the reader grasps the significance of the question and forgives me for the dryness of the pages dedicated to it.

The concluding chapters relative to mechanics and astronomy will be easier to read.

The final chapters on mechanics and astronomy will be easier to understand.

Mechanics seems on the point of undergoing a complete revolution. Ideas which appeared best established are assailed by bold innovators. Certainly it would be premature to decide in their favor at once simply because they are innovators.

Mechanics seems ready for a complete revolution. Ideas that once seemed solid are being challenged by daring innovators. It would definitely be premature to immediately side with them just because they are bringing new ideas.

But it is of interest to make known their doctrines, and this is what I have tried to do. As far as possible I have followed the historic order; for the new ideas would seem too astonishing unless we saw how they arose.

But it's important to share their beliefs, and that’s what I've tried to do. I've followed the historical order as much as possible; the new ideas would seem too surprising if we didn't see how they developed.

Astronomy offers us majestic spectacles and raises gigantic problems. We can not dream of applying to them directly the[Pg 361] experimental method; our laboratories are too small. But analogy with phenomena these laboratories permit us to attain may nevertheless guide the astronomer. The Milky Way, for example, is an assemblage of suns whose movements seem at first capricious. But may not this assemblage be compared to that of the molecules of a gas, whose properties the kinetic theory of gases has made known to us? It is thus by a roundabout way that the method of the physicist may come to the aid of the astronomer.

Astronomy presents us with incredible sights and raises huge questions. We can’t directly apply the experimental method to them because our labs are too small. However, drawing parallels with phenomena that these labs allow us to study can still help guide astronomers. The Milky Way, for instance, is a collection of stars whose movements seem random at first. But can we not compare this collection to the molecules of a gas, whose properties we’ve learned about through the kinetic theory of gases? In this way, the physicist's method can indirectly assist the astronomer.

Finally I have endeavored to give in a few lines the history of the development of French geodesy; I have shown through what persevering efforts, and often what dangers, the geodesists have procured for us the knowledge we have of the figure of the earth. Is this then a question of method? Yes, without doubt, this history teaches us in fact by what precautions it is necessary to surround a serious scientific operation and how much time and pains it costs to conquer one new decimal.

Finally, I've tried to summarize the history of French geodesy in just a few lines. I've highlighted the persistent efforts and often the dangers that geodesists have faced to provide us with the understanding we have of the Earth's shape. So, is this a matter of method? Yes, undoubtedly. This history shows us the precautions needed for a serious scientific endeavor and how much time and effort it takes to achieve just one new decimal.


BOOK I

SCIENCE AND THE SCIENTIST


CHAPTER I

Choosing Facts

Tolstoi somewhere explains why 'science for its own sake' is in his eyes an absurd conception. We can not know all facts, since their number is practically infinite. It is necessary to choose; then we may let this choice depend on the pure caprice of our curiosity; would it not be better to let ourselves be guided by utility, by our practical and above all by our moral needs; have we nothing better to do than to count the number of lady-bugs on our planet?

Tolstoy explains somewhere why 'science for its own sake' seems absurd to him. We can’t know all facts, since there are practically infinite amounts. We have to make choices; we could let that choice be based on our curiosity, but wouldn’t it be better to be guided by what’s useful, by our practical and especially by our moral needs? Do we have nothing better to do than count the number of ladybugs on our planet?

It is clear the word utility has not for him the sense men of affairs give it, and following them most of our contemporaries. Little cares he for industrial applications, for the marvels of electricity or of automobilism, which he regards rather as obstacles to moral progress; utility for him is solely what can make man better.

It’s obvious that the word utility doesn’t have the same meaning for him as it does for businesspeople and most people today. He cares little about industrial applications, the wonders of electricity, or cars, which he sees more as barriers to moral progress; for him, utility is only what can improve humanity.

For my part, it need scarce be said, I could never be content with either the one or the other ideal; I want neither that plutocracy grasping and mean, nor that democracy goody and mediocre, occupied solely in turning the other cheek, where would dwell sages without curiosity, who, shunning excess, would not die of disease, but would surely die of ennui. But that is a matter of taste and is not what I wish to discuss.

For me, it hardly needs to be said, I could never be satisfied with either ideal. I want neither that grasping and petty plutocracy nor that goody-two-shoes and average democracy, focused only on turning the other cheek, where wise people lack curiosity, who, avoiding extremes, wouldn’t die from illness, but would definitely die from boredom. But that’s a matter of personal preference, and it’s not what I want to discuss.

The question nevertheless remains and should fix our attention; if our choice can only be determined by caprice or by immediate utility, there can be no science for its own sake, and consequently no science. But is that true? That a choice must be made is incontestable; whatever be our activity, facts go quicker than we, and we can not catch them; while the scientist discovers one fact,[Pg 363] there happen milliards of milliards in a cubic millimeter of his body. To wish to comprise nature in science would be to want to put the whole into the part.

The question still stands and deserves our attention; if our choices can only be driven by whim or immediate usefulness, then there can be no science for its own sake, and therefore no real science at all. But is that really true? It's undeniable that a choice must be made; no matter what we're doing, facts move faster than we do, and we can't grasp them all; while the scientist uncovers one fact,[Pg 363] billions of billions are happening in a cubic millimeter of his body. To try to fit all of nature into science would be like trying to fit the whole into just a part.

But scientists believe there is a hierarchy of facts and that among them may be made a judicious choice. They are right, since otherwise there would be no science, yet science exists. One need only open the eyes to see that the conquests of industry which have enriched so many practical men would never have seen the light, if these practical men alone had existed and if they had not been preceded by unselfish devotees who died poor, who never thought of utility, and yet had a guide far other than caprice.

But scientists believe there’s a hierarchy of facts and that we can make smart choices among them. They’re correct because without that, there would be no science, but science does exist. Just look around to see that the achievements of the industry that have benefited so many practical people would never have happened if those practical people were the only ones around and if they hadn’t been preceded by selfless individuals who died in poverty, never thinking about practicality, yet had a purpose that went beyond whim.

As Mach says, these devotees have spared their successors the trouble of thinking. Those who might have worked solely in view of an immediate application would have left nothing behind them, and, in face of a new need, all must have been begun over again. Now most men do not love to think, and this is perhaps fortunate when instinct guides them, for most often, when they pursue an aim which is immediate and ever the same, instinct guides them better than reason would guide a pure intelligence. But instinct is routine, and if thought did not fecundate it, it would no more progress in man than in the bee or ant. It is needful then to think for those who love not thinking, and, as they are numerous, it is needful that each of our thoughts be as often useful as possible, and this is why a law will be the more precious the more general it is.

As Mach points out, these followers have saved their successors the trouble of thinking. Those who might have worked only for immediate results would have left nothing behind, and with every new need, everything would have to start all over again. Most people don't enjoy thinking, and this might actually be a good thing when their instincts guide them, because often, when they aim for something immediate and constant, their instincts lead them better than pure reasoning would guide an intelligent mind. However, instinct is just routine, and if thought doesn't enrich it, it won't progress in humans any more than it does in bees or ants. So, it's essential to think for those who don't like thinking, and since they are many, it's important that each of our thoughts be as useful as possible, which is why a law becomes more valuable the more general it is.

This shows us how we should choose: the most interesting facts are those which may serve many times; these are the facts which have a chance of coming up again. We have been so fortunate as to be born in a world where there are such. Suppose that instead of 60 chemical elements there were 60 milliards of them, that they were not some common, the others rare, but that they were uniformly distributed. Then, every time we picked up a new pebble there would be great probability of its being formed of some unknown substance; all that we knew of other pebbles would be worthless for it; before each new object we should be as the new-born babe; like it we could only obey our caprices or our needs. Biologists would be just as much at a loss if there were only individuals and no species and if heredity did not make sons like their fathers.[Pg 364]

This illustrates how we should make our choices: the most intriguing facts are those that can be useful multiple times; these are the facts that are likely to recur. We are fortunate to be born into a world where such facts exist. Imagine if instead of 60 chemical elements, there were 60 billion of them, and they were evenly distributed, with some being common and others rare. In that case, every time we picked up a new pebble, there would be a high probability it would be made of some unknown substance; all our knowledge about other pebbles would be useless for it. We would face each new object like a newborn baby, able only to follow our whims or needs. Biologists would struggle just as much if there were only individuals and no species, and if heredity didn't make children similar to their parents.[Pg 364]

In such a world there would be no science; perhaps thought and even life would be impossible, since evolution could not there develop the preservational instincts. Happily it is not so; like all good fortune to which we are accustomed, this is not appreciated at its true worth.

In such a world, there would be no science; maybe thought and even life would be impossible, since evolution couldn't develop the instincts for preservation. Thankfully, that's not the case; like all good fortune we're used to, we don't appreciate it at its true value.

Which then are the facts likely to reappear? They are first the simple facts. It is clear that in a complex fact a thousand circumstances are united by chance, and that only a chance still much less probable could reunite them anew. But are there any simple facts? And if there are, how recognize them? What assurance is there that a thing we think simple does not hide a dreadful complexity? All we can say is that we ought to prefer the facts which seem simple to those where our crude eye discerns unlike elements. And then one of two things: either this simplicity is real, or else the elements are so intimately mingled as not to be distinguishable. In the first case there is chance of our meeting anew this same simple fact, either in all its purity or entering itself as element in a complex manifold. In the second case this intimate mixture has likewise more chances of recurring than a heterogeneous assemblage; chance knows how to mix, it knows not how to disentangle, and to make with multiple elements a well-ordered edifice in which something is distinguishable, it must be made expressly. The facts which appear simple, even if they are not so, will therefore be more easily revived by chance. This it is which justifies the method instinctively adopted by the scientist, and what justifies it still better, perhaps, is that oft-recurring facts appear to us simple, precisely because we are used to them.

Which facts are likely to come back? First, there are the simple facts. It's clear that in a complex fact, a thousand circumstances come together by chance, and only an even less likely chance could bring them together again. But are there any simple facts? And if there are, how do we recognize them? What guarantee do we have that something we think is simple doesn’t hide a terrible complexity? All we can say is that we should prefer facts that seem simple to those where our unrefined perception picks out different elements. And then it’s either one of two things: either this simplicity is real, or the elements are so closely mixed that they can’t be distinguished. In the first case, there’s a chance we might encounter this same simple fact again, either in its pure form or as part of a complex system. In the second case, this close mixture is also more likely to recur than a disordered collection; chance knows how to mix but doesn’t know how to untangle, and to create a well-ordered structure from multiple elements where something can be distinguished, it must be done intentionally. Therefore, the facts that seem simple, even if they aren’t, will be more easily revived by chance. This is what justifies the instinctive method adopted by scientists, and what further justifies it, perhaps, is that frequently occurring facts appear simple to us, precisely because we’re accustomed to them.

But where is the simple fact? Scientists have been seeking it in the two extremes, in the infinitely great and in the infinitely small. The astronomer has found it because the distances of the stars are immense, so great that each of them appears but as a point, so great that the qualitative differences are effaced, and because a point is simpler than a body which has form and qualities. The physicist on the other hand has sought the elementary phenomenon in fictively cutting up bodies into infinitesimal cubes, because the conditions of the problem, which undergo slow and continuous variation in passing from one point of the[Pg 365] body to another, may be regarded as constant in the interior of each of these little cubes. In the same way the biologist has been instinctively led to regard the cell as more interesting than the whole animal, and the outcome has shown his wisdom, since cells belonging to organisms the most different are more alike, for the one who can recognize their resemblances, than are these organisms themselves. The sociologist is more embarrassed; the elements, which for him are men, are too unlike, too variable, too capricious, in a word, too complex; besides, history never begins over again. How then choose the interesting fact, which is that which begins again? Method is precisely the choice of facts; it is needful then to be occupied first with creating a method, and many have been imagined, since none imposes itself, so that sociology is the science which has the most methods and the fewest results.

But where is the simple truth? Scientists have been searching for it in two extremes: the infinitely large and the infinitely small. The astronomer has found it because the distances of the stars are so immense that each one appears only as a point, so vast that the qualitative differences vanish, and because a point is simpler than a solid object with shape and qualities. The physicist, on the other hand, has looked for fundamental phenomena by breaking bodies down into tiny cubes, since the problem conditions, which change slowly and continuously from one point of the body to another, can be seen as constant within each of these small cubes. Similarly, the biologist has naturally focused on the cell as more interesting than the entire animal, and this has proven wise, as cells from very different organisms are more alike to someone who can recognize their similarities than the organisms themselves are. The sociologist faces more challenges; for him, the elements, which are people, are too different, too variable, too unpredictable—in short, too complex. Plus, history never resets. So how can one select the interesting fact, which is the one that repeats itself? Method is essentially about choosing facts; therefore, it is necessary to first develop a method, and many have been conceived, since none stands out as definitive, resulting in sociology being the science with the most methods and the fewest results.

Therefore it is by the regular facts that it is proper to begin; but after the rule is well established, after it is beyond all doubt, the facts in full conformity with it are erelong without interest since they no longer teach us anything new. It is then the exception which becomes important. We cease to seek resemblances; we devote ourselves above all to the differences, and among the differences are chosen first the most accentuated, not only because they are the most striking, but because they will be the most instructive. A simple example will make my thought plainer: Suppose one wishes to determine a curve by observing some of its points. The practician who concerns himself only with immediate utility would observe only the points he might need for some special object. These points would be badly distributed on the curve; they would be crowded in certain regions, rare in others, so that it would be impossible to join them by a continuous line, and they would be unavailable for other applications. The scientist will proceed differently; as he wishes to study the curve for itself, he will distribute regularly the points to be observed, and when enough are known he will join them by a regular line and then he will have the entire curve. But for that how does he proceed? If he has determined an extreme point of the curve, he does not stay near this extremity, but goes first to the other end; after the two extremities the most instructive point will be the mid-point, and so on.[Pg 366]

So, it’s important to start with the regular facts; however, once the rule is firmly established and beyond doubt, those facts that align with it quickly lose their interest since they no longer reveal anything new. At that point, the exception becomes significant. We stop looking for similarities; we focus instead on the differences, prioritizing the most pronounced ones, not just because they stand out, but because they'll be the most informative. A simple example will clarify my point: Imagine you want to define a curve by observing certain points on it. A practitioner focused solely on immediate utility would only look at the points he needs for a specific purpose. These points would be unevenly spaced on the curve—crowded in some areas and sparse in others—making it impossible to connect them with a continuous line, and they wouldn't be useful for other applications. The scientist, however, approaches this differently; he aims to study the curve as a whole, so he distributes the points to observe evenly. Once he has enough data, he connects them with a regular line to reveal the entire curve. But how does he do this? If he identifies an extreme point of the curve, he doesn’t linger there; instead, he first explores the other end. Following the two extremes, the most valuable point will be the midpoint, and so forth.[Pg 366]

So when a rule is established we should first seek the cases where this rule has the greatest chance of failing. Thence, among other reasons, come the interest of astronomic facts, and the interest of the geologic past; by going very far away in space or very far away in time, we may find our usual rules entirely overturned, and these grand overturnings aid us the better to see or the better to understand the little changes which may happen nearer to us, in the little corner of the world where we are called to live and act. We shall better know this corner for having traveled in distant countries with which we have nothing to do.

So when we establish a rule, we should first look for the situations where that rule is most likely to fail. For various reasons, this leads us to be interested in astronomical facts and the distant geological past; by exploring far away in space or time, we might find that our usual rules don't apply, and these significant shifts help us understand the small changes that can happen closer to home, in the small corner of the world where we live and act. We'll have a better understanding of this corner because of our travels to distant places that seem unrelated to us.

But what we ought to aim at is less the ascertainment of resemblances and differences than the recognition of likenesses hidden under apparent divergences. Particular rules seem at first discordant, but looking more closely we see in general that they resemble each other; different as to matter, they are alike as to form, as to the order of their parts. When we look at them with this bias, we shall see them enlarge and tend to embrace everything. And this it is which makes the value of certain facts which come to complete an assemblage and to show that it is the faithful image of other known assemblages.

But what we should focus on is not just identifying similarities and differences, but recognizing the likenesses that hide beneath apparent differences. Specific rules might seem to clash at first, but when we examine them more closely, we notice that they generally share similarities; different in content, they are similar in structure and the arrangement of their components. Viewing them this way allows us to see them expand and begin to include everything. This is what gives certain facts their value, as they help create a collection that accurately reflects other known collections.

I will not further insist, but these few words suffice to show that the scientist does not choose at random the facts he observes. He does not, as Tolstoi says, count the lady-bugs, because, however interesting lady-bugs may be, their number is subject to capricious variations. He seeks to condense much experience and much thought into a slender volume; and that is why a little book on physics contains so many past experiences and a thousand times as many possible experiences whose result is known beforehand.

I won’t push the issue, but these few words are enough to show that a scientist doesn’t randomly pick the facts they observe. They don’t, as Tolstoi puts it, just count ladybugs because, no matter how interesting ladybugs are, their numbers can vary unpredictably. Instead, they aim to distill extensive experience and deep thought into a concise volume; that’s why a small book on physics includes so many past experiences and a thousand times more potential experiences with known outcomes.

But we have as yet looked at only one side of the question. The scientist does not study nature because it is useful; he studies it because he delights in it, and he delights in it because it is beautiful. If nature were not beautiful, it would not be worth knowing, and if nature were not worth knowing, life would not be worth living. Of course I do not here speak of that beauty which strikes the senses, the beauty of qualities and of appearances; not that I undervalue such beauty, far from it, but it has nothing to do with science; I mean that profounder beauty which[Pg 367] comes from the harmonious order of the parts and which a pure intelligence can grasp. This it is which gives body, a structure so to speak, to the iridescent appearances which flatter our senses, and without this support the beauty of these fugitive dreams would be only imperfect, because it would be vague and always fleeting. On the contrary, intellectual beauty is sufficient unto itself, and it is for its sake, more perhaps than for the future good of humanity, that the scientist devotes himself to long and difficult labors.

But we have only looked at one side of the issue so far. The scientist doesn’t study nature because it’s useful; he studies it because he finds joy in it, and he finds joy in it because it’s beautiful. If nature weren’t beautiful, it wouldn’t be worth knowing, and if nature weren’t worth knowing, life wouldn’t be worth living. Of course, I’m not talking about the kind of beauty that catches the eye, the beauty of qualities and appearances; not that I undervalue such beauty—far from it—but it has nothing to do with science. I mean that deeper beauty which[Pg 367] comes from the harmonious order of its parts and can be grasped by pure intelligence. This is what gives substance, a structure so to speak, to the sparkling appearances that appeal to our senses, and without this foundation, the beauty of these fleeting dreams would be incomplete because it would be vague and always changing. In contrast, intellectual beauty is self-sufficient, and it’s for its sake, perhaps more than for the future good of humanity, that the scientist commits to long and challenging work.

It is, therefore, the quest of this especial beauty, the sense of the harmony of the cosmos, which makes us choose the facts most fitting to contribute to this harmony, just as the artist chooses from among the features of his model those which perfect the picture and give it character and life. And we need not fear that this instinctive and unavowed prepossession will turn the scientist aside from the search for the true. One may dream a harmonious world, but how far the real world will leave it behind! The greatest artists that ever lived, the Greeks, made their heavens; how shabby it is beside the true heavens, ours!

It is, therefore, the pursuit of this particular kind of beauty, the appreciation of the harmony of the universe, that leads us to select the facts that best contribute to this harmony, just as an artist chooses from the features of their model those that enhance the artwork and give it character and life. And we shouldn’t worry that this instinctive and unspoken bias will distract the scientist from the quest for truth. One can envision a harmonious world, but how much more distant the real world can be! The greatest artists of all time, the Greeks, created their heavens; how inferior they seem compared to our actual heavens!

And it is because simplicity, because grandeur, is beautiful, that we preferably seek simple facts, sublime facts, that we delight now to follow the majestic course of the stars, now to examine with the microscope that prodigious littleness which is also a grandeur, now to seek in geologic time the traces of a past which attracts because it is far away.

And it's because simplicity and greatness are beautiful that we often look for simple truths and extraordinary facts. We enjoy watching the glorious path of the stars and examining, with a microscope, the amazing smallness that is also magnificent. We seek, in geological time, the signs of a distant past that fascinates us simply because it is so far away.

We see too that the longing for the beautiful leads us to the same choice as the longing for the useful. And so it is that this economy of thought, this economy of effort, which is, according to Mach, the constant tendency of science, is at the same time a source of beauty and a practical advantage. The edifices that we admire are those where the architect has known how to proportion the means to the end, where the columns seem to carry gaily, without effort, the weight placed upon them, like the gracious caryatids of the Erechtheum.

We also notice that the desire for beauty leads us to make the same choices as the desire for usefulness. Thus, this economy of thought and effort, which Mach sees as the constant tendency of science, is also a source of beauty and practical benefit. The structures we admire are those where the architect has successfully balanced the means with the end, where the columns appear to effortlessly support the weight on them, just like the elegant caryatids of the Erechtheum.

Whence comes this concordance? Is it simply that the things which seem to us beautiful are those which best adapt themselves to our intelligence, and that consequently they are at the same time the implement this intelligence knows best how to use?[Pg 368] Or is there here a play of evolution and natural selection? Have the peoples whose ideal most conformed to their highest interest exterminated the others and taken their place? All pursued their ideals without reference to consequences, but while this quest led some to destruction, to others it gave empire. One is tempted to believe it. If the Greeks triumphed over the barbarians and if Europe, heir of Greek thought, dominates the world, it is because the savages loved loud colors and the clamorous tones of the drum which occupied only their senses, while the Greeks loved the intellectual beauty which hides beneath sensuous beauty, and this intellectual beauty it is which makes intelligence sure and strong.

Where does this agreement come from? Is it just that the things we find beautiful are the ones that best fit our understanding, and therefore they are also the tools our intelligence knows how to use best?[Pg 368] Or is this a result of evolution and natural selection? Did the societies whose ideals aligned most closely with their greatest interests eliminate the others and take their place? All pursued their ideals without considering the outcomes, but while this pursuit led some to ruin, it granted others power. One might be inclined to think so. If the Greeks beat the barbarians and if Europe, the successor of Greek thought, rules the world, it’s because the savages were drawn to bright colors and the loud sounds of drums that engaged only their senses, while the Greeks appreciated the intellectual beauty that lies beneath superficial beauty, and it's this intellectual beauty that makes intelligence reliable and strong.

Doubtless such a triumph would horrify Tolstoi, and he would not like to acknowledge that it might be truly useful. But this disinterested quest of the true for its own beauty is sane also and able to make man better. I well know that there are mistakes, that the thinker does not always draw thence the serenity he should find therein, and even that there are scientists of bad character. Must we, therefore, abandon science and study only morals? What! Do you think the moralists themselves are irreproachable when they come down from their pedestal?

No doubt such a success would horrify Tolstoi, and he wouldn’t want to admit that it could actually be useful. But this selfless pursuit of truth for its own beauty is also rational and can improve humanity. I know that there are mistakes, that thinkers don’t always find the peace they should from their work, and that some scientists have poor character. Should we then give up on science and only study ethics? What! Do you really believe that moralists themselves are beyond reproach when they step down from their pedestal?


CHAPTER II

The Future of Math

To foresee the future of mathematics, the true method is to study its history and its present state.

To predict the future of mathematics, the best approach is to look at its history and current status.

Is this not for us mathematicians in a way a professional procedure? We are accustomed to extrapolate, which is a means of deducing the future from the past and present, and as we well know what this amounts to, we run no risk of deceiving ourselves about the range of the results it gives us.

Is this not, in a way, a professional routine for us mathematicians? We are used to extrapolating, which is a method of predicting the future based on the past and present. Since we understand what this really means, we aren't at risk of fooling ourselves about the extent of the results it provides us.

We have had hitherto prophets of evil. They blithely reiterate that all problems capable of solution have already been solved, and that nothing is left but gleaning. Happily the case of the past reassures us. Often it was thought all problems were solved or at least an inventory was made of all admitting solution. And then the sense of the word solution enlarged, the insoluble problems became the most interesting of all, and others unforeseen presented themselves. For the Greeks a good solution was one employing only ruler and compasses; then it became one obtained by the extraction of roots, then one using only algebraic or logarithmic functions. The pessimists thus found themselves always outflanked, always forced to retreat, so that at present I think there are no more.

We’ve had our share of doomsayers. They confidently claim that all solvable problems have already been addressed, and that there’s nothing left but to pick up the pieces. Thankfully, history proves otherwise. In the past, people often believed that all problems had been solved, or at least that they could identify which ones could be solved. Then, the definition of a solution expanded, the really tough problems turned out to be the most fascinating, and new challenges emerged unexpectedly. For the Greeks, a good solution required just a ruler and compass; then it evolved to include finding square roots, and later, using algebraic or logarithmic functions. The pessimists found themselves consistently outmaneuvered and forced to back down, so I believe there are hardly any left today.

My intention, therefore, is not to combat them, as they are dead; we well know that mathematics will continue to develop, but the question is how, in what direction? You will answer, 'in every direction,' and that is partly true; but if it were wholly true it would be a little appalling. Our riches would soon become encumbering and their accumulation would produce a medley as impenetrable as the unknown true was for the ignorant.

My goal, then, isn’t to argue against them since they’re gone; we know that mathematics will keep evolving, but the real question is how and in what direction? You might say, “in every direction,” and that’s partly correct; but if it were completely true, it would be a bit frightening. Our wealth of knowledge could quickly become overwhelming, and its accumulation would create a confusion as difficult to navigate as the unknown was for those who didn’t understand it.

The historian, the physicist, even, must make a choice among facts; the head of the scientist, which is only a corner of the universe, could never contain the universe entire; so that among the innumerable facts nature offers, some will be passed by, others retained.[Pg 370]

The historian, the physicist, and others must choose from facts; the scientist's mind, which is just a small part of the universe, can never hold the whole universe. So, among the countless facts that nature presents, some will be overlooked, while others will be chosen.[Pg 370]

Just so, a fortiori, in mathematics; no more can the geometer hold fast pell-mell all the facts presenting themselves to him; all the more because he it is, almost I had said his caprice, that creates these facts. He constructs a wholly new combination by putting together its elements; nature does not in general give it to him ready made.

Just like that, a fortiori, in mathematics; a geometer can’t just grab all the facts that come to him without thinking; especially since he’s the one, or rather his whims, who creates these facts. He builds a completely new combination by assembling its components; nature usually doesn’t provide it to him completed.

Doubtless it sometimes happens that the mathematician undertakes a problem to satisfy a need in physics; that the physicist or engineer asks him to calculate a number for a certain application. Shall it be said that we geometers should limit ourselves to awaiting orders, and, in place of cultivating our science for our own delectation, try only to accommodate ourselves to the wants of our patrons? If mathematics has no other object besides aiding those who study nature, it is from these we should await orders. Is this way of looking at it legitimate? Certainly not; if we had not cultivated the exact sciences for themselves, we should not have created mathematics the instrument, and the day the call came from the physicist we should have been helpless.

It's true that sometimes mathematicians take on problems to meet a need in physics; that physicists or engineers ask them to calculate a number for a specific application. Should we geometers just wait for orders, and instead of pursuing our science for our own enjoyment, only try to cater to the needs of our clients? If the only purpose of mathematics is to help those studying nature, we should wait for requests from them. Is this perspective valid? Absolutely not; if we hadn't developed the exact sciences for their own sake, we wouldn't have created mathematics as a tool, and when the physicist needed help, we would have been unable to assist.

Nor do the physicists wait to study a phenomenon until some urgent need of material life has made it a necessity for them; and they are right. If the scientists of the eighteenth century had neglected electricity as being in their eyes only a curiosity without practical interest, we should have had in the twentieth century neither telegraphy, nor electro-chemistry, nor electro-technics. The physicists, compelled to choose, are therefore not guided in their choice solely by utility. How then do they choose between the facts of nature? We have explained it in the preceding chapter: the facts which interest them are those capable of leading to the discovery of a law, and so they are analogous to many other facts which do not seem to us isolated, but closely grouped with others. The isolated fact attracts all eyes, those of the layman as well as of the scientist. But what the genuine physicist alone knows how to see, is the bond which unites many facts whose analogy is profound but hidden. The story of Newton's apple is probably not true, but it is symbolic; let us speak of it then as if it were true. Well then, we must believe that before Newton plenty of men had seen apples fall; not one knew[Pg 371] how to conclude anything therefrom. Facts would be sterile were there not minds capable of choosing among them, discerning those behind which something was hidden, and of recognizing what is hiding, minds which under the crude fact perceive the soul of the fact.

Nor do physicists wait to study a phenomenon until an urgent need for material life makes it necessary for them; and they’re right. If the scientists of the eighteenth century had ignored electricity as just a curiosity with no practical use, we wouldn’t have had telegraphy, electro-chemistry, or electro-technics in the twentieth century. Physicists, when forced to choose, are not guided by usefulness alone. So how do they decide between the facts of nature? We explained it in the previous chapter: they are interested in facts that can lead to the discovery of a law, which are often tied to many other facts that seem interconnected rather than isolated. The isolated fact catches everyone's attention, both laypeople and scientists. But what only the true physicist can see is the connection that unites many facts whose similarities are deep but hidden. The story of Newton's apple is probably not true, but it’s symbolic; let’s talk about it as if it were true. So, we must believe that many people before Newton had seen apples fall; yet not one of them knew how to draw any conclusions from it. Facts would be meaningless if there weren’t minds capable of selecting among them, recognizing those behind which something is concealed, and understanding what is hidden—minds that perceive the essence of the fact beneath its surface.

We find just the same thing in mathematics. From the varied elements at our disposal we can get millions of different combinations; but one of these combinations, in so far as it is isolated, is absolutely void of value. Often we have taken great pains to construct it, but it serves no purpose, if not perhaps to furnish a task in secondary education. Quite otherwise will it be when this combination shall find place in a class of analogous combinations and we shall have noticed this analogy. We are no longer in the presence of a fact, but of a law. And upon that day the real discoverer will not be the workman who shall have patiently built up certain of these combinations; it will be he who brings to light their kinship. The first will have seen merely the crude fact, only the other will have perceived the soul of the fact. Often to fix this kinship it suffices him to make a new word, and this word is creative. The history of science furnishes us a crowd of examples familiar to all.

We see the same thing in mathematics. With the different elements we have, we can create millions of unique combinations; however, one of these combinations, when taken on its own, is completely worthless. Many times we've put in a lot of effort to create it, but it serves no real purpose, except maybe as an exercise in high school. It’s a different story when this combination fits into a group of similar combinations, and we recognize that similarity. Now we’re not just looking at a fact, but a law. On that day, the true discoverer won’t be the person who painstakingly built these combinations; it will be the one who highlights their connection. The first will have noticed just the basic fact, while the other will have understood the essence of the fact. Often, to clarify this connection, all it takes is for them to create a new term, and that term is innovative. The history of science gives us many examples that everyone knows.

The celebrated Vienna philosopher Mach has said that the rôle of science is to produce economy of thought, just as machines produce economy of effort. And that is very true. The savage reckons on his fingers or by heaping pebbles. In teaching children the multiplication table we spare them later innumerable pebble bunchings. Some one has already found out, with pebbles or otherwise, that 6 times 7 is 42 and has had the idea of noting the result, and so we need not do it over again. He did not waste his time even if he reckoned for pleasure: his operation took him only two minutes; it would have taken in all two milliards if a milliard men had had to do it over after him.

The famous Vienna philosopher Mach stated that the role of science is to create efficiency in thought, just as machines create efficiency in labor. And that’s absolutely correct. A primitive person counts on their fingers or by stacking stones. Teaching kids the multiplication table saves them countless stone stackings later on. Someone has already figured out, whether with stones or otherwise, that 6 times 7 equals 42 and thought to record the result so we don’t have to redo it. He didn’t waste his time, even if he was calculating for fun: his calculation took just two minutes; if a billion people had to redo it after him, it would have taken an entire two billion minutes.

The importance of a fact then is measured by its yield, that is to say, by the amount of thought it permits us to spare.

The value of a fact is determined by its usefulness, meaning the extent to which it allows us to conserve our mental effort.

In physics the facts of great yield are those entering into a very general law, since from it they enable us to foresee a great number of others, and just so it is in mathematics. Suppose I have undertaken a complicated calculation and laboriously[Pg 372] reached a result: I shall not be compensated for my trouble if thereby I have not become capable of foreseeing the results of other analogous calculations and guiding them with a certainty that avoids the gropings to which one must be resigned in a first attempt. On the other hand, I shall not have wasted my time if these gropings themselves have ended by revealing to me the profound analogy of the problem just treated with a much more extended class of other problems; if they have shown me at once the resemblances and differences of these, if in a word they have made me perceive the possibility of a generalization. Then it is not a new result I have won, it is a new power.

In physics, the most significant facts are those that contribute to a broad law, as they allow us to predict many other outcomes, just like in mathematics. Imagine I've tackled a complex calculation and painstakingly arrived at a result: I won't feel rewarded for my effort unless I've also gained the ability to anticipate the results of similar calculations and confidently guide them, avoiding the uncertainty that comes with trying something for the first time. On the flip side, I won't have wasted my time if these initial struggles reveal the deep connections between the problem at hand and a much wider range of other problems. If they highlight both the similarities and differences among them, and ultimately help me see the potential for a generalization, then I've not just achieved a new result; I've gained a new capability.

The simple example that comes first to mind is that of an algebraic formula which gives us the solution of a type of numeric problems when finally we replace the letters by numbers. Thanks to it, a single algebraic calculation saves us the pains of ceaselessly beginning over again new numeric calculations. But this is only a crude example; we all know there are analogies inexpressible by a formula and all the more precious.

The first example that comes to mind is an algebraic formula that provides a solution for certain types of numeric problems once we substitute the letters with numbers. Because of this, a single algebraic calculation spares us the hassle of constantly starting new numeric calculations from scratch. However, this is just a basic example; we all know there are analogies that can't be expressed by a formula and are even more valuable.

A new result is of value, if at all, when in unifying elements long known but hitherto separate and seeming strangers one to another it suddenly introduces order where apparently disorder reigned. It then permits us to see at a glance each of these elements and its place in the assemblage. This new fact is not merely precious by itself, but it alone gives value to all the old facts it combines. Our mind is weak as are the senses; it would lose itself in the world's complexity were this complexity not harmonious; like a near-sighted person, it would see only the details and would be forced to forget each of these details before examining the following, since it would be incapable of embracing all. The only facts worthy our attention are those which introduce order into this complexity and so make it accessible.

A new finding is only valuable if it brings together elements that are long known but have previously seemed unrelated. It suddenly creates order where chaos seemed to exist. This allows us to see at a glance each element and its role in the whole. This new piece of information isn't just valuable on its own; it also adds value to all the old facts it connects. Our minds are as fragile as our senses; without a harmonious structure, we'd get lost in the complexity of the world. Like someone who is nearsighted, we'd only notice the details and forget each one before moving on to the next, unable to grasp everything at once. The only facts that deserve our attention are those that bring order to this complexity, making it understandable.

Mathematicians attach great importance to the elegance of their methods and their results. This is not pure dilettantism. What is it indeed that gives us the feeling of elegance in a solution, in a demonstration? It is the harmony of the diverse parts, their symmetry, their happy balance; in a word it is all that introduces order, all that gives unity, that permits us to see clearly and to comprehend at once both the ensemble and the[Pg 373] details. But this is exactly what yields great results; in fact the more we see this aggregate clearly and at a single glance, the better we perceive its analogies with other neighboring objects, consequently the more chances we have of divining the possible generalizations. Elegance may produce the feeling of the unforeseen by the unexpected meeting of objects we are not accustomed to bring together; there again it is fruitful, since it thus unveils for us kinships before unrecognized. It is fruitful even when it results only from the contrast between the simplicity of the means and the complexity of the problem set; it makes us then think of the reason for this contrast and very often makes us see that chance is not the reason; that it is to be found in some unexpected law. In a word, the feeling of mathematical elegance is only the satisfaction due to any adaptation of the solution to the needs of our mind, and it is because of this very adaptation that this solution can be for us an instrument. Consequently this esthetic satisfaction is bound up with the economy of thought. Again the comparison of the Erechtheum comes to my mind, but I must not use it too often.

Mathematicians really value the elegance of their methods and results. This isn’t just a hobbyist's appreciation. What gives us the sense of elegance in a solution or demonstration? It’s the harmony of different parts, their symmetry, their balanced nature; in short, it’s everything that brings order and unity, allowing us to clearly understand both the whole and the specifics at once. This clarity leads to significant outcomes; the better we grasp this overall picture at a glance, the more we notice its similarities with other related concepts, increasing our chances of discovering possible generalizations. Elegance can create a sense of surprise through the unexpected connections between ideas we don’t usually combine; this is also productive, as it reveals relationships we hadn’t recognized before. It’s valuable even when it arises solely from the contrast between the simplicity of the methods and the complexity of the problem; this difference prompts us to question why this is the case and often leads us to realize that randomness isn’t the reason; rather, it points to some unexpected principle. In essence, the feeling of mathematical elegance is the satisfaction that comes from a solution fitting the needs of our thinking, and it's this very fit that allows the solution to serve as a tool for us. Therefore, this aesthetic satisfaction is linked to the efficiency of thought. The comparison to the Erechtheum pops into my mind again, but I shouldn't rely on it too much.

It is for the same reason that, when a rather long calculation has led to some simple and striking result, we are not satisfied until we have shown that we should have been able to foresee, if not this entire result, at least its most characteristic traits. Why? What prevents our being content with a calculation which has told us, it seems, all we wished to know? It is because, in analogous cases, the long calculation might not again avail, and that this is not so about the reasoning often half intuitive which would have enabled us to foresee. This reasoning being short, we see at a single glance all its parts, so that we immediately perceive what must be changed to adapt it to all the problems of the same nature which can occur. And then it enables us to foresee if the solution of these problems will be simple, it shows us at least if the calculation is worth undertaking.

It’s for the same reason that when a lengthy calculation leads to a simple and impressive result, we aren't satisfied until we can prove that we should have been able to foresee, if not the entire result, at least its most distinctive features. Why? What stops us from being happy with a calculation that seems to have given us all the information we wanted? It's because in similar cases, a long calculation might not work again, while the reasoning—often somewhat intuitive—that could have guided our foresight usually doesn’t require much effort. This reasoning is concise, allowing us to see all its components at a glance, which helps us quickly recognize what needs to be adjusted to apply it to other problems of a similar nature. It also lets us anticipate whether the solutions to these problems will be straightforward and at least indicates whether the calculation is worth pursuing.

What we have just said suffices to show how vain it would be to seek to replace by any mechanical procedure the free initiative of the mathematician. To obtain a result of real value, it is not enough to grind out calculations, or to have a machine to put[Pg 374] things in order; it is not order alone, it is unexpected order, which is worth while. The machine may gnaw on the crude fact, the soul of the fact will always escape it.

What we just said is enough to show how pointless it would be to try to replace the mathematician's free initiative with any mechanical process. To achieve a genuinely valuable result, it's not enough to just churn out calculations or have a machine organize things; it's not just about order, but about unexpected order that truly matters. The machine may chew on the raw data, but the essence of the data will always elude it.

Since the middle of the last century, mathematicians are more and more desirous of attaining absolute rigor; they are right, and this tendency will be more and more accentuated. In mathematics rigor is not everything, but without it there is nothing. A demonstration which is not rigorous is nothingness. I think no one will contest this truth. But if it were taken too literally, we should be led to conclude that before 1820, for example, there was no mathematics; this would be manifestly excessive; the geometers of that time understood voluntarily what we explain by prolix discourse. This does not mean that they did not see it at all; but they passed over it too rapidly, and to see it well would have necessitated taking the pains to say it.

Since the middle of the last century, mathematicians have increasingly wanted to achieve absolute rigor; they’re right, and this trend will continue to grow. In mathematics, rigor isn’t everything, but without it, there’s nothing. A proof that isn’t rigorous is meaningless. I think no one would dispute this truth. However, if we take it too literally, we might conclude that before 1820, for example, there was no mathematics; that would be clearly excessive. The mathematicians of that time understood voluntarily what we explain in lengthy discussions. This doesn’t mean they didn’t grasp it at all; they just moved through it too quickly, and to understand it well would have required the effort to articulate it clearly.

But is it always needful to say it so many times? Those who were the first to emphasize exactness before all else have given us arguments that we may try to imitate; but if the demonstrations of the future are to be built on this model, mathematical treatises will be very long; and if I fear the lengthenings, it is not solely because I deprecate encumbering libraries, but because I fear that in being lengthened out, our demonstrations may lose that appearance of harmony whose usefulness I have just explained.

But is it really necessary to say it so many times? Those who were the first to stress precision above everything else have provided us with arguments that we might try to follow; however, if future demonstrations are built on this model, mathematical papers will end up being very lengthy. And if I'm concerned about the length, it's not just because I dislike cluttering libraries, but because I worry that as they get longer, our demonstrations might lose that sense of harmony whose importance I've just explained.

The economy of thought is what we should aim at, so it is not enough to supply models for imitation. It is needful for those after us to be able to dispense with these models and, in place of repeating an argument already made, summarize it in a few words. And this has already been attained at times. For instance, there was a type of reasoning found everywhere, and everywhere alike. They were perfectly exact but long. Then all at once the phrase 'uniformity of convergence' was hit upon and this phrase made those arguments needless; we were no longer called upon to repeat them, since they could be understood. Those who conquer difficulties then do us a double service: first they teach us to do as they at need, but above all they enable us as often as possible to avoid doing as they, yet without sacrifice of exactness.

The economy of thought is what we should strive for, so it's not enough just to provide models for others to copy. Those who come after us need to be able to do without these models and instead of repeating a previously made argument, summarize it in just a few words. This has already been achieved at times. For example, there was a type of reasoning that was found everywhere, and it was consistently precise but lengthy. Then suddenly, the term 'uniformity of convergence' was coined, which made those arguments unnecessary; we no longer needed to repeat them since they could be easily understood. Those who tackle challenges do us a double favor: first, they teach us to do the same when necessary, but more importantly, they help us avoid repeating their efforts as often as possible, all while maintaining accuracy.

We have just seen by one example the importance of words in mathematics, but many others could be cited. It is hard to believe how much a well-chosen word can economize thought, as Mach says. Perhaps I have already said somewhere that mathematics is the art of giving the same name to different things. It is proper that these things, differing in matter, be alike in form, that they may, so to speak, run in the same mold. When the language has been well chosen, we are astonished to see that all the proofs made for a certain object apply immediately to many new objects; there is nothing to change, not even the words, since the names have become the same.

We’ve just looked at one example of how important words are in math, but many more could be mentioned. It’s hard to believe how much a well-chosen word can save on thinking, as Mach says. I might have mentioned before that math is the art of giving the same name to different things. It's fitting that these things, which differ in substance, should be similar in form so they can, in a way, fit the same mold. When the language is chosen wisely, we’re amazed to find that all the proofs made for one object work right away for many new objects; there's nothing to change, not even the words, since the names have become the same.

A well-chosen word usually suffices to do away with the exceptions from which the rules stated in the old way suffer; this is why we have created negative quantities, imaginaries, points at infinity, and what not. And exceptions, we must not forget, are pernicious because they hide the laws.

A carefully chosen word typically eliminates the exceptions that the traditional rules struggle with; this is why we've developed negative numbers, imaginary numbers, points at infinity, and so on. And we shouldn't overlook that exceptions are harmful because they obscure the underlying principles.

Well, this is one of the characteristics by which we recognize the facts which yield great results. They are those which allow of these happy innovations of language. The crude fact then is often of no great interest; we may point it out many times without having rendered great service to science. It takes value only when a wiser thinker perceives the relation for which it stands, and symbolizes it by a word.

Well, this is one of the traits that help us identify the facts that lead to significant results. They are the ones that enable these joyful innovations in language. The basic fact is often not very interesting; we can highlight it many times without contributing much to science. It only gains value when a smarter thinker recognizes the relationship it represents and captures it with a word.

Moreover the physicists do just the same. They have invented the word 'energy,' and this word has been prodigiously fruitful, because it also made the law by eliminating the exceptions, since it gave the same name to things differing in matter and like in form.

Moreover, physicists do the same thing. They came up with the word 'energy,' and this term has been incredibly effective, as it created the law by removing exceptions, since it labeled things that differ in substance but are similar in form with the same name.

Among words that have had the most fortunate influence I would select 'group' and 'invariant.' They have made us see the essence of many mathematical reasonings; they have shown us in how many cases the old mathematicians considered groups without knowing it, and how, believing themselves far from one another, they suddenly found themselves near without knowing why.

Among words that have had the most positive impact, I would choose 'group' and 'invariant.' They have helped us understand the core of many mathematical concepts; they have revealed how often past mathematicians dealt with groups without realizing it, and how, thinking they were distant from each other, they unexpectedly found common ground without knowing why.

To-day we should say that they had dealt with isomorphic groups. We now know that in a group the matter is of little interest, the form alone counts, and that when we know a group[Pg 376] we thus know all the isomorphic groups; and thanks to these words 'group' and 'isomorphism,' which condense in a few syllables this subtile rule and quickly make it familiar to all minds, the transition is immediate and can be done with every economy of thought effort. The idea of group besides attaches to that of transformation. Why do we put such a value on the invention of a new transformation? Because from a single theorem it enables us to get ten or twenty; it has the same value as a zero adjoined to the right of a whole number.

Today we would say that they dealt with isomorphic groups. We now understand that in a group, the specifics don’t matter; only the structure does. When we know a group[Pg 376], we know all the isomorphic groups. Thanks to the terms 'group' and 'isomorphism,' which effectively summarize this subtle rule and make it easily understandable, the transition is straightforward and requires minimal mental effort. The concept of a group is also linked to that of transformation. Why do we place such importance on creating a new transformation? Because from one theorem, we can derive ten or twenty others; it’s as valuable as adding a zero to the right of a whole number.

This then it is which has hitherto determined the direction of mathematical advance, and just as certainly will determine it in the future. But to this end the nature of the problems which come up contributes equally. We can not forget what must be our aim. In my opinion this aim is double. Our science borders upon both philosophy and physics, and we work for our two neighbors; so we have always seen and shall still see mathematicians advancing in two opposite directions.

This is what has shaped the direction of mathematical progress up to now and will certainly continue to do so in the future. However, the nature of the problems we face also plays a crucial role. We must not lose sight of our goal. In my view, this goal is twofold. Our field overlaps with both philosophy and physics, and we serve both of our neighboring disciplines; thus, we have always seen and will continue to see mathematicians advancing in two opposite directions.

On the one hand, mathematical science must reflect upon itself, and that is useful since reflecting on itself is reflecting on the human mind which has created it, all the more because it is the very one of its creations for which it has borrowed least from without. This is why certain mathematical speculations are useful, such as those devoted to the study of the postulates, of unusual geometries, of peculiar functions. The more these speculations diverge from ordinary conceptions, and consequently from nature and applications, the better they show us what the human mind can create when it frees itself more and more from the tyranny of the external world, the better therefore they let us know it in itself.

On one hand, mathematical science needs to reflect on itself, and this is valuable because such reflection is also contemplation of the human mind that has developed it, especially since it is one of the creations that relies the least on external influences. This is why certain mathematical explorations are beneficial, like those focused on studying postulates, unusual geometries, and unique functions. The more these explorations stray from standard ideas, and thus from nature and practical applications, the better they demonstrate what the human mind can create when it increasingly liberates itself from the constraints of the outside world, and in turn, they help us understand the mind itself better.

But it is toward the other side, the side of nature, that we must direct the bulk of our army. There we meet the physicist or the engineer, who says to us: "Please integrate this differential equation for me; I might need it in a week in view of a construction which should be finished by that time." "This equation," we answer, "does not come under one of the integrable types; you know there are not many." "Yes, I know; but then what good are you?" Usually to understand each other is enough; the engineer in reality does not need the integral in finite terms;[Pg 377] he needs to know the general look of the integral function, or he simply wants a certain number which could readily be deduced from this integral if it were known. Usually it is not known, but the number can be calculated without it if we know exactly what number the engineer needs and with what approximation.

But we need to focus most of our energy on the side of nature. There we encounter the physicist or the engineer, who says to us: "Can you please solve this differential equation for me? I might need it in a week for a project that should be finished by then." "This equation," we respond, "is not one of the solvable types; you know there aren't many." "I know that; but then what use are you?" Usually, it’s enough for us to understand each other; the engineer doesn’t actually need the integral in exact terms; he just needs to have a general idea of what the integral function looks like or he simply wants a specific number that could easily be figured out from this integral if it were known. It usually isn’t known, but the number can be calculated without it if we know exactly what number the engineer needs and what level of accuracy is required.

Formerly an equation was considered solved only when its solution had been expressed by aid of a finite number of known functions; but that is possible scarcely once in a hundred times. What we always can do, or rather what we should always seek to do, is to solve the problem qualitatively so to speak; that is to say, seek to know the general form of the curve which represents the unknown function.

Previously, an equation was only deemed solved when its solution could be expressed using a finite number of known functions; however, that only happens about once in a hundred times. What we can always do, or rather what we should always aim to do, is to solve the problem qualitatively, so to speak; in other words, we should strive to understand the general shape of the curve that represents the unknown function.

It remains to find the quantitative solution of the problem; but if the unknown can not be determined by a finite calculation, it may always be represented by a convergent infinite series which enables us to calculate it. Can that be regarded as a true solution? We are told that Newton sent Leibnitz an anagram almost like this: aaaaabbbeeeeij, etc. Leibnitz naturally understood nothing at all of it; but we, who have the key, know that this anagram meant, translated into modern terms: "I can integrate all differential equations"; and we are tempted to say that Newton had either great luck or strange delusions. He merely wished to say he could form (by the method of indeterminate coefficients) a series of powers formally satisfying the proposed equation.

It remains to find the quantitative solution to the problem; but if the unknown can't be determined by a finite calculation, it can always be represented by a convergent infinite series that allows us to calculate it. Can that really be seen as a true solution? It’s said that Newton sent Leibnitz an anagram almost like this: aaaaabbbeeeeij, etc. Leibnitz understandably didn’t grasp its meaning at all; but we, who have the key, know that this anagram meant, translated into modern terms: "I can integrate all differential equations"; and we might be tempted to say that Newton had either incredible luck or some odd delusions. He simply wanted to convey that he could create (using the method of undetermined coefficients) a series of powers that formally satisfied the proposed equation.

Such a solution would not satisfy us to-day, and for two reasons: because the convergence is too slow and because the terms follow each other without obeying any law. On the contrary, the series Θ seems to us to leave nothing to be desired, first because it converges very quickly (this is for the practical man who wishes to get at a number as quickly as possible) and next because we see at a glance the law of the terms (this is to satisfy the esthetic need of the theorist).

Such a solution wouldn't satisfy us today for two reasons: first, because the convergence is too slow, and second, because the terms follow one another without any clear pattern. In contrast, the series Θ seems perfect to us, primarily because it converges very quickly (which is important for the practical person who wants to get to a number as fast as possible) and also because we can easily see the pattern of the terms (which meets the aesthetic needs of the theorist).

But then there are no longer solved problems and others which are not; there are only problems more or less solved, according as they are solved by a series converging more or less rapidly, or ruled by a law more or less harmonious. It often happens however that an imperfect solution guides us toward a[Pg 378] better one. Sometimes the series converges so slowly that the computation is impracticable and we have only succeeded in proving the possibility of the problem.

But now there are no longer completely solved problems and others that aren’t; there are only problems more or less solved, depending on whether they are solved by a series that converges more or less quickly, or by a law that is more or less harmonious. However, it often happens that an imperfect solution leads us toward a[Pg 378] better one. Sometimes the series converges so slowly that the calculation is impractical, and we have only managed to demonstrate the feasibility of the problem.

And then the engineer finds this a mockery, and justly, since it will not aid him to complete his construction by the date fixed. He little cares to know if it will benefit engineers of the twenty-second century. But as for us, we think differently and we are sometimes happier to have spared our grandchildren a day's work than to have saved our contemporaries an hour.

And then the engineer sees this as a joke, and rightly so, because it won't help him finish his project by the deadline. He doesn’t care if it will benefit engineers of the twenty-second century. But as for us, we see it differently, and sometimes we feel happier to have saved our grandchildren a day's work than to have saved our peers an hour.

Sometimes by groping, empirically, so to speak, we reach a formula sufficiently convergent. "What more do you want?" says the engineer. And yet, in spite of all, we are not satisfied; we should have liked to foresee that convergence. Why? Because if we had known how to foresee it once, we would know how to foresee it another time. We have succeeded; that is a small matter in our eyes if we can not validly expect to do so again.

Sometimes by feeling our way, practically speaking, we come up with a formula that’s close enough. "What more do you want?" the engineer asks. And yet, despite everything, we're still not satisfied; we wish we could have predicted that convergence. Why? Because if we had known how to predict it once, we would know how to do it again. We’ve succeeded; that doesn’t feel like much to us if we can’t reasonably expect to do so again.

In proportion as science develops, its total comprehension becomes more difficult; then we seek to cut it in pieces and to be satisfied with one of these pieces: in a word, to specialize. If we went on in this way, it would be a grievous obstacle to the progress of science. As we have said, it is by unexpected union between its diverse parts that it progresses. To specialize too much would be to forbid these drawings together. It is to be hoped that congresses like those of Heidelberg and Rome, by putting us in touch with one another, will open for us vistas over neighboring domains and oblige us to compare them with our own, to range somewhat abroad from our own little village; thus they will be the best remedy for the danger just mentioned.

As science advances, it becomes harder to understand as a whole; we then try to break it down into smaller parts and settle for just one of those parts: in other words, to specialize. If we continue down this path, it would be a serious barrier to scientific progress. As we've mentioned, science moves forward through unexpected connections between its various aspects. Over-specialization would prevent these connections from forming. It’s hoped that conferences like those in Heidelberg and Rome, by connecting us with one another, will broaden our horizons into related fields and push us to compare them with our own, encouraging us to step beyond our small circles; in this way, they will be the best solution to the risks we've just discussed.

But I have lingered too long over generalities; it is time to enter into detail.

But I've spent too much time on generalities; it's time to get into the details.

Let us pass in review the various special sciences which combined make mathematics; let us see what each has accomplished, whither it tends and what we may hope from it. If the preceding views are correct, we should see that the greatest advances in the past have happened when two of these sciences have united, when we have become conscious of the similarity of their form, despite the difference of their matter, when they have so modeled themselves upon each other that each could profit by the other's[Pg 379] conquests. We should at the same time foresee in combinations of the same sort the progress of the future.

Let's take a look at the various branches of science that together make up mathematics. We'll explore what each has achieved, where it is heading, and what we can expect from it. If the previous ideas are accurate, we should recognize that the biggest strides in the past occurred when two of these fields came together. This happened when we became aware of their similar structures, even though their subjects differ, and when they shaped themselves based on each other so that each could benefit from the accomplishments of the other[Pg 379]. At the same time, we should anticipate that similar combinations will drive future progress.

Math

Progress in arithmetic has been much slower than in algebra and analysis, and it is easy to see why. The feeling of continuity is a precious guide which the arithmetician lacks; each whole number is separated from the others—it has, so to speak, its own individuality. Each of them is a sort of exception and this is why general theorems are rarer in the theory of numbers; this is also why those which exist are more hidden and longer elude the searchers.

Progress in arithmetic has been much slower than in algebra and analysis, and it's easy to understand why. The sense of continuity is a valuable guide that arithmeticians don't have; each whole number stands apart from the others—it has, in a way, its own individuality. Each one is a kind of exception, which makes general theorems in number theory less common; this also explains why those that do exist are often more obscure and take longer to discover.

If arithmetic is behind algebra and analysis, the best thing for it to do is to seek to model itself upon these sciences so as to profit by their advance. The arithmetician ought therefore to take as guide the analogies with algebra. These analogies are numerous and if, in many cases, they have not yet been studied sufficiently closely to become utilizable, they at least have long been foreseen, and even the language of the two sciences shows they have been recognized. Thus we speak of transcendent numbers and thus we account for the future classification of these numbers already having as model the classification of transcendent functions, and still we do not as yet very well see how to pass from one classification to the other; but had it been seen, it would already have been accomplished and would no longer be the work of the future.

If arithmetic is behind algebra and analysis, the best thing it can do is to try to model itself after these subjects to benefit from their progress. Therefore, the arithmetician should use the analogies with algebra as a guide. These analogies are numerous, and while they haven't been closely studied enough to be fully utilized in many cases, they have long been anticipated, and even the terminology of the two subjects indicates that they have been acknowledged. For example, we talk about transcendent numbers, and we justify the upcoming classification of these numbers by using the existing classification of transcendent functions; yet, we still don't clearly see how to transition from one classification to the other. However, if it had been clear, it would already have been done and would not still be a task for the future.

The first example that comes to my mind is the theory of congruences, where is found a perfect parallelism to the theory of algebraic equations. Surely we shall succeed in completing this parallelism, which must hold for instance between the theory of algebraic curves and that of congruences with two variables. And when the problems relative to congruences with several variables shall be solved, this will be a first step toward the solution of many questions of indeterminate analysis.

The first example that comes to mind is the theory of congruences, which has a perfect parallel to the theory of algebraic equations. We will surely succeed in completing this parallel, which must exist, for example, between the theory of algebraic curves and that of congruences with two variables. Once we solve the problems related to congruences with several variables, it will be a significant step toward answering many questions in indeterminate analysis.

Math

The theory of algebraic equations will still long hold the attention of geometers; numerous and very different are the sides whence it may be attacked.[Pg 380]

The theory of algebraic equations will continue to capture the interest of geometers for a long time; there are many diverse angles from which it can be approached.[Pg 380]

We need not think algebra is ended because it gives us rules to form all possible combinations; it remains to find the interesting combinations, those which satisfy such and such a condition. Thus will be formed a sort of indeterminate analysis where the unknowns will no longer be whole numbers, but polynomials. This time it is algebra which will model itself upon arithmetic, following the analogy of the whole number to the integral polynomial with any coefficients or to the integral polynomial with integral coefficients.

We shouldn’t assume that algebra is finished just because it provides rules for creating all possible combinations; we still need to discover the interesting combinations that meet certain conditions. This will lead to a kind of indeterminate analysis where the unknowns are no longer whole numbers, but polynomials. In this case, algebra will adapt to resemble arithmetic, drawing an analogy between whole numbers and integral polynomials, whether they have any coefficients or specifically integral coefficients.

Geometry

It looks as if geometry could contain nothing which is not already included in algebra or analysis; that geometric facts are only algebraic or analytic facts expressed in another language. It might then be thought that after our review there would remain nothing more for us to say relating specially to geometry. This would be to fail to recognize the importance of well-constructed language, not to comprehend what is added to the things themselves by the method of expressing these things and consequently of grouping them.

It seems that geometry can’t include anything that isn’t already part of algebra or analysis; that geometric truths are just algebraic or analytic truths stated in a different way. One might think that after our discussion, there’s nothing left to say specifically about geometry. However, this would overlook the significance of well-crafted language and miss how the way we express these concepts and organize them adds value to the concepts themselves.

First the geometric considerations lead us to set ourselves new problems; these may be, if you choose, analytic problems, but such as we never would have set ourselves in connection with analysis. Analysis profits by them however, as it profits by those it has to solve to satisfy the needs of physics.

First, the geometric considerations lead us to establish new problems; these may be, if you prefer, analytical problems, but they are ones we would never have created in relation to analysis. Nevertheless, analysis benefits from them, just as it benefits from the problems it must solve to meet the requirements of physics.

A great advantage of geometry lies in the fact that in it the senses can come to the aid of thought, and help find the path to follow, and many minds prefer to put the problems of analysis into geometric form. Unhappily our senses can not carry us very far, and they desert us when we wish to soar beyond the classical three dimensions. Does this mean that, beyond the restricted domain wherein they seem to wish to imprison us, we should rely only on pure analysis and that all geometry of more than three dimensions is vain and objectless? The greatest masters of a preceding generation would have answered 'yes'; to-day we are so familiarized with this notion that we can speak of it, even in a university course, without arousing too much astonishment.

A major benefit of geometry is that it allows our senses to support our thinking and help us find the right path. Many people prefer to turn analytical problems into geometric forms. Unfortunately, our senses can only take us so far, and they fail us when we try to think beyond the traditional three dimensions. Does this mean that, beyond the limited space where they seem to want to confine us, we should rely solely on pure analysis, making all geometry of more than three dimensions pointless and meaningless? The greatest thinkers from a previous generation would have said 'yes'; today, we are so accustomed to this idea that we can discuss it even in a university course without causing too much surprise.

But what good is it? That is easy to see: First it gives us a[Pg 381] very convenient terminology, which expresses concisely what the ordinary analytic language would say in prolix phrases. Moreover, this language makes us call like things by the same name and emphasize analogies it will never again let us forget. It enables us therefore still to find our way in this space which is too big for us and which we can not see, always recalling visible space, which is only an imperfect image of it doubtless, but which is nevertheless an image. Here again, as in all the preceding examples, it is analogy with the simple which enables us to comprehend the complex.

But what’s the point? It’s clear: First, it gives us a[Pg 381] very handy vocabulary that sums up what ordinary analytic language would express in long-winded phrases. Additionally, this language helps us label similar things the same way and highlights analogies that we won't ever forget. It allows us to navigate this vast space that’s beyond our perception, constantly reminding us of visible space, which, although just a flawed representation, is still a representation. Again, as in all the previous examples, it’s the analogy with the simple that helps us understand the complex.

This geometry of more than three dimensions is not a simple analytic geometry; it is not purely quantitative, but qualitative also, and it is in this respect above all that it becomes interesting. There is a science called analysis situs and which has for its object the study of the positional relations of the different elements of a figure, apart from their sizes. This geometry is purely qualitative; its theorems would remain true if the figures, instead of being exact, were roughly imitated by a child. We may also make an analysis situs of more than three dimensions. The importance of analysis situs is enormous and can not be too much emphasized; the advantage obtained from it by Riemann, one of its chief creators, would suffice to prove this. We must achieve its complete construction in the higher spaces; then we shall have an instrument which will enable us really to see in hyperspace and supplement our senses.

This geometry of more than three dimensions isn't just basic analytic geometry; it's not only about numbers but also about qualities, and that's what makes it particularly interesting. There's a field called analysis situs, which focuses on studying the positional relationships of different elements in a figure, independent of their sizes. This type of geometry is entirely qualitative; its theorems would still hold true even if the figures were roughly drawn by a child instead of being precise. We can also perform analysis situs in more than three dimensions. The significance of analysis situs is huge and can't be overstated; the benefits Riemann, one of its main founders, gained from it are enough to support this claim. We need to fully develop its structure in higher-dimensional spaces; then we'll have a tool that allows us to effectively perceive hyperspace and enhance our senses.

The problems of analysis situs would perhaps not have suggested themselves if the analytic language alone had been spoken; or rather, I am mistaken, they would have occurred surely, since their solution is essential to a crowd of questions in analysis, but they would have come singly, one after another, and without our being able to perceive their common bond.

The issues of analysis situs might not have come to mind if we were only using the analytic language; or rather, I’m wrong— they definitely would have emerged since solving them is crucial to many questions in analysis. However, they would have appeared one by one, without us being able to see their connection.

Cantorism

I have spoken above of our need to go back continually to the first principles of our science, and of the advantage of this for the study of the human mind. This need has inspired two endeavors which have taken a very prominent place in the most recent annals of mathematics. The first is Cantorism, which has[Pg 382] rendered our science such conspicuous service. Cantor introduced into science a new way of considering mathematical infinity. One of the characteristic traits of Cantorism is that in place of going up to the general by building up constructions more and more complicated and defining by construction, it starts from the genus supremum and defines only, as the scholastics would have said, per genus proximum et differentiam specificam. Thence comes the horror it has sometimes inspired in certain minds, for instance in Hermite, whose favorite idea was to compare the mathematical to the natural sciences. With most of us these prejudices have been dissipated, but it has come to pass that we have encountered certain paradoxes, certain apparent contradictions that would have delighted Zeno, the Eleatic and the school of Megara. And then each must seek the remedy. For my part, I think, and I am not the only one, that the important thing is never to introduce entities not completely definable in a finite number of words. Whatever be the cure adopted, we may promise ourselves the joy of the doctor called in to follow a beautiful pathologic case.

I’ve talked before about our need to continually return to the basic principles of our science and how beneficial this is for studying the human mind. This necessity has driven two major efforts that are notable in recent mathematics. The first is Cantorism, which has greatly served our field. Cantor brought a fresh perspective on mathematical infinity. One key aspect of Cantorism is that instead of building up to the general by creating increasingly complex constructions and defining them through those constructions, it starts from the genus supremum and only defines, as the scholastics would say, per genus proximum et differentiam specificam. This has sometimes caused discomfort in certain thinkers, like Hermite, who preferred to compare mathematics with natural sciences. While most of us have gotten past these biases, we’ve still faced some paradoxes and apparent contradictions that would have amused Zeno, the Eleatic, and the Megarian school. Now, each of us must find a solution. Personally, I believe, and I’m not alone in this, that it’s crucial to avoid introducing concepts that can’t be completely defined in a finite number of words. Whatever remedy we choose, we can look forward to the excitement of a doctor called in to manage a fascinating pathological case.

Exploring the Postulates

On the other hand, efforts have been made to enumerate the axioms and postulates, more or less hidden, which serve as foundation to the different theories of mathematics. Professor Hilbert has obtained the most brilliant results. It seems at first that this domain would be very restricted and there would be nothing more to do when the inventory should be ended, which could not take long. But when we shall have enumerated all, there will be many ways of classifying all; a good librarian always finds something to do, and each new classification will be instructive for the philosopher.

On the other hand, efforts have been made to list the axioms and postulates, more or less hidden, that serve as the foundation for various mathematical theories. Professor Hilbert has achieved the most impressive results. At first glance, it seems this area would be very limited and that once the inventory is complete, there wouldn't be much left to do, and that shouldn't take long. But once we've listed everything, there will be many ways to organize it all; a good librarian always finds something to do, and every new classification will be enlightening for the philosopher.

Here I end this review which I could not dream of making complete. I think these examples will suffice to show by what mechanism the mathematical sciences have made their progress in the past and in what direction they must advance in the future.

Here I conclude this review, which I could never imagine making complete. I believe these examples are enough to demonstrate how the mathematical sciences have progressed in the past and the direction they need to move in the future.


CHAPTER III

Math Innovation

The genesis of mathematical creation is a problem which should intensely interest the psychologist. It is the activity in which the human mind seems to take least from the outside world, in which it acts or seems to act only of itself and on itself, so that in studying the procedure of geometric thought we may hope to reach what is most essential in man's mind.

The origin of mathematical creation is a topic that should greatly interest psychologists. It's an activity where the human mind appears to rely the least on the outside world, engaging solely with itself and its own processes. By examining how geometric thinking works, we may be able to uncover what is most fundamental in the human mind.

This has long been appreciated, and some time back the journal called L'enseignement mathématique, edited by Laisant and Fehr, began an investigation of the mental habits and methods of work of different mathematicians. I had finished the main outlines of this article when the results of that inquiry were published, so I have hardly been able to utilize them and shall confine myself to saying that the majority of witnesses confirm my conclusions; I do not say all, for when the appeal is to universal suffrage unanimity is not to be hoped.

This has been appreciated for a long time, and a while back the journal called L'enseignement mathématique, edited by Laisant and Fehr, started looking into the mental habits and work methods of different mathematicians. I had completed most of this article by the time the results of that research were published, so I couldn't really incorporate them and will just say that most of the respondents support my conclusions; I won’t claim all do, since when it comes to universal opinion, you can't expect complete agreement.

A first fact should surprise us, or rather would surprise us if we were not so used to it. How does it happen there are people who do not understand mathematics? If mathematics invokes only the rules of logic, such as are accepted by all normal minds; if its evidence is based on principles common to all men, and that none could deny without being mad, how does it come about that so many persons are here refractory?

A surprising fact should catch our attention, or at least it would if we weren’t so accustomed to it. How is it possible that some people don’t understand mathematics? If math is based only on logical rules that every rational person accepts; if its truths are founded on principles shared by everyone, and that no one could dispute without being irrational, then why are so many people resistant to it?

That not every one can invent is nowise mysterious. That not every one can retain a demonstration once learned may also pass. But that not every one can understand mathematical reasoning when explained appears very surprising when we think of it. And yet those who can follow this reasoning only with difficulty are in the majority: that is undeniable, and will surely not be gainsaid by the experience of secondary-school teachers.

That not everyone can invent is not mysterious at all. It's also understandable that not everyone can remember a demonstration once learned. But it's quite surprising that not everyone can grasp mathematical reasoning when it's explained. And yet, the majority of people find it difficult to follow this kind of reasoning: that's a fact, and secondary school teachers will definitely agree based on their experience.

And further: how is error possible in mathematics? A sane mind should not be guilty of a logical fallacy, and yet there are[Pg 384] very fine minds who do not trip in brief reasoning such as occurs in the ordinary doings of life, and who are incapable of following or repeating without error the mathematical demonstrations which are longer, but which after all are only an accumulation of brief reasonings wholly analogous to those they make so easily. Need we add that mathematicians themselves are not infallible?

And also: how is it possible to make mistakes in mathematics? A rational mind shouldn’t commit a logical error, yet there are[Pg 384] brilliant individuals who don’t stumble over simple reasoning like that found in everyday life, but who struggle to follow or accurately repeat longer mathematical proofs, which, in the end, are just a series of brief reasonings very similar to those they handle easily. Should we mention that mathematicians themselves aren't perfect?

The answer seems to me evident. Imagine a long series of syllogisms, and that the conclusions of the first serve as premises of the following: we shall be able to catch each of these syllogisms, and it is not in passing from premises to conclusion that we are in danger of deceiving ourselves. But between the moment in which we first meet a proposition as conclusion of one syllogism, and that in which we reencounter it as premise of another syllogism occasionally some time will elapse, several links of the chain will have unrolled; so it may happen that we have forgotten it, or worse, that we have forgotten its meaning. So it may happen that we replace it by a slightly different proposition, or that, while retaining the same enunciation, we attribute to it a slightly different meaning, and thus it is that we are exposed to error.

The answer seems clear to me. Picture a long sequence of syllogisms, where the conclusions of the first serve as the premises for the next: we can follow each of these syllogisms, and we don’t risk deceiving ourselves when moving from premises to conclusion. However, between the moment we first encounter a proposition as a conclusion in one syllogism and when we see it again as a premise in another syllogism, some time might pass, and several links in the chain will have unraveled. This means we might forget it, or worse, we could forget its meaning. As a result, we might replace it with a slightly different proposition, or while keeping the same wording, we could assign it a slightly different meaning, and that’s how we become vulnerable to mistakes.

Often the mathematician uses a rule. Naturally he begins by demonstrating this rule; and at the time when this proof is fresh in his memory he understands perfectly its meaning and its bearing, and he is in no danger of changing it. But subsequently he trusts his memory and afterward only applies it in a mechanical way; and then if his memory fails him, he may apply it all wrong. Thus it is, to take a simple example, that we sometimes make slips in calculation because we have forgotten our multiplication table.

Often, a mathematician uses a rule. He starts by demonstrating this rule; and when this proof is fresh in his mind, he understands its meaning and significance perfectly, and he’s unlikely to alter it. However, later on, he relies on his memory and may end up applying it mechanically. If his memory fails him, he might apply it incorrectly. For example, this is why we sometimes make mistakes in calculations because we've forgotten our multiplication table.

According to this, the special aptitude for mathematics would be due only to a very sure memory or to a prodigious force of attention. It would be a power like that of the whist-player who remembers the cards played; or, to go up a step, like that of the chess-player who can visualize a great number of combinations and hold them in his memory. Every good mathematician ought to be a good chess-player, and inversely; likewise he should be a good computer. Of course that sometimes happens; thus Gauss[Pg 385] was at the same time a geometer of genius and a very precocious and accurate computer.

According to this, a special talent for mathematics would come only from a very reliable memory or an incredible ability to focus. It would be a skill like that of a card player who remembers the cards that have been played; or, to take it up a notch, like a chess player who can visualize many combinations and keep them in their mind. Every good mathematician should also be a good chess player, and vice versa; they should also be skilled at calculations. This does happen sometimes; for example, Gauss[Pg 385] was both a genius geometer and an exceptionally talented and precise calculator.

But there are exceptions; or rather I err; I can not call them exceptions without the exceptions being more than the rule. Gauss it is, on the contrary, who was an exception. As for myself, I must confess, I am absolutely incapable even of adding without mistakes. In the same way I should be but a poor chess-player; I would perceive that by a certain play I should expose myself to a certain danger; I would pass in review several other plays, rejecting them for other reasons, and then finally I should make the move first examined, having meantime forgotten the danger I had foreseen.

But there are exceptions; or rather I’m mistaken; I can’t call them exceptions without them being more common than the rule. Gauss, on the other hand, truly was an exception. As for me, I have to admit that I’m completely unable to even add without making mistakes. Similarly, I would be a terrible chess player; I would realize that a certain move puts me in danger; I would consider several other moves, rejecting them for different reasons, and then ultimately make the move I initially thought of, all while forgetting the danger I had anticipated.

In a word, my memory is not bad, but it would be insufficient to make me a good chess-player. Why then does it not fail me in a difficult piece of mathematical reasoning where most chess-players would lose themselves? Evidently because it is guided by the general march of the reasoning. A mathematical demonstration is not a simple juxtaposition of syllogisms, it is syllogisms placed in a certain order, and the order in which these elements are placed is much more important than the elements themselves. If I have the feeling, the intuition, so to speak, of this order, so as to perceive at a glance the reasoning as a whole, I need no longer fear lest I forget one of the elements, for each of them will take its allotted place in the array, and that without any effort of memory on my part.

In short, my memory is decent, but it's not good enough to make me a strong chess player. So why does it work for me in a tough piece of math where most chess players would get lost? Clearly, it's because it's guided by the overall direction of the reasoning. A mathematical proof isn’t just a collection of syllogisms; it consists of syllogisms arranged in a specific order, and that order matters much more than the individual elements themselves. If I have the feeling, or intuition, of this order, allowing me to see the reasoning as a whole at a glance, I won’t worry about forgetting any of the elements, because each will naturally fit into its place in the structure, and that happens without any effort to remember on my part.

It seems to me then, in repeating a reasoning learned, that I could have invented it. This is often only an illusion; but even then, even if I am not so gifted as to create it by myself, I myself re-invent it in so far as I repeat it.

It appears to me that, in restating a thought I've learned, I could have come up with it on my own. This is often just an illusion; but even so, even if I don’t have the talent to create it myself, I still re-invent it to the extent that I repeat it.

We know that this feeling, this intuition of mathematical order, that makes us divine hidden harmonies and relations, can not be possessed by every one. Some will not have either this delicate feeling so difficult to define, or a strength of memory and attention beyond the ordinary, and then they will be absolutely incapable of understanding higher mathematics. Such are the majority. Others will have this feeling only in a slight degree, but they will be gifted with an uncommon memory and a great power of attention. They will learn by heart the details[Pg 386] one after another; they can understand mathematics and sometimes make applications, but they cannot create. Others, finally, will possess in a less or greater degree the special intuition referred to, and then not only can they understand mathematics even if their memory is nothing extraordinary, but they may become creators and try to invent with more or less success according as this intuition is more or less developed in them.

We know that this feeling, this instinct about mathematical order, which allows us to perceive hidden harmonies and connections, isn't something everyone has. Some people lack this subtle sense, which is hard to define, as well as a memory and focus that go beyond the norm, making them completely unable to grasp advanced mathematics. That's the majority. Others might have this feeling to a small extent, but they’ll possess an exceptional memory and strong concentration. They can memorize the details[Pg 386] one after another; they can understand math and sometimes apply it, but they can't innovate. Finally, some will have this special intuition to varying degrees, allowing them not only to grasp mathematics even if their memory isn't remarkable, but also to create and try to invent with varying levels of success, depending on how developed this intuition is in them.

In fact, what is mathematical creation? It does not consist in making new combinations with mathematical entities already known. Any one could do that, but the combinations so made would be infinite in number and most of them absolutely without interest. To create consists precisely in not making useless combinations and in making those which are useful and which are only a small minority. Invention is discernment, choice.

In fact, what is mathematical creation? It’s not just about making new combinations with existing mathematical entities. Anyone could do that, but the combinations made would be infinite and most would be completely irrelevant. Creating is about avoiding pointless combinations and focusing on those that are useful, which are only a small minority. Invention is about discernment and choice.

How to make this choice I have before explained; the mathematical facts worthy of being studied are those which, by their analogy with other facts, are capable of leading us to the knowledge of a mathematical law just as experimental facts lead us to the knowledge of a physical law. They are those which reveal to us unsuspected kinship between other facts, long known, but wrongly believed to be strangers to one another.

How to make this choice I have already explained; the mathematical facts worth studying are those that, by relating to other facts, can help us understand a mathematical law just like experimental facts help us uncover a physical law. They show us unexpected connections between other facts that have long been known but were mistakenly thought to be unrelated.

Among chosen combinations the most fertile will often be those formed of elements drawn from domains which are far apart. Not that I mean as sufficing for invention the bringing together of objects as disparate as possible; most combinations so formed would be entirely sterile. But certain among them, very rare, are the most fruitful of all.

Among selected combinations, the most productive will often be those made up of elements from widely different fields. I’m not suggesting that just mixing together the most unrelated objects guarantees innovation; in fact, most combinations created this way would be totally unproductive. However, a few very rare combinations can turn out to be the most fruitful of all.

To invent, I have said, is to choose; but the word is perhaps not wholly exact. It makes one think of a purchaser before whom are displayed a large number of samples, and who examines them, one after the other, to make a choice. Here the samples would be so numerous that a whole lifetime would not suffice to examine them. This is not the actual state of things. The sterile combinations do not even present themselves to the mind of the inventor. Never in the field of his consciousness do combinations appear that are not really useful, except some that he rejects but which have to some extent the characteristics of useful combinations. All goes on as if the inventor were an examiner for[Pg 387] the second degree who would only have to question the candidates who had passed a previous examination.

To invent, as I've mentioned, is to make a choice; but that term might not be completely accurate. It brings to mind a shopper looking at a wide selection of items and reviewing them one by one to decide. In this case, the options would be so vast that a lifetime wouldn't be enough to consider them all. That's not the reality. The unproductive combinations don't even come to the mind of the inventor. They never appear in his awareness unless they have some qualities of useful combinations, though he tends to dismiss them. It’s as if the inventor is an evaluator of a second round, only needing to assess candidates who have already passed an initial screening.[Pg 387]

But what I have hitherto said is what may be observed or inferred in reading the writings of the geometers, reading reflectively.

But what I've said so far is what you can notice or conclude while reading the works of the geometers, when you read thoughtfully.

It is time to penetrate deeper and to see what goes on in the very soul of the mathematician. For this, I believe, I can do best by recalling memories of my own. But I shall limit myself to telling how I wrote my first memoir on Fuchsian functions. I beg the reader's pardon; I am about to use some technical expressions, but they need not frighten him, for he is not obliged to understand them. I shall say, for example, that I have found the demonstration of such a theorem under such circumstances. This theorem will have a barbarous name, unfamiliar to many, but that is unimportant; what is of interest for the psychologist is not the theorem but the circumstances.

It’s time to dive deeper and explore what really happens in the soul of a mathematician. To do this, I think the best way is to share some of my own memories. However, I’ll focus specifically on how I wrote my first paper on Fuchsian functions. I ask the reader to bear with me; I’ll be using some technical terms, but there's no need to worry about them, as you don’t have to fully understand them. For instance, I might say that I found a proof of a certain theorem under specific conditions. This theorem may have a complicated name that many people are not familiar with, but that’s not what matters; what’s interesting for a psychologist is not the theorem itself, but the context in which it was developed.

For fifteen days I strove to prove that there could not be any functions like those I have since called Fuchsian functions. I was then very ignorant; every day I seated myself at my work table, stayed an hour or two, tried a great number of combinations and reached no results. One evening, contrary to my custom, I drank black coffee and could not sleep. Ideas rose in crowds; I felt them collide until pairs interlocked, so to speak, making a stable combination. By the next morning I had established the existence of a class of Fuchsian functions, those which come from the hypergeometric series; I had only to write out the results, which took but a few hours.

For fifteen days, I struggled to prove that there couldn't be any functions like what I've since called Fuchsian functions. I was really in the dark back then; every day, I'd sit at my work table, spend an hour or two there, try out a bunch of combinations, and get nowhere. One evening, though, unlike my usual routine, I drank black coffee and couldn't sleep. Ideas started flooding in; I felt them bumping into each other until they formed stable combinations. By the next morning, I had established the existence of a class of Fuchsian functions, specifically those derived from the hypergeometric series; all I had to do was write out the results, which took just a few hours.

Then I wanted to represent these functions by the quotient of two series; this idea was perfectly conscious and deliberate, the analogy with elliptic functions guided me. I asked myself what properties these series must have if they existed, and I succeeded without difficulty in forming the series I have called theta-Fuchsian.

Then I wanted to represent these functions as the quotient of two series; this idea was fully intentional and planned, guided by the analogy with elliptic functions. I wondered what properties these series would need to have if they existed, and I was able to easily create the series I’ve named theta-Fuchsian.

Just at this time I left Caen, where I was then living, to go on a geologic excursion under the auspices of the school of mines. The changes of travel made me forget my mathematical work. Having reached Coutances, we entered an omnibus to go some place or other. At the moment when I put my foot on the step[Pg 388] the idea came to me, without anything in my former thoughts seeming to have paved the way for it, that the transformations I had used to define the Fuchsian functions were identical with those of non-Euclidean geometry. I did not verify the idea; I should not have had time, as, upon taking my seat in the omnibus, I went on with a conversation already commenced, but I felt a perfect certainty. On my return to Caen, for conscience' sake I verified the result at my leisure.

Just then, I left Caen, where I was living, to go on a geology trip organized by the school of mines. The changes from traveling made me forget my math work. Once we reached Coutances, we got on a bus to go somewhere. The moment I put my foot on the step[Pg 388], the thought struck me, without any of my previous thoughts leading up to it, that the transformations I had used to define the Fuchsian functions were the same as those in non-Euclidean geometry. I didn't check the idea; I didn’t have the time because, as soon as I sat down on the bus, I jumped into a conversation that had already started, but I felt completely certain. When I got back to Caen, just to clear my conscience, I took the time to verify the result.

Then I turned my attention to the study of some arithmetical questions apparently without much success and without a suspicion of any connection with my preceding researches. Disgusted with my failure, I went to spend a few days at the seaside, and thought of something else. One morning, walking on the bluff, the idea came to me, with just the same characteristics of brevity, suddenness and immediate certainty, that the arithmetic transformations of indeterminate ternary quadratic forms were identical with those of non-Euclidean geometry.

Then I focused on studying some math problems, but it didn’t go well, and I didn’t think it had anything to do with what I was working on before. Frustrated with my failure, I decided to spend a few days at the beach and clear my mind. One morning, while walking on the cliff, the idea struck me with the same clarity, suddenness, and certainty: the arithmetic transformations of indeterminate ternary quadratic forms were the same as those in non-Euclidean geometry.

Returned to Caen, I meditated on this result and deduced the consequences. The example of quadratic forms showed me that there were Fuchsian groups other than those corresponding to the hypergeometric series; I saw that I could apply to them the theory of theta-Fuchsian series and that consequently there existed Fuchsian functions other than those from the hypergeometric series, the ones I then knew. Naturally I set myself to form all these functions. I made a systematic attack upon them and carried all the outworks, one after another. There was one however that still held out, whose fall would involve that of the whole place. But all my efforts only served at first the better to show me the difficulty, which indeed was something. All this work was perfectly conscious.

Returned to Caen, I thought about this outcome and figured out the implications. The example of quadratic forms showed me that there were Fuchsian groups besides those linked to the hypergeometric series; I realized that I could apply the theory of theta-Fuchsian series to them and that, therefore, there were Fuchsian functions other than those from the hypergeometric series that I was familiar with. Naturally, I set out to create all these functions. I systematically tackled them and captured all the outer defenses one by one. However, there was one that still resisted, and its defeat would mean the fall of the entire structure. But all my efforts only highlighted the challenge, which was something in itself. All this work was fully intentional.

Thereupon I left for Mont-Valérien, where I was to go through my military service; so I was very differently occupied. One day, going along the street, the solution of the difficulty which had stopped me suddenly appeared to me. I did not try to go deep into it immediately, and only after my service did I again take up the question. I had all the elements and had only to arrange them and put them together. So I wrote out my final memoir at a single stroke and without difficulty.[Pg 389]

I then left for Mont-Valérien, where I was going to do my military service; my focus was completely different. One day, while walking down the street, the solution to the problem that had been bothering me suddenly came to me. I didn’t try to dive into it right away, and it was only after my service was over that I revisited the question. I had all the pieces and just needed to organize and put them together. So I wrote my final memoir in one go and without any trouble.[Pg 389]

I shall limit myself to this single example; it is useless to multiply them. In regard to my other researches I would have to say analogous things, and the observations of other mathematicians given in L'enseignement mathématique would only confirm them.

I’ll stick to this one example; there’s no need to increase the number. About my other research, I’d have to say similar things, and the observations from other mathematicians found in L'enseignement mathématique would just confirm that.

Most striking at first is this appearance of sudden illumination, a manifest sign of long, unconscious prior work. The rôle of this unconscious work in mathematical invention appears to me incontestable, and traces of it would be found in other cases where it is less evident. Often when one works at a hard question, nothing good is accomplished at the first attack. Then one takes a rest, longer or shorter, and sits down anew to the work. During the first half-hour, as before, nothing is found, and then all of a sudden the decisive idea presents itself to the mind. It might be said that the conscious work has been more fruitful because it has been interrupted and the rest has given back to the mind its force and freshness. But it is more probable that this rest has been filled out with unconscious work and that the result of this work has afterward revealed itself to the geometer just as in the cases I have cited; only the revelation, instead of coming during a walk or a journey, has happened during a period of conscious work, but independently of this work which plays at most a rôle of excitant, as if it were the goad stimulating the results already reached during rest, but remaining unconscious, to assume the conscious form.

Most striking at first is this sudden burst of insight, a clear sign of extensive, unconscious preparation. The role of this unconscious effort in mathematical creativity seems undeniable, and evidence of it can be found in other situations where it’s less obvious. Often, when tackling a difficult problem, nothing productive results from the initial attempts. Then, after taking a break—whether long or short—you return to the task. In the first half-hour, like before, you find nothing, and then suddenly, a crucial idea pops into your mind. One might argue that the conscious effort has been more successful because it was interrupted, allowing the mind to regain its energy and clarity. However, it’s more likely that this break has been filled with unconscious processing, and that this work later manifests to the thinker just as in the examples I mentioned; only this revelation has occurred during a period of conscious effort, but independently from this work, which at most acts as a prompt, like a nudge that stimulates the results already achieved during rest, yet still unconscious, to become clear and deliberate.

There is another remark to be made about the conditions of this unconscious work: it is possible, and of a certainty it is only fruitful, if it is on the one hand preceded and on the other hand followed by a period of conscious work. These sudden inspirations (and the examples already cited sufficiently prove this) never happen except after some days of voluntary effort which has appeared absolutely fruitless and whence nothing good seems to have come, where the way taken seems totally astray. These efforts then have not been as sterile as one thinks; they have set agoing the unconscious machine and without them it would not have moved and would have produced nothing.

There’s another point to mention about the conditions of this unconscious work: it can only be effective if it's preceded and followed by a period of conscious effort. These sudden bursts of inspiration (and the examples already given clearly show this) only occur after several days of deliberate work that seems completely unproductive, where it feels like no progress has been made at all. These efforts haven’t been as useless as one might think; they’ve activated the unconscious process, and without them, it wouldn’t have started and wouldn’t have produced anything.

The need for the second period of conscious work, after the inspiration, is still easier to understand. It is necessary to put[Pg 390] in shape the results of this inspiration, to deduce from them the immediate consequences, to arrange them, to word the demonstrations, but above all is verification necessary. I have spoken of the feeling of absolute certitude accompanying the inspiration; in the cases cited this feeling was no deceiver, nor is it usually. But do not think this a rule without exception; often this feeling deceives us without being any the less vivid, and we only find it out when we seek to put on foot the demonstration. I have especially noticed this fact in regard to ideas coming to me in the morning or evening in bed while in a semi-hypnagogic state.

The need for a second phase of focused work after inspiration is pretty straightforward. It's important to organize the results of this inspiration, figure out the immediate outcomes, arrange them, articulate the arguments, but most importantly, verification is crucial. I mentioned the feeling of complete certainty that comes with inspiration; in the examples I discussed, that feeling was accurate, and it usually is. However, don't assume this is a rule without exceptions; often, this feeling can mislead us, even if it's still strong, and we only realize this when we try to present the argument. I've particularly noticed this with ideas that come to me in the morning or evening while I'm in a semi-dreamlike state.

Such are the realities; now for the thoughts they force upon us. The unconscious, or, as we say, the subliminal self plays an important rôle in mathematical creation; this follows from what we have said. But usually the subliminal self is considered as purely automatic. Now we have seen that mathematical work is not simply mechanical, that it could not be done by a machine, however perfect. It is not merely a question of applying rules, of making the most combinations possible according to certain fixed laws. The combinations so obtained would be exceedingly numerous, useless and cumbersome. The true work of the inventor consists in choosing among these combinations so as to eliminate the useless ones or rather to avoid the trouble of making them, and the rules which must guide this choice are extremely fine and delicate. It is almost impossible to state them precisely; they are felt rather than formulated. Under these conditions, how imagine a sieve capable of applying them mechanically?

Such are the realities; now for the thoughts they force upon us. The unconscious, or what we call the subliminal self, plays a significant role in mathematical creation; this follows from what we've discussed. But usually, the subliminal self is seen as purely automatic. Now we have observed that mathematical work isn’t just mechanical; it couldn’t be done by a machine, no matter how perfect. It’s not just about applying rules or generating as many combinations as possible according to specific fixed laws. The combinations produced would be extremely numerous, useless, and cumbersome. The real work of the inventor lies in selecting from these combinations to eliminate the useless ones or to avoid the effort of creating them in the first place, and the guidelines that must steer this choice are very subtle and intricate. It’s nearly impossible to state them precisely; they’re understood rather than articulated. Under these circumstances, how can we even conceive of a sieve that could apply them mechanically?

A first hypothesis now presents itself: the subliminal self is in no way inferior to the conscious self; it is not purely automatic; it is capable of discernment; it has tact, delicacy; it knows how to choose, to divine. What do I say? It knows better how to divine than the conscious self, since it succeeds where that has failed. In a word, is not the subliminal self superior to the conscious self? You recognize the full importance of this question. Boutroux in a recent lecture has shown how it came up on a very different occasion, and what consequences would follow an affirmative answer. (See also, by the same author, Science et Religion, pp. 313 ff.)

A first hypothesis comes to mind: the subliminal self is not at all inferior to the conscious self; it isn't just automatic; it can make distinctions; it has sensitivity and finesse; it knows how to choose and to intuit. What am I saying? It can intuit better than the conscious self since it succeeds where the conscious self has failed. In short, isn't the subliminal self superior to the conscious self? You understand the significance of this question. Boutroux, in a recent lecture, demonstrated how it emerged in a completely different context and what implications would arise from a positive answer. (See also, by the same author, Science et Religion, pp. 313 ff.)

Is this affirmative answer forced upon us by the facts I have[Pg 391] just given? I confess that, for my part, I should hate to accept it. Reexamine the facts then and see if they are not compatible with another explanation.

Is this positive answer forced on us by the facts I have[Pg 391] just presented? Honestly, I would really dislike having to accept it. Let's take another look at the facts and see if they could support a different explanation.

It is certain that the combinations which present themselves to the mind in a sort of sudden illumination, after an unconscious working somewhat prolonged, are generally useful and fertile combinations, which seem the result of a first impression. Does it follow that the subliminal self, having divined by a delicate intuition that these combinations would be useful, has formed only these, or has it rather formed many others which were lacking in interest and have remained unconscious?

It’s clear that the ideas that come to mind in a flash, after a period of subconscious effort, are usually valuable and productive combinations that seem to stem from a first impression. Does this mean that the subconscious self, having sensed through a subtle intuition that these combinations would be useful, has only created these, or has it created many others that were less interesting and have stayed hidden?

In this second way of looking at it, all the combinations would be formed in consequence of the automatism of the subliminal self, but only the interesting ones would break into the domain of consciousness. And this is still very mysterious. What is the cause that, among the thousand products of our unconscious activity, some are called to pass the threshold, while others remain below? Is it a simple chance which confers this privilege? Evidently not; among all the stimuli of our senses, for example, only the most intense fix our attention, unless it has been drawn to them by other causes. More generally the privileged unconscious phenomena, those susceptible of becoming conscious, are those which, directly or indirectly, affect most profoundly our emotional sensibility.

In this second perspective, all combinations would arise from the automatism of the subconscious self, but only the interesting ones would enter our conscious awareness. And this remains quite mysterious. What determines that, among the countless outcomes of our unconscious activity, some are chosen to cross into consciousness while others stay hidden? Is it just sheer luck that gives this privilege? Clearly not; for instance, among all the stimuli our senses encounter, only the most intense capture our attention, unless something else has drawn us to them. More generally, the privileged unconscious phenomena, those that can become conscious, are the ones that directly or indirectly impact our emotional sensitivity the most.

It may be surprising to see emotional sensibility invoked à propos of mathematical demonstrations which, it would seem, can interest only the intellect. This would be to forget the feeling of mathematical beauty, of the harmony of numbers and forms, of geometric elegance. This is a true esthetic feeling that all real mathematicians know, and surely it belongs to emotional sensibility.

It might be surprising to see emotional sensitivity referenced à propos of mathematical proofs, which seem like they would only engage the intellect. This overlooks the feeling of mathematical beauty, the harmony of numbers and shapes, and the elegance of geometry. This is a genuine aesthetic appreciation that all real mathematicians understand, and it definitely relates to emotional sensitivity.

Now, what are the mathematic entities to which we attribute this character of beauty and elegance, and which are capable of developing in us a sort of esthetic emotion? They are those whose elements are harmoniously disposed so that the mind without effort can embrace their totality while realizing the details. This harmony is at once a satisfaction of our esthetic needs and an aid to the mind, sustaining and guiding; And at the same[Pg 392] time, in putting under our eyes a well-ordered whole, it makes us foresee a mathematical law. Now, as we have said above, the only mathematical facts worthy of fixing our attention and capable of being useful are those which can teach us a mathematical law. So that we reach the following conclusion: The useful combinations are precisely the most beautiful, I mean those best able to charm this special sensibility that all mathematicians know, but of which the profane are so ignorant as often to be tempted to smile at it.

Now, what are the mathematical entities that we see as beautiful and elegant, and that can evoke a kind of aesthetic emotion in us? They are those whose components are organized in such a way that the mind can grasp the whole effortlessly while appreciating the details. This harmony not only satisfies our aesthetic needs but also supports and guides the mind; and at the same time, by presenting us with a well-structured whole, it allows us to anticipate a mathematical law. As we mentioned earlier, the only mathematical facts that deserve our attention and can be useful are those that can teach us a mathematical law. Therefore, we arrive at the following conclusion: the most useful combinations are also the most beautiful, meaning those that best appeal to this unique sensitivity familiar to all mathematicians, but which the uninitiated often overlook, sometimes even finding it amusing.

What happens then? Among the great numbers of combinations blindly formed by the subliminal self, almost all are without interest and without utility; but just for that reason they are also without effect upon the esthetic sensibility. Consciousness will never know them; only certain ones are harmonious, and, consequently, at once useful and beautiful. They will be capable of touching this special sensibility of the geometer of which I have just spoken, and which, once aroused, will call our attention to them, and thus give them occasion to become conscious.

What happens next? Among the countless combinations created unknowingly by the subconscious mind, nearly all are unremarkable and useless; but because of that, they also don't impact our aesthetic sensitivity. Consciousness will never recognize them; only a few are harmonious and therefore both useful and beautiful. These combinations can resonate with that unique sensitivity of the geometer I just mentioned, and once it's stimulated, it will draw our attention to them, giving them a chance to come into our consciousness.

This is only a hypothesis, and yet here is an observation which may confirm it: when a sudden illumination seizes upon the mind of the mathematician, it usually happens that it does not deceive him, but it also sometimes happens, as I have said, that it does not stand the test of verification; well, we almost always notice that this false idea, had it been true, would have gratified our natural feeling for mathematical elegance.

This is just a theory, but here’s an observation that might support it: when a sudden insight strikes a mathematician, it usually proves accurate, though it sometimes doesn’t hold up under scrutiny, as I mentioned. We often see that this incorrect idea, if it were true, would have satisfied our natural appreciation for mathematical elegance.

Thus it is this special esthetic sensibility which plays the rôle of the delicate sieve of which I spoke, and that sufficiently explains why the one lacking it will never be a real creator.

Thus, it is this unique aesthetic sensitivity that acts as the delicate filter I mentioned, and that clearly explains why someone who lacks it will never be a true creator.

Yet all the difficulties have not disappeared. The conscious self is narrowly limited, and as for the subliminal self we know not its limitations, and this is why we are not too reluctant in supposing that it has been able in a short time to make more different combinations than the whole life of a conscious being could encompass. Yet these limitations exist. Is it likely that it is able to form all the possible combinations, whose number would frighten the imagination? Nevertheless that would seem necessary, because if it produces only a small part of these combinations, and if it makes them at random, there would be small[Pg 393] chance that the good, the one we should choose, would be found among them.

Yet all the difficulties haven't vanished. The conscious self is pretty limited, and as for the subconscious self, we don't know its limitations, which is why we don't hesitate to think that it might have created more combinations in a short time than an entire conscious life could ever cover. Still, these limitations exist. Is it really possible for it to create all the possible combinations, a number that would boggle the mind? Yet that seems necessary because if it only generates a small portion of these combinations and does so randomly, there's little chance that the good, the one we should choose, would be among them.

Perhaps we ought to seek the explanation in that preliminary period of conscious work which always precedes all fruitful unconscious labor. Permit me a rough comparison. Figure the future elements of our combinations as something like the hooked atoms of Epicurus. During the complete repose of the mind, these atoms are motionless, they are, so to speak, hooked to the wall; so this complete rest may be indefinitely prolonged without the atoms meeting, and consequently without any combination between them.

Perhaps we should look for the explanation in that initial stage of conscious effort that always comes before any productive unconscious work. Let me make a rough comparison. Imagine the future components of our combinations as resembling the hooked atoms of Epicurus. When the mind is completely at rest, these atoms are still; they are, in a way, stuck to the wall. This total stillness can go on for an extended time without the atoms coming together and, as a result, without any combination occurring between them.

On the other hand, during a period of apparent rest and unconscious work, certain of them are detached from the wall and put in motion. They flash in every direction through the space (I was about to say the room) where they are enclosed, as would, for example, a swarm of gnats or, if you prefer a more learned comparison, like the molecules of gas in the kinematic theory of gases. Then their mutual impacts may produce new combinations.

On the other hand, during a time of seeming rest and unconscious activity, some of them break away from the wall and start moving. They dart in every direction through the space (I was going to say the room) where they are contained, similar to how a swarm of gnats behaves or, if you want a more scientific comparison, like the molecules of gas in the kinetic theory of gases. Their interactions can then create new combinations.

What is the rôle of the preliminary conscious work? It is evidently to mobilize certain of these atoms, to unhook them from the wall and put them in swing. We think we have done no good, because we have moved these elements a thousand different ways in seeking to assemble them, and have found no satisfactory aggregate. But, after this shaking up imposed upon them by our will, these atoms do not return to their primitive rest. They freely continue their dance.

What is the role of the initial conscious effort? It clearly aims to activate some of these atoms, to detach them from the wall and set them in motion. We believe we haven't accomplished anything because we've rearranged these elements in a thousand different ways in our attempt to put them together, and we haven't achieved a satisfactory combination. However, after this disruption we've caused through our intention, these atoms don't just go back to their original state. They continue to move freely in their dance.

Now, our will did not choose them at random; it pursued a perfectly determined aim. The mobilized atoms are therefore not any atoms whatsoever; they are those from which we might reasonably expect the desired solution. Then the mobilized atoms undergo impacts which make them enter into combinations among themselves or with other atoms at rest which they struck against in their course. Again I beg pardon, my comparison is very rough, but I scarcely know how otherwise to make my thought understood.

Now, our will didn’t choose them randomly; it pursued a clear goal. The mobilized atoms aren't just any atoms; they are the ones from which we can reasonably expect the desired solution. Then the mobilized atoms collide, causing them to combine with each other or with other stationary atoms they encounter along the way. I apologize; my comparison is quite basic, but I don't really know how else to convey my thoughts.

However it may be, the only combinations that have a chance of forming are those where at least one of the elements is one of those atoms freely chosen by our will. Now, it is evidently[Pg 394] among these that is found what I called the good combination. Perhaps this is a way of lessening the paradoxical in the original hypothesis.

However it may be, the only combinations that could potentially form are those where at least one of the elements is one of the atoms we freely choose. Now, it is obviously[Pg 394] among these that we find what I referred to as the good combination. Perhaps this is a way to reduce the paradox in the original hypothesis.

Another observation. It never happens that the unconscious work gives us the result of a somewhat long calculation all made, where we have only to apply fixed rules. We might think the wholly automatic subliminal self particularly apt for this sort of work, which is in a way exclusively mechanical. It seems that thinking in the evening upon the factors of a multiplication we might hope to find the product ready made upon our awakening, or again that an algebraic calculation, for example a verification, would be made unconsciously. Nothing of the sort, as observation proves. All one may hope from these inspirations, fruits of unconscious work, is a point of departure for such calculations. As for the calculations themselves, they must be made in the second period of conscious work, that which follows the inspiration, that in which one verifies the results of this inspiration and deduces their consequences. The rules of these calculations are strict and complicated. They require discipline, attention, will, and therefore consciousness. In the subliminal self, on the contrary, reigns what I should call liberty, if we might give this name to the simple absence of discipline and to the disorder born of chance. Only, this disorder itself permits unexpected combinations.

Another observation. It never happens that the unconscious work gives us the result of a somewhat long calculation all done, where we only need to apply fixed rules. We might think the completely automatic subliminal self is especially suited for this type of work, which is essentially mechanical. It seems that by thinking in the evening about the factors of a multiplication, we might hope to find the product ready when we wake up, or that an algebraic calculation, like a verification, would be done unconsciously. However, that’s not the case, as observation shows. All we can expect from these inspirations, which are the fruits of unconscious work, is a starting point for such calculations. As for the calculations themselves, they must be made during the second phase of conscious work, which follows the inspiration, where one verifies the results of this inspiration and draws their consequences. The rules for these calculations are strict and complex. They require discipline, attention, will, and thus, consciousness. In the subliminal self, on the other hand, what I would call liberty reigns, if we can call it that, referring to the simple absence of discipline and the disorder that arises from chance. But this disorder itself allows for unexpected combinations.

I shall make a last remark: when above I made certain personal observations, I spoke of a night of excitement when I worked in spite of myself. Such cases are frequent, and it is not necessary that the abnormal cerebral activity be caused by a physical excitant as in that I mentioned. It seems, in such cases, that one is present at his own unconscious work, made partially perceptible to the over-excited consciousness, yet without having changed its nature. Then we vaguely comprehend what distinguishes the two mechanisms or, if you wish, the working methods of the two egos. And the psychologic observations I have been able thus to make seem to me to confirm in their general outlines the views I have given.

I want to add one last point: when I mentioned my personal observations above, I talked about a night filled with excitement where I worked despite myself. This happens often, and it's not always that the unusual brain activity is triggered by a physical stimulant, like the one I referenced. In these situations, it feels like you’re witnessing your own unconscious work, which is only partially visible to your overly stimulated mind, yet it hasn’t changed in nature. Then, we get a vague understanding of what sets the two mechanisms apart, or if you prefer, the different working styles of the two selves. The psychological observations I've made in this way seem to support the general ideas I've presented.

Surely they have need of it, for they are and remain in spite of all very hypothetical: the interest of the questions is so great that I do not repent of having submitted them to the reader.

Surely they need it, because they are still very uncertain: the interest in these questions is so high that I don’t regret having brought them to the reader's attention.


CHAPTER IV

Opportunity

I

"How dare we speak of the laws of chance? Is not chance the antithesis of all law?" So says Bertrand at the beginning of his Calcul des probabiltités. Probability is opposed to certitude; so it is what we do not know and consequently it seems what we could not calculate. Here is at least apparently a contradiction, and about it much has already been written.

"How can we talk about the laws of chance? Isn't chance the opposite of all laws?" This is what Bertrand states at the beginning of his Calcul des probabiltités. Probability is against certainty; it deals with what we don’t know and, therefore, seems something we can't measure. This appears to be a contradiction, and a lot has already been written about it.

And first, what is chance? The ancients distinguished between phenomena seemingly obeying harmonious laws, established once for all, and those which they attributed to chance; these were the ones unpredictable because rebellious to all law. In each domain the precise laws did not decide everything, they only drew limits between which chance might act. In this conception the word chance had a precise and objective meaning; what was chance for one was also chance for another and even for the gods.

And first, what is chance? The ancients distinguished between phenomena that seemed to follow harmonious laws, established once and for all, and those they attributed to chance; these were the unpredictable ones that defied all laws. In each domain, the specific laws didn’t determine everything; they only set boundaries within which chance could operate. In this view, the word chance had a clear and objective meaning; what was considered chance for one person was also chance for another and even for the gods.

But this conception is not ours to-day. We have become absolute determinists, and even those who want to reserve the rights of human free will let determinism reign undividedly in the inorganic world at least. Every phenomenon, however minute, has a cause; and a mind infinitely powerful, infinitely well-informed about the laws of nature, could have foreseen it from the beginning of the centuries. If such a mind existed, we could not play with it at any game of chance; we should always lose.

But this idea isn't ours today. We've become complete determinists, and even those who want to keep the notion of human free will still let determinism rule completely in the inorganic world at least. Every phenomenon, no matter how small, has a cause; and a mind that's infinitely powerful and knows all the laws of nature could have seen it coming from the very beginning of time. If such a mind existed, we couldn't gamble against it at any game of chance; we would always lose.

In fact for it the word chance would not have any meaning, or rather there would be no chance. It is because of our weakness and our ignorance that the word has a meaning for us. And, even without going beyond our feeble humanity, what is chance for the ignorant is not chance for the scientist. Chance is only the measure of our ignorance. Fortuitous phenomena are, by definition, those whose laws we do not know.

In reality, the word chance wouldn’t mean anything for it, or rather, there wouldn’t be any chance at all. It’s because of our weakness and ignorance that the word holds meaning for us. Moreover, even if we don’t go beyond our limited humanity, what seems like chance to the uninformed isn’t seen as chance by the scientist. Chance is just a measure of our ignorance. Random phenomena are, by definition, those whose laws we haven't yet discovered.

But is this definition altogether satisfactory? When the first[Pg 396] Chaldean shepherds followed with their eyes the movements of the stars, they knew not as yet the laws of astronomy; would they have dreamed of saying that the stars move at random? If a modern physicist studies a new phenomenon, and if he discovers its law Tuesday, would he have said Monday that this phenomenon was fortuitous? Moreover, do we not often invoke what Bertrand calls the laws of chance, to predict a phenomenon? For example, in the kinetic theory of gases we obtain the known laws of Mariotte and of Gay-Lussac by means of the hypothesis that the velocities of the molecules of gas vary irregularly, that is to say at random. All physicists will agree that the observable laws would be much less simple if the velocities were ruled by any simple elementary law whatsoever, if the molecules were, as we say, organized, if they were subject to some discipline. It is due to chance, that is to say, to our ignorance, that we can draw our conclusions; and then if the word chance is simply synonymous with ignorance what does that mean? Must we therefore translate as follows?

But is this definition completely satisfactory? When the first [Pg 396] Chaldean shepherds watched the movements of the stars, they didn't yet know the laws of astronomy; would they have thought to say that the stars move randomly? If a modern physicist studies a new phenomenon and discovers its law on Tuesday, would he have claimed on Monday that this phenomenon was by chance? Also, don’t we often refer to what Bertrand calls the laws of chance to predict a phenomenon? For instance, in the kinetic theory of gases, we derive the known laws of Mariotte and Gay-Lussac based on the idea that the speeds of gas molecules vary randomly. All physicists would agree that the observable laws would be much less straightforward if the speeds were governed by any simple underlying law, if the molecules were, as we say, organized, if they were subject to some kind of discipline. It is due to chance, meaning our ignorance, that we can make our conclusions; and if the word chance is just another way of saying ignorance, what does that imply? Should we then translate it this way?

"You ask me to predict for you the phenomena about to happen. If, unluckily, I knew the laws of these phenomena I could make the prediction only by inextricable calculations and would have to renounce attempting to answer you; but as I have the good fortune not to know them, I will answer you at once. And what is most surprising, my answer will be right."

"You ask me to predict the events that are about to happen. If, unfortunately, I knew the rules governing these events, I could only make a prediction through complicated calculations and would have to give up trying to answer you; but since I’m lucky enough not to know them, I’ll respond right away. And what’s even more surprising, my answer will be correct."

So it must well be that chance is something other than the name we give our ignorance, that among phenomena whose causes are unknown to us we must distinguish fortuitous phenomena about which the calculus of probabilities will provisionally give information, from those which are not fortuitous and of which we can say nothing so long as we shall not have determined the laws governing them. For the fortuitous phenomena themselves, it is clear that the information given us by the calculus of probabilities will not cease to be true upon the day when these phenomena shall be better known.

So it’s possible that chance is more than just a label for what we don’t understand. Among events whose causes we don't know, we need to differentiate between random events, which we can make provisional predictions about using probability calculations, and those that aren’t random, where we can't say anything until we figure out the rules that govern them. For the random events themselves, it's clear that the insights from probability calculations will remain valid even when we learn more about these events.

The director of a life insurance company does not know when each of the insured will die, but he relies upon the calculus of probabilities and on the law of great numbers, and he is not deceived, since he distributes dividends to his stockholders. These[Pg 397] dividends would not vanish if a very penetrating and very indiscreet physician should, after the policies were signed, reveal to the director the life chances of the insured. This doctor would dissipate the ignorance of the director, but he would have no influence on the dividends, which evidently are not an outcome of this ignorance.

The director of a life insurance company doesn’t know when each insured person will die, but he relies on probability calculations and the law of large numbers, and he’s not misled, since he pays dividends to his shareholders. These[Pg 397] dividends wouldn’t disappear even if a very insightful and overly inquisitive doctor were to, after the policies were signed, inform the director of the life expectancy of the insured. This doctor would clear the director's ignorance, but he wouldn’t affect the dividends, which are clearly not a result of this ignorance.

II

To find a better definition of chance we must examine some of the facts which we agree to regard as fortuitous, and to which the calculus of probabilities seems to apply; we then shall investigate what are their common characteristics.

To find a clearer definition of chance, we need to look at some of the facts that we consider random and to which probability calculations seem to be relevant; then we will explore their common traits.

The first example we select is that of unstable equilibrium; if a cone rests upon its apex, we know well that it will fall, but we do not know toward what side; it seems to us chance alone will decide. If the cone were perfectly symmetric, if its axis were perfectly vertical, if it were acted upon by no force other than gravity, it would not fall at all. But the least defect in symmetry will make it lean slightly toward one side or the other, and if it leans, however little, it will fall altogether toward that side. Even if the symmetry were perfect, a very slight tremor, a breath of air could make it incline some seconds of arc; this will be enough to determine its fall and even the sense of its fall which will be that of the initial inclination.

The first example we choose is unstable equilibrium; if a cone is balanced on its point, we know it will topple, but we can’t predict which direction it will fall. It seems like chance will decide. If the cone were perfectly symmetrical, its axis perfectly vertical, and if gravity were the only force acting on it, it wouldn’t fall at all. However, even the slightest flaw in symmetry will cause it to tilt slightly to one side or the other, and if it tilts even a little, it will completely fall in that direction. Even with perfect symmetry, a tiny shake or a breath of air could cause it to lean just a tiny bit; that will be enough to determine its fall and the direction of its fall, which will match its initial tilt.

A very slight cause, which escapes us, determines a considerable effect which we can not help seeing, and then we say this effect is due to chance. If we could know exactly the laws of nature and the situation of the universe at the initial instant, we should be able to predict exactly the situation of this same universe at a subsequent instant. But even when the natural laws should have no further secret for us, we could know the initial situation only approximately. If that permits us to foresee the subsequent situation with the same degree of approximation, this is all we require, we say the phenomenon has been predicted, that it is ruled by laws. But this is not always the case; it may happen that slight differences in the initial conditions produce very great differences in the final phenomena; a slight error in the former would make an enormous error in the[Pg 398] latter. Prediction becomes impossible and we have the fortuitous phenomenon.

A very small cause, which we might not notice, leads to a significant effect that we can’t ignore, and then we say this effect is just a coincidence. If we could fully understand the laws of nature and the state of the universe at the very beginning, we could predict the state of the universe at a later time accurately. But even if we understood the natural laws completely, we could only know the initial conditions approximately. If that allows us to anticipate the later state with the same level of approximation, that's all we need; we would say the phenomenon has been predicted and that it follows certain laws. However, this isn’t always true; small differences in the initial conditions can lead to very large differences in the final outcomes; a tiny mistake in the beginning can result in a massive error in the[Pg 398]end. Prediction becomes impossible, and we encounter a chance phenomenon.

Our second example will be very analogous to the first and we shall take it from meteorology. Why have the meteorologists such difficulty in predicting the weather with any certainty? Why do the rains, the tempests themselves seem to us to come by chance, so that many persons find it quite natural to pray for rain or shine, when they would think it ridiculous to pray for an eclipse? We see that great perturbations generally happen in regions where the atmosphere is in unstable equilibrium. The meteorologists are aware that this equilibrium is unstable, that a cyclone is arising somewhere; but where they can not tell; one-tenth of a degree more or less at any point, and the cyclone bursts here and not there, and spreads its ravages over countries it would have spared. This we could have foreseen if we had known that tenth of a degree, but the observations were neither sufficiently close nor sufficiently precise, and for this reason all seems due to the agency of chance. Here again we find the same contrast between a very slight cause, unappreciable to the observer, and important effects, which are sometimes tremendous disasters.

Our second example will be quite similar to the first, and we’ll take it from meteorology. Why do meteorologists struggle so much to predict the weather with any certainty? Why do the rains and storms seem to happen randomly, so much so that many people find it normal to pray for rain or sunshine, while they would think it absurd to pray for an eclipse? We observe that significant disturbances usually occur in areas where the atmosphere is in unstable equilibrium. Meteorologists know that this equilibrium is unstable and that a cyclone is forming somewhere; however, they can’t pinpoint its location. Just one-tenth of a degree more or less at any point, and the cyclone could hit here instead of there, wreaking havoc on regions it might have otherwise spared. We could have predicted this if we had known that one-tenth of a degree, but the observations were neither sufficiently close nor precise. For this reason, everything seems left to chance. Once again, we see the same contrast between a very minor cause, unnoticed by the observer, and significant effects, which can sometimes lead to major disasters.

Let us pass to another example, the distribution of the minor planets on the zodiac. Their initial longitudes may have been any longitudes whatever; but their mean motions were different and they have revolved for so long a time that we may say they are now distributed at random along the zodiac. Very slight initial differences between their distances from the sun, or, what comes to the same thing, between their mean motions, have ended by giving enormous differences between their present longitudes. An excess of the thousandth of a second in the daily mean motion will give in fact a second in three years, a degree in ten thousand years, an entire circumference in three or four million years, and what is that to the time which has passed since the minor planets detached themselves from the nebula of Laplace? Again therefore we see a slight cause and a great effect; or better, slight differences in the cause and great differences in the effect.

Let’s look at another example, the distribution of the minor planets along the zodiac. Their starting longitudes could have been anything, but their average speeds were different, and they've been orbiting for such a long time that we can say they are now distributed at random along the zodiac. Very small initial differences in their distances from the sun, or, equivalently, in their average speeds, have led to huge differences in their current longitudes. Just a thousandth of a second more in the daily average speed will result in a second over three years, a degree over ten thousand years, and a complete orbit in three or four million years. And what does that mean compared to the time that has passed since the minor planets separated from Laplace's nebula? Once again, we see a small cause leading to a large effect; or rather, slight differences in the cause resulting in significant differences in the effect.

The game of roulette does not take us as far as might seem[Pg 399] from the preceding example. Assume a needle to be turned on a pivot over a dial divided into a hundred sectors alternately red and black. If it stops on a red sector I win; if not, I lose. Evidently all depends upon the initial impulse I give the needle. The needle will make, suppose, ten or twenty turns, but it will stop sooner or not so soon, according as I shall have pushed it more or less strongly. It suffices that the impulse vary only by a thousandth or a two thousandth to make the needle stop over a black sector or over the following red one. These are differences the muscular sense can not distinguish and which elude even the most delicate instruments. So it is impossible for me to foresee what the needle I have started will do, and this is why my heart throbs and I hope everything from luck. The difference in the cause is imperceptible, and the difference in the effect is for me of the highest importance, since it means my whole stake.

The game of roulette doesn’t take us as far as you might think from the previous example. Imagine a needle spinning on a pivot over a dial split into a hundred sectors that alternate between red and black. If it lands on a red sector, I win; if not, I lose. Clearly, everything depends on the initial push I give the needle. The needle might spin, let’s say, ten or twenty times, but it will stop sooner or later based on how hard I push it. It only takes a variation of a thousandth or two thousandths for the needle to land on a black sector or the next red one. These are differences that the muscular sense can’t detect, and even the most sensitive instruments can miss them. So, I can’t predict what the needle I’ve set in motion will do, which is why my heart races and I put my hopes in luck. The difference in the cause is undetectable, but the difference in the effect is crucial for me, as it determines my entire stake.

III

Permit me, in this connection, a thought somewhat foreign to my subject. Some years ago a philosopher said that the future is determined by the past, but not the past by the future; or, in other words, from knowledge of the present we could deduce the future, but not the past; because, said he, a cause can have only one effect, while the same effect might be produced by several different causes. It is clear no scientist can subscribe to this conclusion. The laws of nature bind the antecedent to the consequent in such a way that the antecedent is as well determined by the consequent as the consequent by the antecedent. But whence came the error of this philosopher? We know that in virtue of Carnot's principle physical phenomena are irreversible and the world tends toward uniformity. When two bodies of different temperature come in contact, the warmer gives up heat to the colder; so we may foresee that the temperature will equalize. But once equal, if asked about the anterior state, what can we answer? We might say that one was warm and the other cold, but not be able to divine which formerly was the warmer.

Let me share a thought that might seem a bit off-topic. A few years back, a philosopher claimed that the future is shaped by the past, but not the other way around. In other words, we can predict the future from what we know now, but we can't determine the past from the future. He argued that a cause can lead to only one effect, while the same effect can result from many different causes. Clearly, no scientist can agree with this idea. The laws of nature connect the cause and effect in such a way that the cause is just as influenced by the effect as the effect is by the cause. So where did this philosopher go wrong? We understand that, according to Carnot's principle, physical processes are irreversible, and the universe moves toward uniformity. When two objects at different temperatures touch, the warmer one transfers heat to the cooler one, which allows us to predict that their temperatures will balance out. But once they are equal, if we’re asked about their previous states, what can we say? We might state that one was warm and the other was cold, but we wouldn’t be able to tell which one was warmer before.

And yet in reality the temperatures will never reach perfect equality. The difference of the temperatures only tends asymptotically toward zero. There comes a moment when our[Pg 400] thermometers are powerless to make it known. But if we had thermometers a thousand times, a hundred thousand times as sensitive, we should recognize that there still is a slight difference, and that one of the bodies remains a little warmer than the other, and so we could say this it is which formerly was much the warmer.

And yet in reality, the temperatures will never be perfectly equal. The difference between the temperatures only approaches zero gradually. There comes a point when our[Pg 400] thermometers can't detect it anymore. But if we had thermometers a thousand times, or even a hundred thousand times, more sensitive, we would see that there is still a slight difference and that one of the bodies remains a little warmer than the other. Thus, we could say that this one was the much warmer one before.

So then there are, contrary to what we found in the former examples, great differences in cause and slight differences in effect. Flammarion once imagined an observer going away from the earth with a velocity greater than that of light; for him time would have changed sign. History would be turned about, and Waterloo would precede Austerlitz. Well, for this observer, effects and causes would be inverted; unstable equilibrium would no longer be the exception. Because of the universal irreversibility, all would seem to him to come out of a sort of chaos in unstable equilibrium. All nature would appear to him delivered over to chance.

So, unlike what we saw in the earlier examples, there are significant differences in cause and minor differences in effect. Flammarion once imagined a traveler leaving Earth at a speed faster than light; for this person, time would flip. History would be reversed, with Waterloo happening before Austerlitz. For this observer, effects and causes would be flipped; unstable equilibrium would no longer be the norm. Due to universal irreversibility, everything would seem to emerge from a sort of chaos in unstable balance. All of nature would seem to be governed by randomness.

IV

Now for other examples where we shall see somewhat different characteristics. Take first the kinetic theory of gases. How should we picture a receptacle filled with gas? Innumerable molecules, moving at high speeds, flash through this receptacle in every direction. At every instant they strike against its walls or each other, and these collisions happen under the most diverse conditions. What above all impresses us here is not the littleness of the causes, but their complexity, and yet the former element is still found here and plays an important rôle. If a molecule deviated right or left from its trajectory, by a very small quantity, comparable to the radius of action of the gaseous molecules, it would avoid a collision or sustain it under different conditions, and that would vary the direction of its velocity after the impact, perhaps by ninety degrees or by a hundred and eighty degrees.

Now for some other examples where we’ll see somewhat different characteristics. Let’s start with the kinetic theory of gases. How should we envision a container filled with gas? Countless molecules, moving at high speeds, zip through this container in every direction. Every moment, they hit the walls or collide with each other, and these collisions occur under all kinds of conditions. What stands out to us here is not how small the causes are, but their complexity, and yet the former aspect is still present and plays an important role. If a molecule veers slightly to the right or left from its path, by a tiny amount comparable to the range of the gaseous molecules, it could dodge a collision or experience it differently, which would change the direction of its velocity after the impact, perhaps by ninety degrees or even one hundred eighty degrees.

And this is not all; we have just seen that it is necessary to deflect the molecule before the clash by only an infinitesimal, to produce its deviation after the collision by a finite quantity. If then the molecule undergoes two successive shocks, it will suffice to deflect it before the first by an infinitesimal of the second order, for it to deviate after the first encounter by an infinitesimal[Pg 401] of the first order, and after the second hit, by a finite quantity. And the molecule will not undergo merely two shocks; it will undergo a very great number per second. So that if the first shock has multiplied the deviation by a very large number A, after n shocks it will be multiplied by An. It will therefore become very great not merely because A is large, that is to say because little causes produce big effects, but because the exponent n is large, that is to say because the shocks are very numerous and the causes very complex.

And that's not all; we just saw that it's necessary to deflect the molecule before the impact by just a tiny amount to cause its deviation after the collision by a larger amount. If the molecule experiences two successive impacts, it only needs to be deflected a tiny bit before the first one to deviate after the first collision by a small amount, and after the second impact, by a larger amount. And the molecule won’t just experience two impacts; it will actually undergo a huge number each second. So if the first impact increases the deviation by a large number A, after n collisions it will be multiplied by An. This means it will become significantly large not just because A is large—which means small causes lead to big effects—but also because n is large, meaning the impacts are numerous and the causes are very complex.

Take a second example. Why do the drops of rain in a shower seem to be distributed at random? This is again because of the complexity of the causes which determine their formation. Ions are distributed in the atmosphere. For a long while they have been subjected to air-currents constantly changing, they have been caught in very small whirlwinds, so that their final distribution has no longer any relation to their initial distribution. Suddenly the temperature falls, vapor condenses, and each of these ions becomes the center of a drop of rain. To know what will be the distribution of these drops and how many will fall on each paving-stone, it would not be sufficient to know the initial situation of the ions, it would be necessary to compute the effect of a thousand little capricious air-currents.

Take a second example. Why do the drops of rain in a shower seem to be scattered randomly? This is again due to the complexity of the factors that determine their formation. Ions are spread throughout the atmosphere. For a long time, they've been affected by constantly shifting air currents, getting caught in tiny whirlwinds, so their final distribution no longer relates to their initial arrangement. Suddenly, the temperature drops, vapor condenses, and each of these ions becomes the center of a raindrop. To predict the distribution of these drops and how many will land on each paving stone, it wouldn’t be enough to know the initial position of the ions; you would need to calculate the impact of countless unpredictable air currents.

And again it is the same if we put grains of powder in suspension in water. The vase is ploughed by currents whose law we know not, we only know it is very complicated. At the end of a certain time the grains will be distributed at random, that is to say uniformly, in the vase; and this is due precisely to the complexity of these currents. If they obeyed some simple law, if for example the vase revolved and the currents circulated around the axis of the vase, describing circles, it would no longer be the same, since each grain would retain its initial altitude and its initial distance from the axis.

And it's the same when we mix powder grains in water. The vase is stirred by currents that we don't fully understand; we just know they're very complicated. After a while, the grains will spread out randomly, which means evenly, in the vase; and this is exactly because of the complexity of those currents. If they followed a simple pattern, like if the vase spun and the currents moved in circles around its axis, it would be different, as each grain would keep its original height and distance from the axis.

We should reach the same result in considering the mixing of two liquids or of two fine-grained powders. And to take a grosser example, this is also what happens when we shuffle playing-cards. At each stroke the cards undergo a permutation (analogous to that studied in the theory of substitutions). What will happen? The probability of a particular permutation (for[Pg 402] example, that bringing to the nth place the card occupying the ϕ(n)th place before the permutation) depends upon the player's habits. But if this player shuffles the cards long enough, there will be a great number of successive permutations, and the resulting final order will no longer be governed by aught but chance; I mean to say that all possible orders will be equally probable. It is to the great number of successive permutations, that is to say to the complexity of the phenomenon, that this result is due.

We can come to the same conclusion when looking at the mixing of two liquids or two fine powders. A simpler example is what happens when we shuffle playing cards. With each shuffle, the cards change position (similar to what’s studied in substitution theory). What will happen next? The probability of a specific arrangement (for example, getting the card that was in the ϕ(n) position to now occupy the nth position) depends on the player's habits. However, if the player shuffles long enough, there will be many different arrangements, and the final order will be determined solely by chance; this means all possible orders will have the same likelihood. This outcome is a result of the numerous shuffles, highlighting the complexity of the situation.

A final word about the theory of errors. Here it is that the causes are complex and multiple. To how many snares is not the observer exposed, even with the best instrument! He should apply himself to finding out the largest and avoiding them. These are the ones giving birth to systematic errors. But when he has eliminated those, admitting that he succeeds, there remain many small ones which, their effects accumulating, may become dangerous. Thence come the accidental errors; and we attribute them to chance because their causes are too complicated and too numerous. Here again we have only little causes, but each of them would produce only a slight effect; it is by their union and their number that their effects become formidable.

A final word about the theory of errors. Here, the causes are complex and varied. The observer faces many pitfalls, even with the best tools! They should focus on identifying and avoiding the larger issues first. These are the ones that create systematic errors. However, even after eliminating those, assuming they succeed, there are still many minor errors left that, when combined, can become significant. This is where accidental errors come in; we attribute them to chance because their causes are too complicated and numerous. Again, we have only small causes, but each causes just a minor effect; it is the combination and sheer number of them that make their overall impact alarming.

V

We may take still a third point of view, less important than the first two and upon which I shall lay less stress. When we seek to foresee an event and examine its antecedents, we strive to search into the anterior situation. This could not be done for all parts of the universe and we are content to know what is passing in the neighborhood of the point where the event should occur, or what would appear to have some relation to it. An examination can not be complete and we must know how to choose. But it may happen that we have passed by circumstances which at first sight seemed completely foreign to the foreseen happening, to which one would never have dreamed of attributing any influence and which nevertheless, contrary to all anticipation, come to play an important rôle.

We can also consider a third perspective, which is less significant than the first two and I won’t focus on as much. When we try to predict an event and look at its background, we aim to explore the previous situation. We can’t do this for every part of the universe, so we’re satisfied with understanding what’s happening near the point where the event is supposed to take place, or anything that seems related to it. Our analysis can’t be exhaustive, and we have to be selective. However, it’s possible that we overlook factors that initially seem completely unrelated to the anticipated event—factors we wouldn’t have thought could have any impact—but, surprisingly, end up playing a major role.

A man passes in the street going to his business; some one knowing the business could have told why he started at such a[Pg 403] time and went by such a street. On the roof works a tiler. The contractor employing him could in a certain measure foresee what he would do. But the passer-by scarcely thinks of the tiler, nor the tiler of him; they seem to belong to two worlds completely foreign to one another. And yet the tiler drops a tile which kills the man, and we do not hesitate to say this is chance.

A man walks down the street heading to work; someone familiar with his job could explain why he left at such a time and took this route. On the roof, a tiler is at work. The contractor who hired him can somewhat predict his actions. But the passerby hardly notices the tiler, nor does the tiler think about him; they seem to come from two completely different worlds. Yet, when the tiler drops a tile that kills the man, we readily call it chance.

Our weakness forbids our considering the entire universe and makes us cut it up into slices. We try to do this as little artificially as possible. And yet it happens from time to time that two of these slices react upon each other. The effects of this mutual action then seem to us to be due to chance.

Our weakness prevents us from seeing the whole universe and forces us to break it into parts. We try to do this in the least artificial way we can. Still, sometimes two of these parts interact with each other. The results of this interaction then seem to us to be random.

Is this a third way of conceiving chance? Not always; in fact most often we are carried back to the first or the second. Whenever two worlds usually foreign to one another come thus to react upon each other, the laws of this reaction must be very complex. On the other hand, a very slight change in the initial conditions of these two worlds would have been sufficient for the reaction not to have happened. How little was needed for the man to pass a second later or the tiler to drop his tile a second sooner.

Is this a different way of thinking about chance? Not always; in fact, most of the time we find ourselves going back to the first or second idea. Whenever two worlds that usually don’t interact with each other come together, the rules of their interaction must be quite complicated. On the other hand, a tiny change in the starting conditions of these two worlds could have been enough to prevent the interaction from occurring altogether. It took so little for the man to walk by a second later or for the tiler to drop his tile a second sooner.

VI

All we have said still does not explain why chance obeys laws. Does the fact that the causes are slight or complex suffice for our foreseeing, if not their effects in each case, at least what their effects will be, on the average? To answer this question we had better take up again some of the examples already cited.

All we've discussed still doesn't explain why chance follows rules. Does the fact that the causes are either minor or complicated allow us to predict, if not their effects in each case, at least what their effects will be on average? To answer this question, it’s best to revisit some of the examples we've already mentioned.

I shall begin with that of the roulette. I have said that the point where the needle will stop depends upon the initial push given it. What is the probability of this push having this or that value? I know nothing about it, but it is difficult for me not to suppose that this probability is represented by a continuous analytic function. The probability that the push is comprised between α and α + ε will then be sensibly equal to the probability of its being comprised between α + ε and α + 2ε, provided ε be very small. This is a property common to all analytic functions. Minute variations of the function are proportional to minute variations of the variable.[Pg 404]

I’ll start with the roulette. I mentioned that where the needle stops depends on the initial push given to it. What’s the likelihood of this push having a certain value? I don’t really know, but it’s hard for me not to think that this probability can be represented by a smooth, analytical function. The probability that the push falls between α and α + ε will be roughly equal to the probability of it falling between α + ε and α + 2ε, as long as ε is very small. This is a property shared by all analytical functions. Tiny changes in the function are proportional to tiny changes in the variable.[Pg 404]

But we have assumed that an exceedingly slight variation of the push suffices to change the color of the sector over which the needle finally stops. From α to α + ε it is red, from α + ε to α + 2ε it is black; the probability of each red sector is therefore the same as of the following black, and consequently the total probability of red equals the total probability of black.

But we have assumed that a very slight change in the push is enough to alter the color of the section where the needle eventually rests. From α to α + ε it is red, and from α + ε to α + 2ε it is black; the likelihood of each red section is therefore the same as that of the next black one, meaning the overall probability of red is equal to the overall probability of black.

The datum of the question is the analytic function representing the probability of a particular initial push. But the theorem remains true whatever be this datum, since it depends upon a property common to all analytic functions. From this it follows finally that we no longer need the datum.

The basis of the question is the analytic function that shows the likelihood of a specific initial push. However, the theorem holds true regardless of this basis, as it relies on a characteristic shared by all analytic functions. Therefore, it's clear that we no longer require the basis.

What we have just said for the case of the roulette applies also to the example of the minor planets. The zodiac may be regarded as an immense roulette on which have been tossed many little balls with different initial impulses varying according to some law. Their present distribution is uniform and independent of this law, for the same reason as in the preceding case. Thus we see why phenomena obey the laws of chance when slight differences in the causes suffice to bring on great differences in the effects. The probabilities of these slight differences may then be regarded as proportional to these differences themselves, just because these differences are minute, and the infinitesimal increments of a continuous function are proportional to those of the variable.

What we've just discussed about roulette also applies to the example of minor planets. The zodiac can be seen as a giant roulette where many small balls have been thrown, each with different starting speeds based on certain rules. Their current distribution is uniform and doesn't depend on these rules, just like in the previous case. This explains why phenomena follow the laws of chance when small changes in the causes can lead to significant differences in the effects. We can think of the probabilities of these small changes as being proportional to the changes themselves, simply because these changes are tiny, and the infinitesimal increments of a continuous function are proportional to those of the variable.

Take an entirely different example, where intervenes especially the complexity of the causes. Suppose a player shuffles a pack of cards. At each shuffle he changes the order of the cards, and he may change them in many ways. To simplify the exposition, consider only three cards. The cards which before the shuffle occupied respectively the places 123, may after the shuffle occupy the places

Take a completely different example, where the complexity of the causes is particularly involved. Imagine a player shuffling a deck of cards. With each shuffle, he changes the order of the cards, and there are many ways to do this. To keep it simple, let’s only consider three cards. The cards that originally held the positions 1, 2, and 3 may after the shuffle occupy the positions

123, 231, 312, 321, 132, 213.

123, 231, 312, 321, 132, 213.

Each of these six hypotheses is possible and they have respectively for probabilities:

Each of these six hypotheses is possible, and they each have the following probabilities:

p1, p2, p3, p4, p5, p6.

p1, p2, p3, p4, p5, p6.

The sum of these six numbers equals 1; but this is all we know of them; these six probabilities depend naturally upon the habits of the player which we do not know.[Pg 405]

The total of these six numbers adds up to 1; but that's all we can say about them; these six probabilities are naturally influenced by the player's habits, which we don’t know.[Pg 405]

At the second shuffle and the following, this will recommence, and under the same conditions; I mean that p4 for example represents always the probability that the three cards which occupied after the nth shuffle and before the n + 1th the places 123, occupy the places 321 after the n + 1th shuffle. And this remains true whatever be the number n, since the habits of the player and his way of shuffling remain the same.

At the second shuffle and every shuffle after that, this will start over, and under the same conditions; I mean that p4 for instance always represents the probability that the three cards which were in positions 123 after the nth shuffle and before the n + 1th shuffle will occupy positions 321 after the n + 1th shuffle. And this stays true no matter what the number n is, since the player's habits and their way of shuffling remain the same.

But if the number of shuffles is very great, the cards which before the first shuffle occupied the places 123 may, after the last shuffle, occupy the places

But if the number of shuffles is really high, the cards that were in positions 1, 2, and 3 before the first shuffle might end up in those same positions after the last shuffle.

123, 231, 312, 321, 132, 213

123, 231, 312, 321, 132, 213

and the probability of these six hypotheses will be sensibly the same and equal to 1/6; and this will be true whatever be the numbers p1 ... p6 which we do not know. The great number of shuffles, that is to say the complexity of the causes, has produced uniformity.

and the likelihood of these six hypotheses will be roughly the same and equal to 1/6; and this will hold true regardless of the numbers p1 ... p6 that we don’t know. The high number of shuffles, meaning the complexity of the causes, has resulted in uniformity.

This would apply without change if there were more than three cards, but even with three cards the demonstration would be complicated; let it suffice to give it for only two cards. Then we have only two possibilities 12, 21 with the probabilities p1 and p2 = 1 − p1.

This would remain the same if there were more than three cards, but even with three cards, the explanation would be complicated; it’s enough to illustrate it with just two cards. So, we have only two possibilities: 12 and 21, with the probabilities p1 and p2 = 1 − p1.

Suppose n shuffles and suppose I win one franc if the cards are finally in the initial order and lose one if they are finally inverted. Then, my mathematical expectation will be (p1p2)n.

Suppose n shuffles occur, and let’s say I win one franc if the cards end up in their original order and lose one franc if they are reversed. Then, my expected value will be (p1p2)n.

The difference p1p2 is certainly less than 1; so that if n is very great my expectation will be zero; we need not learn p1 and p2 to be aware that the game is equitable.

The difference p1p2 is definitely less than 1; so if n is very large, my expectation will be zero; we don't need to know p1 and p2 to realize that the game is fair.

There would always be an exception if one of the numbers p1 and p2 was equal to 1 and the other naught. Then it would not apply because our initial hypotheses would be too simple.

There would always be an exception if one of the numbers p1 and p2 was equal to 1 and the other was zero. Then it wouldn't apply because our initial assumptions would be too simplistic.

What we have just seen applies not only to the mixing of cards, but to all mixings, to those of powders and of liquids; and even to those of the molecules of gases in the kinetic theory of gases.

What we've just observed applies not only to shuffling cards, but to all types of mixing, including powders and liquids; and even to the mixing of gas molecules in the kinetic theory of gases.

To return to this theory, suppose for a moment a gas whose molecules can not mutually clash, but may be deviated by hitting the insides of the vase wherein the gas is confined. If the form[Pg 406] of the vase is sufficiently complex the distribution of the molecules and that of the velocities will not be long in becoming uniform. But this will not be so if the vase is spherical or if it has the shape of a cuboid. Why? Because in the first case the distance from the center to any trajectory will remain constant; in the second case this will be the absolute value of the angle of each trajectory with the faces of the cuboid.

To return to this theory, let’s consider a gas where the molecules don’t collide with each other, but can be redirected by hitting the walls of the container holding the gas. If the shape[Pg 406] of the container is complex enough, the distribution of the molecules and their velocities will quickly become uniform. However, this will not happen if the container is spherical or has the shape of a box. Why? Because in the first case, the distance from the center to any path will stay constant; in the second case, it will depend on the angle of each path relative to the faces of the box.

So we see what should be understood by conditions too simple; they are those which conserve something, which leave an invariant remaining. Are the differential equations of the problem too simple for us to apply the laws of chance? This question would seem at first view to lack precise meaning; now we know what it means. They are too simple if they conserve something, if they admit a uniform integral. If something in the initial conditions remains unchanged, it is clear the final situation can no longer be independent of the initial situation.

So we understand what is meant by conditions too simple; they are those that preserve something, leaving an unchanged aspect. Are the differential equations of the problem too simple for us to apply the laws of probability? This question might initially seem unclear; but now we see what it means. They are too simple if they preserve something, if they allow for a uniform solution. If something in the initial conditions stays the same, it’s clear that the final situation cannot be independent of the initial situation.

We come finally to the theory of errors. We know not to what are due the accidental errors, and precisely because we do not know, we are aware they obey the law of Gauss. Such is the paradox. The explanation is nearly the same as in the preceding cases. We need know only one thing: that the errors are very numerous, that they are very slight, that each may be as well negative as positive. What is the curve of probability of each of them? We do not know; we only suppose it is symmetric. We prove then that the resultant error will follow Gauss's law, and this resulting law is independent of the particular laws which we do not know. Here again the simplicity of the result is born of the very complexity of the data.

We finally arrive at the theory of errors. We don’t know what causes the random errors, and precisely because we don’t know, we understand they follow Gauss’s law. That’s the paradox. The explanation is almost the same as in the previous cases. We only need to know one thing: that the errors are very numerous, that they are very small, and that each can be either negative or positive. What is the probability curve for each of them? We don’t know; we only assume it’s symmetric. We then prove that the overall error will follow Gauss's law, and this resulting law is independent of the specific laws that we don’t understand. Once again, the simplicity of the result comes from the complexity of the data.

VII

But we are not through with paradoxes. I have just recalled the figment of Flammarion, that of the man going quicker than light, for whom time changes sign. I said that for him all phenomena would seem due to chance. That is true from a certain point of view, and yet all these phenomena at a given moment would not be distributed in conformity with the laws of chance, since the distribution would be the same as for us, who, seeing them unfold harmoniously and without coming out of a primal chaos, do not regard them as ruled by chance.[Pg 407]

But we’re not done with contradictions. I just remembered Flammarion’s idea about a man moving faster than light, for whom time behaves differently. I mentioned that for him, all events would seem random. That’s true from one perspective, but at a specific moment, these events wouldn’t follow the laws of randomness, because the distribution would be the same as for us. We see them unfolding smoothly and without reverting to a primal chaos, so we don’t see them as determined by chance.[Pg 407]

What does that mean? For Lumen, Flammarion's man, slight causes seem to produce great effects; why do not things go on as for us when we think we see grand effects due to little causes? Would not the same reasoning be applicable in his case?

What does that mean? For Lumen, Flammarion's guy, small causes seem to lead to big effects; why don't things work the same way for us when we think we see major effects from minor causes? Wouldn't the same logic apply to him?

Let us return to the argument. When slight differences in the causes produce vast differences in the effects, why are these effects distributed according to the laws of chance? Suppose a difference of a millimeter in the cause produces a difference of a kilometer in the effect. If I win in case the effect corresponds to a kilometer bearing an even number, my probability of winning will be 1/2. Why? Because to make that, the cause must correspond to a millimeter with an even number. Now, according to all appearance, the probability of the cause varying between certain limits will be proportional to the distance apart of these limits, provided this distance be very small. If this hypothesis were not admitted there would no longer be any way of representing the probability by a continuous function.

Let’s get back to the argument. When small differences in the causes create huge differences in the effects, why do these effects follow the laws of chance? Imagine that a difference of just a millimeter in the cause results in a difference of a kilometer in the effect. If I win when the effect matches a kilometer that’s an even number, my chance of winning would be 1/2. Why? Because for that to happen, the cause has to correspond to a millimeter that’s an even number. Now, it seems that the probability of the cause varying within certain limits will be proportional to the distance between those limits, as long as that distance is very small. If we didn’t accept this idea, there would be no way to represent probability using a continuous function.

What now will happen when great causes produce small effects? This is the case where we should not attribute the phenomenon to chance and where on the contrary Lumen would attribute it to chance. To a difference of a kilometer in the cause would correspond a difference of a millimeter in the effect. Would the probability of the cause being comprised between two limits n kilometers apart still be proportional to n? We have no reason to suppose so, since this distance, n kilometers, is great. But the probability that the effect lies between two limits n millimeters apart will be precisely the same, so it will not be proportional to n, even though this distance, n millimeters, be small. There is no way therefore of representing the law of probability of effects by a continuous curve. This curve, understand, may remain continuous in the analytic sense of the word; to infinitesimal variations of the abscissa will correspond infinitesimal variations of the ordinate. But practically it will not be continuous, since very small variations of the ordinate would not correspond to very small variations of the abscissa. It would become impossible to trace the curve with an ordinary pencil; that is what I mean.

What will happen when significant causes lead to minor effects? This is a situation where we shouldn't blame the phenomenon on chance, whereas Lumen would. A one-kilometer difference in the cause would lead to a one-millimeter difference in the effect. Would the likelihood of the cause being between two limits n kilometers apart still be proportional to n? We have no reason to think so, since this distance, n kilometers, is large. However, the probability that the effect falls between two limits n millimeters apart will be exactly the same, so it won't be proportional to n, even though this distance, n millimeters, is small. Therefore, there's no way to represent the law of probability of effects with a continuous curve. This curve, keep in mind, could still be continuous in the analytic sense; infinitesimal changes in the x-axis would correspond to infinitesimal changes in the y-axis. But practically, it won’t be continuous, since very small changes in the y-axis wouldn't match very small changes in the x-axis. It would become impossible to draw the curve with an ordinary pencil; that's what I mean.

So what must we conclude? Lumen has no right to say that[Pg 408] the probability of the cause (his cause, our effect) should be represented necessarily by a continuous function. But then why have we this right? It is because this state of unstable equilibrium which we have been calling initial is itself only the final outcome of a long previous history. In the course of this history complex causes have worked a great while: they have contributed to produce the mixture of elements and they have tended to make everything uniform at least within a small region; they have rounded off the corners, smoothed down the hills and filled up the valleys. However capricious and irregular may have been the primitive curve given over to them, they have worked so much toward making it regular that finally they deliver over to us a continuous curve. And this is why we may in all confidence assume its continuity.

So what should we conclude? Lumen has no right to claim that[Pg 408] the probability of the cause (his cause, our effect) needs to be represented by a continuous function. But why do we have this right? It’s because this state of unstable equilibrium that we've been calling initial is actually just the final result of a long history before it. During this history, complex causes have been at work for a long time: they have contributed to creating the mixture of elements and have worked to make everything uniform, at least within a small area; they have smoothed the edges, leveled the hills, and filled in the valleys. No matter how unpredictable and irregular the original curve they dealt with may have been, they have worked so much to make it regular that in the end, they present us with a continuous curve. This is why we can confidently assume its continuity.

Lumen would not have the same reasons for such a conclusion. For him complex causes would not seem agents of equalization and regularity, but on the contrary would create only inequality and differentiation. He would see a world more and more varied come forth from a sort of primitive chaos. The changes he could observe would be for him unforeseen and impossible to foresee. They would seem to him due to some caprice or another; but this caprice would be quite different from our chance, since it would be opposed to all law, while our chance still has its laws. All these points call for lengthy explications, which perhaps would aid in the better comprehension of the irreversibility of the universe.

Lumen wouldn’t have the same reasons for reaching such a conclusion. To him, complex causes wouldn’t seem like agents of balance and regularity; instead, they would create only inequality and distinction. He would perceive an increasingly diverse world emerging from a kind of primitive chaos. The changes he could observe would seem unexpected and impossible to predict. They would appear to him as the result of some whim or another; but this whim would be very different from our idea of chance, as it would contradict all laws, while our chance still follows its own rules. All these points require extensive explanations, which might help in understanding the irreversibility of the universe better.

VIII

We have sought to define chance, and now it is proper to put a question. Has chance thus defined, in so far as this is possible, objectivity?

We have tried to define chance, and now it's time to ask a question. Does the chance we've defined have any objectivity, as far as that’s possible?

It may be questioned. I have spoken of very slight or very complex causes. But what is very little for one may be very big for another, and what seems very complex to one may seem simple to another. In part I have already answered by saying precisely in what cases differential equations become too simple for the laws of chance to remain applicable. But it is fitting to examine the matter a little more closely, because we may take still other points of view.[Pg 409]

It can be questioned. I've talked about very minor or very complicated causes. But what feels minor to one person can feel significant to another, and what seems complicated to one might appear simple to someone else. I've partially answered this by specifying the situations in which differential equations become too simple for the laws of chance to apply. However, it's useful to take a closer look at this topic, as we can consider other perspectives as well.[Pg 409]

What means the phrase 'very slight'? To understand it we need only go back to what has already been said. A difference is very slight, an interval is very small, when within the limits of this interval the probability remains sensibly constant. And why may this probability be regarded as constant within a small interval? It is because we assume that the law of probability is represented by a continuous curve, continuous not only in the analytic sense, but practically continuous, as already explained. This means that it not only presents no absolute hiatus, but that it has neither salients nor reentrants too acute or too accentuated.

What does the phrase 'very slight' mean? To understand it, we just need to refer back to what we’ve already discussed. A difference is very slight, and an interval is very small, when the probability within that interval stays effectively the same. But why can this probability be considered constant within a small interval? It's because we assume that the probability law is represented by a continuous curve, not only in the analytical sense but also practically continuous, as we’ve previously explained. This means it doesn’t just have no absolute gaps, but it also lacks any sharp peaks or overly pronounced dips.

And what gives us the right to make this hypothesis? We have already said it is because, since the beginning of the ages, there have always been complex causes ceaselessly acting in the same way and making the world tend toward uniformity without ever being able to turn back. These are the causes which little by little have flattened the salients and filled up the reentrants, and this is why our probability curves now show only gentle undulations. In milliards of milliards of ages another step will have been made toward uniformity, and these undulations will be ten times as gentle; the radius of mean curvature of our curve will have become ten times as great. And then such a length as seems to us to-day not very small, since on our curve an arc of this length can not be regarded as rectilineal, should on the contrary at that epoch be called very little, since the curvature will have become ten times less and an arc of this length may be sensibly identified with a sect.

And what gives us the right to propose this idea? We've already mentioned that it's because, since the beginning of time, there have always been complex forces constantly acting in the same way, pushing the world toward uniformity without ever being able to reverse course. These forces have gradually smoothed out the high points and filled in the low points, which is why our probability curves now show only gentle waves. Over billions of years, another step will have been taken toward uniformity, and these waves will be ten times as gentle; the average curvature of our curve will have increased tenfold. At that time, what seems to us today like a moderately sized length—since on our curve, an arc of this length can't be seen as straight—should, on the contrary, be considered very small, since the curvature will have decreased tenfold, and an arc of this length can be practically seen as a straight line.

Thus the phrase 'very slight' remains relative; but it is not relative to such or such a man, it is relative to the actual state of the world. It will change its meaning when the world shall have become more uniform, when all things shall have blended still more. But then doubtless men can no longer live and must give place to other beings—should I say far smaller or far larger? So that our criterion, remaining true for all men, retains an objective sense.

Thus the phrase 'very slight' stays relative; but it doesn't refer to any specific person, it relates to the actual condition of the world. Its meaning will shift when the world becomes more consistent, when everything merges even further. But then, undoubtedly, humans may no longer exist and will need to make way for other beings—should I say much smaller or much larger? Therefore, our standard, remaining true for all people, holds an objective significance.

And on the other hand what means the phrase 'very complex'? I have already given one solution, but there are others. Complex causes we have said produce a blend more and more intimate,[Pg 410] but after how long a time will this blend satisfy us? When will it have accumulated sufficient complexity? When shall we have sufficiently shuffled the cards? If we mix two powders, one blue, the other white, there comes a moment when the tint of the mixture seems to us uniform because of the feebleness of our senses; it will be uniform for the presbyte, forced to gaze from afar, before it will be so for the myope. And when it has become uniform for all eyes, we still could push back the limit by the use of instruments. There is no chance for any man ever to discern the infinite variety which, if the kinetic theory is true, hides under the uniform appearance of a gas. And yet if we accept Gouy's ideas on the Brownian movement, does not the microscope seem on the point of showing us something analogous?

And on the other hand, what does the phrase 'very complex' really mean? I've already offered one answer, but there are more. We've mentioned that complex causes create an increasingly intimate blend, [Pg 410] but how long will it take for this blend to satisfy us? When will it have gathered enough complexity? When will we have shuffled the cards enough? If we mix two powders, one blue and the other white, there comes a time when the color of the mixture seems uniform because our senses can’t pick out the differences. For someone with poor eyesight, they’ll perceive it as uniform from a distance before it becomes clear for someone who sees well. Even once it looks uniform to everyone, we could still delve deeper using instruments. There’s no way for anyone to notice the infinite variety that, if the kinetic theory is correct, lies hidden beneath the uniform appearance of a gas. Yet, if we consider Gouy's ideas on Brownian motion, doesn’t the microscope seem ready to reveal something similar?

This new criterion is therefore relative like the first; and if it retains an objective character, it is because all men have approximately the same senses, the power of their instruments is limited, and besides they use them only exceptionally.

This new criterion is, therefore, relative like the first; and if it keeps an objective character, it’s because all people have roughly the same senses, the capability of their tools is limited, and besides, they only use them occasionally.

IX

It is just the same in the moral sciences and particularly in history. The historian is obliged to make a choice among the events of the epoch he studies; he recounts only those which seem to him the most important. He therefore contents himself with relating the most momentous events of the sixteenth century, for example, as likewise the most remarkable facts of the seventeenth century. If the first suffice to explain the second, we say these conform to the laws of history. But if a great event of the seventeenth century should have for cause a small fact of the sixteenth century which no history reports, which all the world has neglected, then we say this event is due to chance. This word has therefore the same sense as in the physical sciences; it means that slight causes have produced great effects.

It’s the same in the moral sciences and especially in history. The historian has to pick which events from the period they're studying to focus on; they only tell the ones they think are most significant. So they talk about the most important events of the sixteenth century, for instance, and also the most notable facts from the seventeenth century. If the first set helps explain the second, we say they follow the laws of history. However, if a major event from the seventeenth century is caused by a minor detail from the sixteenth century that no one else has reported or noticed, then we say that event happened by chance. This term carries the same meaning as it does in the physical sciences; it indicates that small causes can lead to significant effects.

The greatest bit of chance is the birth of a great man. It is only by chance that meeting of two germinal cells, of different sex, containing precisely, each on its side, the mysterious elements whose mutual reaction must produce the genius. One will agree that these elements must be rare and that their meeting is still more rare. How slight a thing it would have required to deflect from its route the carrying spermatozoon. It would have[Pg 411] sufficed to deflect it a tenth of a millimeter and Napoleon would not have been born and the destinies of a continent would have been changed. No example can better make us understand the veritable characteristics of chance.

The greatest stroke of luck is the birth of a great person. It's all a matter of chance when two reproductive cells, from different sexes, come together, each holding the mysterious elements that must interact to create genius. One can agree that these elements are likely to be rare, and their union is even rarer. Just a small change could have redirected the sperm cell's path. If it had been deflected by even one-tenth of a millimeter, Napoleon wouldn't have been born, and the fate of an entire continent would have been altered. No example illustrates the true nature of chance better than this.

One more word about the paradoxes brought out by the application of the calculus of probabilities to the moral sciences. It has been proven that no Chamber of Deputies will ever fail to contain a member of the opposition, or at least such an event would be so improbable that we might without fear wager the contrary, and bet a million against a sou.

One more thing about the contradictions highlighted by using probability calculus in moral sciences. It has been shown that no Chamber of Deputies will ever lack a member of the opposition, or at least such a situation would be so unlikely that we could confidently bet the opposite, wagering a million against a sou.

Condorcet has striven to calculate how many jurors it would require to make a judicial error practically impossible. If we had used the results of this calculation, we should certainly have been exposed to the same disappointments as in betting, on the faith of the calculus, that the opposition would never be without a representative.

Condorcet has worked to figure out how many jurors it would take to make a judicial mistake nearly impossible. If we had relied on this calculation, we would definitely have faced the same letdowns as when betting, believing that the opposing side would always have a representative.

The laws of chance do not apply to these questions. If justice be not always meted out to accord with the best reasons, it uses less than we think the method of Bridoye. This is perhaps to be regretted, for then the system of Condorcet would shield us from judicial errors.

The laws of chance don't apply to these questions. If justice isn't always served according to the best reasons, it uses a method that's less effective than we think, like Bridoye's. This might be unfortunate because then Condorcet's system would protect us from mistakes in the legal system.

What is the meaning of this? We are tempted to attribute facts of this nature to chance because their causes are obscure; but this is not true chance. The causes are unknown to us, it is true, and they are even complex; but they are not sufficiently so, since they conserve something. We have seen that this it is which distinguishes causes 'too simple.' When men are brought together they no longer decide at random and independently one of another; they influence one another. Multiplex causes come into action. They worry men, dragging them to right or left, but one thing there is they can not destroy, this is their Panurge flock-of-sheep habits. And this is an invariant.

What does this mean? We might be tempted to think that these kinds of events happen by chance because their causes are unclear; however, this isn't true randomness. It's true that we don't know the causes, and they can be complex, but they're not so complicated that nothing remains consistent. We've seen that this is what sets apart causes that are "too simple." When people come together, they stop making decisions randomly and independently; they start to influence each other. Multiple causes come into play. They confuse people, pulling them in different directions, but there’s one thing they can't change: their group mentality. And that remains consistent.

X

Difficulties are indeed involved in the application of the calculus of probabilities to the exact sciences. Why are the decimals of a table of logarithms, why are those of the number π distributed in accordance with the laws of chance? Elsewhere I have already studied the question in so far as it concerns[Pg 412] logarithms, and there it is easy. It is clear that a slight difference of argument will give a slight difference of logarithm, but a great difference in the sixth decimal of the logarithm. Always we find again the same criterion.

Difficulties do exist when applying probability calculus to the exact sciences. Why are the decimals in a logarithm table, and why are those of the number π, distributed according to chance laws? I've previously examined this question regarding[Pg 412] logarithms, and it’s straightforward. It's evident that a small change in the argument results in a small change in the logarithm, but a significant difference can appear in the sixth decimal place of the logarithm. We consistently encounter the same criteria.

But as for the number π, that presents more difficulties, and I have at the moment nothing worth while to say.

But when it comes to the number π, it poses more challenges, and right now I don't have anything significant to share.

There would be many other questions to resolve, had I wished to attack them before solving that which I more specially set myself. When we reach a simple result, when we find for example a round number, we say that such a result can not be due to chance, and we seek, for its explanation, a non-fortuitous cause. And in fact there is only a very slight probability that among 10,000 numbers chance will give a round number; for example, the number 10,000. This has only one chance in 10,000. But there is only one chance in 10,000 for the occurrence of any other one number; and yet this result will not astonish us, nor will it be hard for us to attribute it to chance; and that simply because it will be less striking.

There would be many other questions to tackle if I wanted to address them before resolving the specific one I've focused on. When we arrive at a simple outcome, like finding a round number, we say that this result can’t be attributed to chance, so we look for a non-random cause to explain it. In reality, there’s only a tiny probability of randomly landing on a round number among 10,000 possibilities; for example, getting the number 10,000. That has just one chance in 10,000. However, there’s also one chance in 10,000 for any other specific number to occur; yet that outcome doesn’t surprise us, nor do we find it hard to attribute it to chance, and that’s simply because it seems less remarkable.

Is this a simple illusion of ours, or are there cases where this way of thinking is legitimate? We must hope so, else were all science impossible. When we wish to check a hypothesis, what do we do? We can not verify all its consequences, since they would be infinite in number; we content ourselves with verifying certain ones and if we succeed we declare the hypothesis confirmed, because so much success could not be due to chance. And this is always at bottom the same reasoning.

Is this just our imagination, or are there times when this way of thinking makes sense? We have to hope so; otherwise, science would be impossible. When we want to test a hypothesis, what do we do? We can't check all its outcomes since there would be an infinite number of them; we settle for checking a few, and if we succeed, we say the hypothesis is confirmed because such success couldn't just be a coincidence. And this is fundamentally the same reasoning every time.

I can not completely justify it here, since it would take too much time; but I may at least say that we find ourselves confronted by two hypotheses, either a simple cause or that aggregate of complex causes we call chance. We find it natural to suppose that the first should produce a simple result, and then, if we find that simple result, the round number for example, it seems more likely to us to be attributable to the simple cause which must give it almost certainly, than to chance which could only give it once in 10,000 times. It will not be the same if we find a result which is not simple; chance, it is true, will not give this more than once in 10,000 times; but neither has the simple cause any more chance of producing it.

I can't fully justify this here since it would take too much time, but I can at least say we are faced with two possibilities: either a simple cause or the mix of complex causes we call chance. It seems natural to think that a simple cause should lead to a simple result, and when we do find that simple result—like a round number—it feels more likely that it's due to that simple cause, which should almost certainly produce it, rather than to chance, which would only produce it once in 10,000 tries. However, if we find a result that is not simple, chance may not produce it more than once in 10,000 times, but a simple cause also doesn’t have a better chance of producing it.


BOOK II

MATHEMATICAL REASONING


CHAPTER I

The Relativity of Space

I

It is impossible to represent to oneself empty space; all our efforts to imagine a pure space, whence should be excluded the changing images of material objects, can result only in a representation where vividly colored surfaces, for example, are replaced by lines of faint coloration, and we can not go to the very end in this way without all vanishing and terminating in nothingness. Thence comes the irreducible relativity of space.

It’s impossible to visualize empty space; whenever we try to imagine a pure space, free from the shifting images of material objects, all we can create is a representation where bright, vivid surfaces, for instance, become lines of faint color. We can’t push this idea to its limit without everything disappearing and ending in nothingness. This leads to the unavoidable relativity of space.

Whoever speaks of absolute space uses a meaningless phrase. This is a truth long proclaimed by all who have reflected upon the matter, but which we are too often led to forget.

Whoever talks about absolute space is using a meaningless phrase. This is a truth that has been widely recognized by everyone who has thought about it, but it's something we often forget.

I am at a determinate point in Paris, place du Panthéon for instance, and I say: I shall come back here to-morrow. If I be asked: Do you mean you will return to the same point of space, I shall be tempted to answer: yes; and yet I shall be wrong, since by to-morrow the earth will have journeyed hence, carrying with it the place du Panthéon, which will have traveled over more than two million kilometers. And if I tried to speak more precisely, I should gain nothing, since our globe has run over these two million kilometers in its motion with relation to the sun, while the sun in its turn is displaced with reference to the Milky Way, while the Milky Way itself is doubtless in motion without our being able to perceive its velocity. So that we are completely ignorant, and always shall be, of how much the place du Panthéon is displaced in a day.

I am at a specific point in Paris, like the Place du Panthéon, and I say: I will come back here tomorrow. If someone asks me: Do you mean you will return to the exact spot, I might be tempted to say yes; but I would be mistaken, because by tomorrow the Earth will have moved, taking the Place du Panthéon with it, which will have traveled over more than two million kilometers. And if I tried to be more precise, I wouldn't gain anything, since our planet has covered those two million kilometers in its movement relative to the sun, while the sun itself is also moving in relation to the Milky Way, and the Milky Way is likely moving too, even if we can't perceive its speed. So, we are completely unaware, and always will be, of how much the Place du Panthéon shifts in a day.

In sum, I meant to say: To-morrow I shall see again the dome[Pg 414] and the pediment of the Panthéon, and if there were no Panthéon my phrase would be meaningless and space would vanish.

In short, I wanted to say: Tomorrow I’ll see the dome[Pg 414] and the pediment of the Panthéon again, and if there were no Panthéon, my words would make no sense and everything would disappear.

This is one of the most commonplace forms of the principle of the relativity of space; but there is another, upon which Delbeuf has particularly insisted. Suppose that in the night all the dimensions of the universe become a thousand times greater: the world will have remained similar to itself, giving to the word similitude the same meaning as in Euclid, Book VI. Only what was a meter long will measure thenceforth a kilometer, what was a millimeter long will become a meter. The bed whereon I lie and my body itself will be enlarged in the same proportion.

This is one of the most common examples of the principle of the relativity of space, but there's another one that Delbeuf emphasized. Imagine that during the night, all the dimensions of the universe become a thousand times larger: the world will remain similar to itself, giving the term similitude the same meaning as in Euclid, Book VI. Only what was a meter long will now measure a kilometer, and what was a millimeter long will become a meter. The bed I'm lying on and my body will also enlarge in the same proportion.

When I awake to-morrow morning, what sensation shall I feel in presence of such an astounding transformation? Well, I shall perceive nothing at all. The most precise measurements will be incapable of revealing to me anything of this immense convulsion, since the measures I use will have varied precisely in the same proportion as the objects I seek to measure. In reality, this convulsion exists only for those who reason as if space were absolute. If I for a moment have reasoned as they do, it is the better to bring out that their way of seeing implies contradiction. In fact it would be better to say that, space being relative, nothing at all has happened, which is why we have perceived nothing.

When I wake up tomorrow morning, what will I feel in light of such an incredible change? Honestly, I won’t feel anything at all. The most accurate measurements won’t reveal anything about this massive shift, because the tools I use will change in exactly the same way as the things I’m trying to measure. The truth is, this change only exists for those who think of space as something absolute. If I’ve considered things that way for a moment, it’s only to highlight that their perspective is contradictory. In reality, it would be more accurate to say that, since space is relative, nothing has actually happened, which is why we haven’t perceived anything.

Has one the right, therefore, to say he knows the distance between two points? No, since this distance could undergo enormous variations without our being able to perceive them, provided the other distances have varied in the same proportion. We have just seen that when I say: I shall be here to-morrow, this does not mean: To-morrow I shall be at the same point of space where I am to-day, but rather: To-morrow I shall be at the same distance from the Panthéon as to-day. And we see that this statement is no longer sufficient and that I should say: To-morrow and to-day my distance from the Panthéon will be equal to the same number of times the height of my body.

Does one have the right to say they know the distance between two points? No, because this distance could change significantly without us being able to notice it, as long as the other distances change in the same way. We just saw that when I say: I will be here tomorrow, it doesn’t mean: Tomorrow I will be at the exact spot I am today, but rather: Tomorrow I will be the same distance from the Panthéon as I am today. And we see that this statement isn't enough anymore, and I should say: Tomorrow and today my distance from the Panthéon will be equal to the same number of times the height of my body.

But this is not all; I have supposed the dimensions of the world to vary, but that at least the world remained always similar to itself. We might go much further, and one of the most astonishing theories of modern physics furnishes us the occasion.[Pg 415]

But that's not all; I imagined that the size of the world could change, but at least the world always stayed somewhat the same. We could explore even more, and one of the most incredible theories in modern physics gives us the chance to do so.[Pg 415]

According to Lorentz and Fitzgerald, all the bodies borne along in the motion of the earth undergo a deformation.

According to Lorentz and Fitzgerald, all objects moving with the Earth's motion experience a deformation.

This deformation is, in reality, very slight, since all dimensions parallel to the movement of the earth diminish by a hundred millionth, while the dimensions perpendicular to this movement are unchanged. But it matters little that it is slight, that it exists suffices for the conclusion I am about to draw. And besides, I have said it was slight, but in reality I know nothing about it; I have myself been victim of the tenacious illusion which makes us believe we conceive an absolute space; I have thought of the motion of the earth in its elliptic orbit around the sun, and I have allowed thirty kilometers as its velocity. But its real velocity (I mean, this time, not its absolute velocity, which is meaningless, but its velocity with relation to the ether), I do not know that, and have no means of knowing it: it is perhaps, 10, 100 times greater, and then the deformation will be 100, 10,000 times more.

This deformation is actually very small, since all dimensions parallel to the earth's movement shrink by one hundred millionth, while the dimensions perpendicular to this movement stay the same. But it doesn't really matter that it's small; its existence is enough for the conclusion I’m about to make. And also, I’ve said it’s small, but honestly, I don’t really know. I’ve fallen prey to the persistent illusion that makes us think we understand absolute space. I’ve considered the earth’s motion in its elliptical orbit around the sun, and I estimated its speed at thirty kilometers. However, its actual speed (and this time I mean not its absolute speed, which is meaningless, but its speed relative to the ether) remains unknown to me: it could be 10 or even 100 times greater, which would then make the deformation 100 or 10,000 times more.

Can we show this deformation? Evidently not; here is a cube with edge one meter; in consequence of the earth's displacement it is deformed, one of its edges, that parallel to the motion, becomes smaller, the others do not change. If I wish to assure myself of it by aid of a meter measure, I shall measure first one of the edges perpendicular to the motion and shall find that my standard meter fits this edge exactly; and in fact neither of these two lengths is changed, since both are perpendicular to the motion. Then I wish to measure the other edge, that parallel to the motion; to do this I displace my meter and turn it so as to apply it to the edge. But the meter, having changed orientation and become parallel to the motion, has undergone, in its turn, the deformation, so that though the edge be not a meter long, it will fit exactly, I shall find out nothing.

Can we demonstrate this deformation? Clearly not; here’s a cube with an edge of one meter. Due to the earth's movement, it is deformed; one of its edges, the one parallel to the movement, gets shorter, while the others stay the same. If I want to verify this with a meter stick, I’ll first measure one of the edges that is perpendicular to the movement and will find that my standard meter fits this edge perfectly; in fact, neither of these two lengths has changed, since both are perpendicular to the movement. Then I want to measure the other edge, the one parallel to the movement; to do this, I move my meter stick and turn it to align with the edge. But since the meter stick is now oriented parallel to the movement, it has also experienced deformation, so even though the edge isn’t a meter long, it will still fit exactly, and I won’t discover anything.

You ask then of what use is the hypothesis of Lorentz and of Fitzgerald if no experiment can permit of its verification? It is my exposition that has been incomplete; I have spoken only of measurements that can be made with a meter; but we can also measure a length by the time it takes light to traverse it, on condition we suppose the velocity of light constant and independent of direction. Lorentz could have accounted for the[Pg 416] facts by supposing the velocity of light greater in the direction of the earth's motion than in the perpendicular direction. He preferred to suppose that the velocity is the same in these different directions but that the bodies are smaller in the one than in the other. If the wave surfaces of light had undergone the same deformations as the material bodies we should never have perceived the Lorentz-Fitzgerald deformation.

You’re wondering what use the Lorentz and Fitzgerald hypothesis is if no experiment can verify it? My explanation has been incomplete; I've only talked about measurements we can make with a meter. But we can also measure length by how long it takes light to travel through it, provided we assume the speed of light is constant and not affected by direction. Lorentz could have explained the[Pg 416] phenomena by suggesting that the speed of light is faster in the direction of the Earth’s movement than it is perpendicularly. He chose to believe that the speed is the same in both directions, but that objects are smaller in one direction than in the other. If the light wave fronts had changed shape in the same way as material objects, we would never have noticed the Lorentz-Fitzgerald contraction.

In either case, it is not a question of absolute magnitude, but of the measure of this magnitude by means of some instrument; this instrument may be a meter, or the path traversed by light; it is only the relation of the magnitude to the instrument that we measure; and if this relation is altered, we have no way of knowing whether it is the magnitude or the instrument which has changed.

In either case, it's not about the absolute size, but how we measure that size using some tool; this tool could be a meter or the distance light travels. We're only measuring the relationship between the size and the tool we use. If that relationship changes, we can't tell if it's the size or the tool that has changed.

But what I wish to bring out is, that in this deformation the world has not remained similar to itself; squares have become rectangles, circles ellipses, spheres ellipsoids. And yet we have no way of knowing whether this deformation be real.

But what I want to highlight is that in this distortion, the world has not remained the same; squares have turned into rectangles, circles have become ellipses, and spheres have changed into ellipsoids. Yet, we have no way of knowing if this distortion is real.

Evidently one could go much further: in place of the Lorentz-Fitzgerald deformation, whose laws are particularly simple, we could imagine any deformation whatsoever. Bodies could be deformed according to any laws, as complicated as we might wish, we never should notice it provided all bodies without exception were deformed according to the same laws. In saying, all bodies without exception, I include of course our own body and the light rays emanating from different objects.

Evidently, one could go much further: instead of the Lorentz-Fitzgerald deformation, which has particularly simple laws, we could imagine any kind of deformation. Objects could be deformed according to any rules, no matter how complicated we wanted, and we would never notice it as long as all objects, without exception, were deformed according to the same rules. When I say all objects without exception, I obviously include our own bodies and the light rays coming from different objects.

If we look at the world in one of those mirrors of complicated shape which deform objects in a bizarre way, the mutual relations of the different parts of this world would not be altered; if, in fact two real objects touch, their images likewise seem to touch. Of course when we look in such a mirror we see indeed the deformation, but this is because the real world subsists alongside of its deformed image; and then even were this real world hidden from us, something there is could not be hidden, ourself; we could not cease to see, or at least to feel, our body and our limbs which have not been deformed and which continue to serve us as instruments of measure.

If we look at the world in a weirdly shaped mirror that distorts things in a strange way, the relationships between different parts of the world wouldn’t really change; when two real objects touch, their reflections also seem to touch. Sure, when we look in that mirror, we definitely see the distortion, but that's because the real world exists alongside its distorted image. Even if this real world were hidden from us, there’s still something that couldn’t be hidden: ourselves. We couldn’t stop seeing, or at least feeling, our body and limbs, which haven’t been distorted and continue to function as our means of measurement.

But if we imagine our body itself deformed in the same way[Pg 417] as if seen in the mirror, these instruments of measure in their turn will fail us and the deformation will no longer be ascertainable.

But if we visualize our body distorted in the same way[Pg 417] as if viewed in a mirror, these measuring tools will also let us down, and the distortion will no longer be detectable.

Consider in the same way two worlds images of one another; to each object P of the world A corresponds in the world B an object , its image; the coordinates of this image are determinate functions of those of the object P; moreover these functions may be any whatsoever; I only suppose them chosen once for all. Between the position of P and that of there is a constant relation; what this relation is, matters not; enough that it be constant.

Consider two worlds reflecting images of each other; for each object P in world A, there is a corresponding object in world B, its image. The coordinates of this image are determined by functions of the coordinates of the object P; these functions can be anything, as long as they are chosen once and for all. There exists a consistent relationship between the position of P and that of ; the specific nature of this relationship isn't important; it just needs to be constant.

Well, these two worlds will be indistinguishable one from the other. I mean the first will be for its inhabitants what the second is for its. And so it will be as long as the two worlds remain strangers to each other. Suppose we lived in world A, we shall have constructed our science and in particular our geometry; during this time the inhabitants of world B will have constructed a science, and as their world is the image of ours, their geometry will also be the image of ours or, better, it will be the same. But if for us some day a window is opened upon world B, how we shall pity them: "Poor things," we shall say, "they think they have made a geometry, but what they call so is only a grotesque image of ours; their straights are all twisted, their circles are humped, their spheres have capricious inequalities." And we shall never suspect they say the same of us, and one never will know who is right.

Well, these two worlds will be indistinguishable from each other. The first will be what the second is for its inhabitants. And that will remain true as long as the two worlds stay unaware of each other. If we lived in world A, we would have developed our own science, particularly our geometry; meanwhile, the inhabitants of world B would have created their own science, and since their world reflects ours, their geometry will also reflect ours, or rather, it will be the same. But if one day we gain access to world B, we'll feel pity for them: "Poor things," we'll say, "they think they've created a geometry, but what they call that is just a distorted version of ours; their straight lines are all warped, their circles are bumpy, their spheres have strange irregularities." And we would never realize they think the same about us, and no one will ever know who is right.

We see in how broad a sense should be understood the relativity of space; space is in reality amorphous and the things which are therein alone give it a form. What then should be thought of that direct intuition we should have of the straight or of distance? So little have we intuition of distance in itself that in the night, as we have said, a distance might become a thousand times greater without our being able to perceive it, if all other distances had undergone the same alteration. And even in a night the world B might be substituted for the world A without our having any way of knowing it, and then the straight lines of yesterday would have ceased to be straight and we should never notice.[Pg 418]

We see how broadly we should understand the relativity of space; space is actually shapeless, and the things within it give it form. So, what should we think about our direct perception of straight lines or distance? We have so little understanding of distance on its own that at night, as we mentioned, a distance could increase a thousand times without us noticing, if all other distances changed in the same way. Even at night, the world B could replace the world A without us having any way to tell, and those straight lines from yesterday would no longer be straight, and we wouldn't even realize it.[Pg 418]

One part of space is not by itself and in the absolute sense of the word equal to another part of space; because if so it is for us, it would not be for the dwellers in world B; and these have just as much right to reject our opinion as we to condemn theirs.

One part of space isn't equal to another part of space in a complete sense; because if it were for us, it wouldn't be for the inhabitants of world B; and they have just as much right to dismiss our view as we do to disapprove of theirs.

I have elsewhere shown what are the consequences of these facts from the viewpoint of the idea we should form of non-Euclidean geometry and other analogous geometries; to that I do not care to return; and to-day I shall take a somewhat different point of view.

I have previously outlined the implications of these facts regarding how we should understand non-Euclidean geometry and similar geometries; I won’t revisit that topic now. Today, I want to approach it from a different perspective.

II

If this intuition of distance, of direction, of the straight line, if this direct intuition of space in a word does not exist, whence comes our belief that we have it? If this is only an illusion, why is this illusion so tenacious? It is proper to examine into this. We have said there is no direct intuition of size and we can only arrive at the relation of this magnitude to our instruments of measure. We should therefore not have been able to construct space if we had not had an instrument to measure it; well, this instrument to which we relate everything, which we use instinctively, it is our own body. It is in relation to our body that we place exterior objects, and the only spatial relations of these objects that we can represent are their relations to our body. It is our body which serves us, so to speak, as system of axes of coordinates.

If this sense of distance, direction, and straight lines—this direct perception of space—doesn’t exist, where does our belief that it does come from? If it’s just an illusion, why is this illusion so persistent? It’s worth investigating. We’ve stated that there’s no direct perception of size, and we can only understand the relationship of this size to our measuring tools. Thus, we wouldn’t be able to create a concept of space without having a tool to measure it; this measurement tool, to which we relate everything instinctively, is our own body. We place external objects in relation to our body, and the only spatial relationships we can represent for these objects are their relationships to our body. Our body acts, so to speak, as our system of coordinate axes.

For example, at an instant α, the presence of the object A is revealed to me by the sense of sight; at another instant, β, the presence of another object, B, is revealed to me by another sense, that of hearing or of touch, for instance. I judge that this object B occupies the same place as the object A. What does that mean? First that does not signify that these two objects occupy, at two different moments, the same point of an absolute space, which even if it existed would escape our cognition, since, between the instants α and β, the solar system has moved and we can not know its displacement. That means these two objects occupy the same relative position with reference to our body.

For instance, at one moment, α, I see object A; at a different moment, β, I perceive another object, B, through a different sense, like hearing or touch, for example. I conclude that object B is in the same location as object A. What does this mean? First, it doesn’t imply that these two objects occupy the same point in an absolute space, which, even if it existed, would be beyond our understanding, since, between moments α and β, the solar system has moved and we can’t track its movement. This means that these two objects occupy the same relative position concerning our body.

But even this, what does it mean? The impressions that have come to us from these objects have followed paths absolutely[Pg 419] different, the optic nerve for the object A, the acoustic nerve for the object B. They have nothing in common from the qualitative point of view. The representations we are able to make of these two objects are absolutely heterogeneous, irreducible one to the other. Only I know that to reach the object A I have just to extend the right arm in a certain way; even when I abstain from doing it, I represent to myself the muscular sensations and other analogous sensations which would accompany this extension, and this representation is associated with that of the object A.

But even so, what does it mean? The impressions we get from these objects have taken completely [Pg 419] different paths: the optic nerve for object A, and the acoustic nerve for object B. They have nothing in common from a qualitative standpoint. The ways we represent these two objects are completely distinct and cannot be reduced to one another. However, I know that to reach object A, all I have to do is extend my right arm in a specific way; even if I choose not to do it, I still picture the muscle sensations and other similar feelings that would come with that movement, and this mental image is linked to the representation of object A.

Now, I likewise know I can reach the object B by extending my right arm in the same manner, an extension accompanied by the same train of muscular sensations. And when I say these two objects occupy the same place, I mean nothing more.

Now, I also know I can grab object B by stretching out my right arm in the same way, experiencing the same set of muscle sensations. And when I say these two objects are in the same place, I mean nothing more.

I also know I could have reached the object A by another appropriate motion of the left arm and I represent to myself the muscular sensations which would have accompanied this movement; and by this same motion of the left arm, accompanied by the same sensations, I likewise could have reached the object B.

I also know I could have grabbed the object A with a different movement of my left arm, and I can picture the muscular sensations that would have come with that movement; and with the same motion of the left arm, along with those same sensations, I could have also grabbed the object B.

And that is very important, since thus I can defend myself against dangers menacing me from the object A or the object B. With each of the blows we can be hit, nature has associated one or more parries which permit of our guarding ourselves. The same parry may respond to several strokes; and so it is, for instance, that the same motion of the right arm would have allowed us to guard at the instant α against the object A and at the instant β against the object B. Just so, the same stroke can be parried in several ways, and we have said, for instance, the object A could be reached indifferently either by a certain movement of the right arm or by a certain movement of the left arm.

And that is really important because it allows me to protect myself against threats coming from object A or object B. For every blow we might receive, nature has paired it with one or more defenses that help us shield ourselves. The same defense can block multiple attacks; for example, the same motion of the right arm could help us block against object A at moment α and against object B at moment β. Similarly, the same attack can be defended against in different ways, and we’ve mentioned that object A could be defended against either with a certain movement of the right arm or with a certain movement of the left arm.

All these parries have nothing in common except warding off the same blow, and this it is, and nothing else, which is meant when we say they are movements terminating at the same point of space. Just so, these objects, of which we say they occupy the same point of space, have nothing in common, except that the same parry guards against them.

All these defenses have nothing in common except for blocking the same attack, and that’s all it means when we say they are movements ending at the same point in space. Similarly, these objects, which we say occupy the same point in space, have nothing in common except that the same defense protects against them.

Or, if you choose, imagine innumerable telegraph wires, some centripetal, others centrifugal. The centripetal wires warn us of[Pg 420] accidents happening without; the centrifugal wires carry the reparation. Connections are so established that when a centripetal wire is traversed by a current this acts on a relay and so starts a current in one of the centrifugal wires, and things are so arranged that several centripetal wires may act on the same centrifugal wire if the same remedy suits several ills, and that a centripetal wire may agitate different centrifugal wires, either simultaneously or in lieu one of the other when the same ill may be cured by several remedies.

Or, if you prefer, picture countless telegraph wires, some pulling in and others pushing out. The inward wires alert us to[Pg 420] accidents occurring outside; the outward wires handle the repairs. The connections are set up so that when a current flows through an inward wire, it activates a relay, which then starts a current in one of the outward wires. The setup is such that multiple inward wires can trigger the same outward wire if the same fix applies to various problems, and that an inward wire can stimulate different outward wires, either at the same time or one after the other, when the same issue can be resolved with different solutions.

It is this complex system of associations, it is this table of distribution, so to speak, which is all our geometry or, if you wish, all in our geometry that is instinctive. What we call our intuition of the straight line or of distance is the consciousness we have of these associations and of their imperious character.

It’s this intricate network of connections, this distribution layout, which forms the basis of our geometry, or if you prefer, everything instinctive in our geometry. What we refer to as our intuition for a straight line or distance is the awareness we have of these connections and their compelling nature.

And it is easy to understand whence comes this imperious character itself. An association will seem to us by so much the more indestructible as it is more ancient. But these associations are not, for the most part, conquests of the individual, since their trace is seen in the new-born babe: they are conquests of the race. Natural selection had to bring about these conquests by so much the more quickly as they were the more necessary.

And it's easy to see where this demanding character comes from. An association will seem even more unbreakable the older it is. However, these associations are mostly not individual achievements, since their mark can be seen in a newborn baby; they are achievements of the entire human race. Natural selection had to create these achievements more rapidly because they were all the more essential.

On this account, those of which we speak must have been of the earliest in date, since without them the defense of the organism would have been impossible. From the time when the cellules were no longer merely juxtaposed, but were called upon to give mutual aid, it was needful that a mechanism organize analogous to what we have described, so that this aid miss not its way, but forestall the peril.

On this basis, the ones we’re talking about had to be among the earliest because, without them, the survival of the organism would have been impossible. Once the cells were no longer just next to each other but needed to support one another, it was essential to have a mechanism set up similar to what we've described, ensuring that this support doesn't get misplaced but rather anticipates danger.

When a frog is decapitated, and a drop of acid is placed on a point of its skin, it seeks to wipe off the acid with the nearest foot, and, if this foot be amputated, it sweeps it off with the foot of the opposite side. There we have the double parry of which I have just spoken, allowing the combating of an ill by a second remedy, if the first fails. And it is this multiplicity of parries, and the resulting coordination, which is space.

When a frog is decapitated and a drop of acid is applied to a spot on its skin, it tries to wipe off the acid with the nearest foot, and if that foot is removed, it uses the foot on the opposite side to get it off. This illustrates the double defense I just mentioned, where one remedy is used to combat a problem if the first one doesn’t work. It’s this variety of defenses and the resulting coordination that create space.

We see to what depths of the unconscious we must descend to find the first traces of these spatial associations, since only the inferior parts of the nervous system are involved. Why be[Pg 421] astonished then at the resistance we oppose to every attempt made to dissociate what so long has been associated? Now, it is just this resistance that we call the evidence for the geometric truths; this evidence is nothing but the repugnance we feel toward breaking with very old habits which have always proved good.

We need to explore the depths of our unconscious to find the earliest signs of these spatial connections because only the lower parts of our nervous system are involved. So why be[Pg 421] surprised by the resistance we show to every effort to break apart things that have been connected for so long? This resistance is actually what we refer to as evidence for geometric truths; this evidence is just the aversion we experience to changing longstanding habits that have always served us well.

III

The space so created is only a little space extending no farther than my arm can reach; the intervention of the memory is necessary to push back its limits. There are points which will remain out of my reach, whatever effort I make to stretch forth my hand; if I were fastened to the ground like a hydra polyp, for instance, which can only extend its tentacles, all these points would be outside of space, since the sensations we could experience from the action of bodies there situated, would be associated with the idea of no movement allowing us to reach them, of no appropriate parry. These sensations would not seem to us to have any spatial character and we should not seek to localize them.

The space created is just a small area that doesn’t extend beyond my arm’s reach; I need my memory to push its boundaries. There are certain points that will always be beyond my grasp, no matter how hard I try to reach out; for example, if I were stuck to the ground like a hydra polyp, which can only extend its tentacles, all those points would be outside of that space. The sensations we could experience from actions happening there wouldn’t feel connected to any movement that could allow us to reach them, and we wouldn’t have any physical response to them. These sensations wouldn’t seem to have any spatial quality, and we wouldn’t try to pinpoint their location.

But we are not fixed to the ground like the lower animals; we can, if the enemy be too far away, advance toward him first and extend the hand when we are sufficiently near. This is still a parry, but a parry at long range. On the other hand, it is a complex parry, and into the representation we make of it enter the representation of the muscular sensations caused by the movements of the legs, that of the muscular sensations caused by the final movement of the arm, that of the sensations of the semicircular canals, etc. We must, besides, represent to ourselves, not a complex of simultaneous sensations, but a complex of successive sensations, following each other in a determinate order, and this is why I have just said the intervention of memory was necessary. Notice moreover that, to reach the same point, I may approach nearer the mark to be attained, so as to have to stretch my arm less. What more? It is not one, it is a thousand parries I can oppose to the same danger. All these parries are made of sensations which may have nothing in common and yet we regard them as defining the same point of space, since they may respond to the same danger and are all associated with the notion of this danger. It is the potentiality of warding off the[Pg 422] same stroke which makes the unity of these different parries, as it is the possibility of being parried in the same way which makes the unity of the strokes so different in kind, which may menace us from the same point of space. It is this double unity which makes the individuality of each point of space, and, in the notion of point, there is nothing else.

But we're not stuck to the ground like lower animals; we can, if the enemy is too far away, move towards them first and reach out our hand when we're close enough. This is still a parry, but it's a long-range parry. On the other hand, it's a complex parry, and the representation we create involves the sensations of our muscles from moving our legs, the sensations from the final movement of our arm, the sensations of our inner ear, and so on. Additionally, we need to visualize not a mix of simultaneous sensations but a series of successive sensations that follow a specific order, which is why I mentioned that memory's involvement is necessary. Also, to reach the same target, I can get closer so I don't have to stretch my arm as much. What's more? I can have a thousand different parries ready for the same threat. All these parries consist of sensations that might not be related, yet we consider them as defining the same location since they respond to the same danger and are all connected to the concept of that danger. It's the ability to deflect the same attack that brings unity to these various parries, just as the possibility of being countered in the same way provides unity to the strokes that can threaten us from the same location. This dual unity creates the individuality of each point in space, and when we think of a point, that's all there is.

The space before considered, which might be called restricted space, was referred to coordinate axes bound to my body; these axes were fixed, since my body did not move and only my members were displaced. What are the axes to which we naturally refer the extended space? that is to say the new space just defined. We define a point by the sequence of movements to be made to reach it, starting from a certain initial position of the body. The axes are therefore fixed to this initial position of the body.

The area we just talked about, which we can call restricted space, was linked to coordinate axes attached to my body; these axes were stationary since my body didn’t move, and only my limbs shifted. What are the axes we naturally use to refer to the extended space? In other words, the new space we just defined. We define a point by the series of movements needed to get there, starting from a specific initial position of the body. So, the axes are fixed to that starting position of the body.

But the position I call initial may be arbitrarily chosen among all the positions my body has successively occupied; if the memory more or less unconscious of these successive positions is necessary for the genesis of the notion of space, this memory may go back more or less far into the past. Thence results in the definition itself of space a certain indetermination, and it is precisely this indetermination which constitutes its relativity.

But the position I refer to as initial can be chosen randomly from all the positions my body has occupied over time; if the largely unconscious memory of these positions is needed to form the concept of space, this memory can extend back into the past to varying degrees. This leads to a certain ambiguity in the very definition of space, and it's this ambiguity that creates its relativity.

There is no absolute space, there is only space relative to a certain initial position of the body. For a conscious being fixed to the ground like the lower animals, and consequently knowing only restricted space, space would still be relative (since it would have reference to his body), but this being would not be conscious of this relativity, because the axes of reference for this restricted space would be unchanging! Doubtless the rock to which this being would be fettered would not be motionless, since it would be carried along in the movement of our planet; for us consequently these axes would change at each instant; but for him they would be changeless. We have the faculty of referring our extended space now to the position A of our body, considered as initial, again to the position B, which it had some moments afterward, and which we are free to regard in its turn as initial; we make therefore at each instant unconscious transformations of coordinates. This faculty would be lacking in our imaginary[Pg 423] being, and from not having traveled, he would think space absolute. At every instant, his system of axes would be imposed upon him; this system would have to change greatly in reality, but for him it would be always the same, since it would be always the only system. Quite otherwise is it with us, who at each instant have many systems between which we may choose at will, on condition of going back by memory more or less far into the past.

There is no absolute space; there is only space in relation to a specific initial position of the body. For a conscious being fixed to the ground like lower animals, and therefore only aware of limited space, space would still be relative (since it would relate to their body), but this being would not be aware of this relativity because the reference points for this limited space would remain constant! Undoubtedly, the rock to which this being would be bound would not be motionless, as it would move with our planet; for us, these reference points would change constantly, but for that being, they would be unchanging. We have the ability to relate our extended space to the position A of our body, viewed as the starting point, and then to the position B, which it occupies moments later and which we can also consider as the new starting point; we therefore unconsciously transform coordinates at every moment. This ability would be absent in our imaginary[Pg 423] being, and since it has not traveled, it would perceive space as absolute. At every moment, its system of reference points would be imposed on it; this system would actually change significantly, but for it, it would always be the same, as it would always be the only system. In contrast, we have many systems at our disposal at each moment, from which we can choose at will, as long as we can remember back into the past to some extent.

This is not all; restricted space would not be homogeneous; the different points of this space could not be regarded as equivalent, since some could be reached only at the cost of the greatest efforts, while others could be easily attained. On the contrary, our extended space seems to us homogeneous, and we say all its points are equivalent. What does that mean?

This isn't everything; limited space wouldn't be uniform; the various points in that space couldn't be seen as equal, since some could only be reached with a lot of effort, while others could be easily accessed. In contrast, our expansive space appears uniform to us, and we say that all its points are equal. What does that mean?

If we start from a certain place A, we can, from this position, make certain movements, M, characterized by a certain complex of muscular sensations. But, starting from another position, B, we make movements characterized by the same muscular sensations. Let a, then, be the situation of a certain point of the body, the end of the index finger of the right hand for example, in the initial position A, and b the situation of this same index when, starting from this position A, we have made the motions M. Afterwards, let be the situation of this index in the position B, and its situation when, starting from the position B, we have made the motions .

If we start from a certain place A, we can make some movements, M, defined by a specific set of muscle sensations. But if we start from another position, B, we make movements defined by the same muscle sensations. Let a represent the position of a certain part of the body, like the tip of the index finger of the right hand, in the initial position A, and b be the position of that same index finger after making the movements M. Then, let be the position of this index finger in position B, and be its position after making the movements from position B.

Well, I am accustomed to say that the points of space a and b are related to each other just as the points and , and this simply means that the two series of movements M and are accompanied by the same muscular sensations. And as I am conscious that, in passing from the position A to the position B, my body has remained capable of the same movements, I know there is a point of space related to the point just as any point b is to the point a, so that the two points a and are equivalent. This is what is called the homogeneity of space. And, at the same time, this is why space is relative, since its properties remain the same whether it be referred to the axes A or to the axes B. So that the relativity of space and its homogeneity are one sole and same thing.[Pg 424]

Well, I usually say that points in space a and b are related to each other just like points and , which simply means that the two series of movements M and come with the same muscular sensations. And since I realize that when moving from position A to position B, my body is still capable of the same movements, I know there’s a point in space that relates to point just like any point b relates to point a, making the two points a and equivalent. This is what’s called the homogeneity of space. At the same time, this is why space is relative, as its properties stay the same whether referring to axes A or to axes B. Thus, the relativity of space and its homogeneity are one and the same.[Pg 424]

Now, if I wish to pass to the great space, which no longer serves only for me, but where I may lodge the universe, I get there by an act of imagination. I imagine how a giant would feel who could reach the planets in a few steps; or, if you choose, what I myself should feel in presence of a miniature world where these planets were replaced by little balls, while on one of these little balls moved a liliputian I should call myself. But this act of imagination would be impossible for me had I not previously constructed my restricted space and my extended space for my own use.

Now, if I want to enter the vast space that doesn’t just belong to me but can hold the whole universe, I get there through my imagination. I picture how a giant would feel being able to reach the planets in just a few steps; or, if you prefer, how I would feel in front of a tiny world where these planets were swapped for small balls, and on one of those balls, I would see myself as a little person. But this act of imagination wouldn’t be possible for me if I hadn’t first created my own limited space and my expanded space for myself.

IV

Why now have all these spaces three dimensions? Go back to the "table of distribution" of which we have spoken. We have on the one side the list of the different possible dangers; designate them by A1, A2, etc.; and, on the other side, the list of the different remedies which I shall call in the same way B1, B2, etc. We have then connections between the contact studs or push buttons of the first list and those of the second, so that when, for instance, the announcer of danger A3 functions, it will put or may put in action the relay corresponding to the parry B4.

Why do all these spaces have three dimensions now? Let's refer back to the "table of distribution" we discussed. On one side, we have a list of different potential dangers, labeled as A1, A2, etc.; and on the other side, a list of different remedies, which I will also label as B1, B2, etc. We then have connections between the contact points or push buttons of the first list and those of the second, so that when, for example, the danger signal A3 is triggered, it will activate or may activate the corresponding relay for the remedy B4.

As I have spoken above of centripetal or centrifugal wires, I fear lest one see in all this, not a simple comparison, but a description of the nervous system. Such is not my thought, and that for several reasons: first I should not permit myself to put forth an opinion on the structure of the nervous system which I do not know, while those who have studied it speak only circumspectly; again because, despite my incompetence, I well know this scheme would be too simplistic; and finally because on my list of parries, some would figure very complex, which might even, in the case of extended space, as we have seen above, consist of many steps followed by a movement of the arm. It is not a question then of physical connection between two real conductors but of psychologic association between two series of sensations.

As I mentioned earlier about centripetal or centrifugal wires, I'm worried that someone might interpret this as more than just a simple comparison but as a description of the nervous system. That’s not my intention, and for several reasons: first, I wouldn't claim to have an opinion on the structure of the nervous system which I don’t fully understand, while those who have studied it speak cautiously; second, even though I'm not an expert, I know that this concept would be too simplistic; and finally, in my list of actions, some would be quite complex, which could even involve multiple steps followed by an arm movement, especially in the context of extended space, as mentioned earlier. So, it’s not about a physical connection between two real conductors but rather a psychological association between two sets of sensations.

If A1 and A2 for instance are both associated with the parry B1, and if A1 is likewise associated with the parry B2, it will generally happen that A2 and B2 will also themselves be associated. If this fundamental law were not generally true, there[Pg 425] would exist only an immense confusion and there would be nothing resembling a conception of space or a geometry. How in fact have we defined a point of space. We have done it in two ways: it is on the one hand the aggregate of announcers A in connection with the same parry B; it is on the other hand the aggregate of parries B in connection with the same announcer A. If our law was not true, we should say A1 and A2 correspond to the same point since they are both in connection with B1; but we should likewise say they do not correspond to the same point, since A1 would be in connection with B2 and the same would not be true of A2. This would be a contradiction.

If A1 and A2 are both linked to the parry B1, and if A1 is also linked to the parry B2, then generally, A2 and B2 will be connected as well. If this basic principle weren't usually true, there[Pg 425] would be nothing but chaos, and we wouldn't have any real understanding of space or geometry. So, how do we define a point in space? We define it in two ways: firstly, it's the group of announcers A associated with the same parry B; secondly, it's the group of parries B linked to the same announcer A. If our principle were false, we would say that A1 and A2 correspond to the same point since they're both associated with B1; but we would also say they don't correspond to the same point, since A1 is linked to B2 and A2 is not. That would create a contradiction.

But, from another side, if the law were rigorously and always true, space would be very different from what it is. We should have categories strongly contrasted between which would be portioned out on the one hand the announcers A, on the other hand the parries B; these categories would be excessively numerous, but they would be entirely separated one from another. Space would be composed of points very numerous, but discrete; it would be discontinuous. There would be no reason for ranging these points in one order rather than another, nor consequently for attributing to space three dimensions.

But, on the other hand, if the law were strictly and always true, space would be very different from what it is. We would have categories sharply divided, with the announcers A on one side and the parries B on the other; these categories would be excessively numerous, but completely separate from each other. Space would be made up of many points, but discrete; it would be discontinuous. There would be no reason to arrange these points in one order over another, nor consequently to assign three dimensions to space.

But it is not so; permit me to resume for a moment the language of those who already know geometry; this is quite proper since this is the language best understood by those I wish to make understand me.

But that's not the case; let me briefly return to the vocabulary of those who are familiar with geometry; this is entirely appropriate since it's the language best understood by those I want to reach.

When I desire to parry the stroke, I seek to attain the point whence comes this blow, but it suffices that I approach quite near. Then the parry B1 may answer for A1 and for A2, if the point which corresponds to B1 is sufficiently near both to that corresponding to A1 and to that corresponding to A2. But it may happen that the point corresponding to another parry B2 may be sufficiently near to the point corresponding to A1 and not sufficiently near the point corresponding to A2; so that the parry B2 may answer for A1 without answering for A2. For one who does not yet know geometry, this translates itself simply by a derogation of the law stated above. And then things will happen thus:

When I want to block the hit, I try to get to the point where the blow is coming from, but it's enough to get pretty close. Then the block B1 can work for A1 and A2, as long as the point that corresponds to B1 is close enough to those that correspond to A1 and A2. However, it could happen that the point for another block B2 is close enough to the point for A1 but not close enough to the point for A2; in that case, the block B2 can work for A1 but not for A2. For someone who doesn't know geometry yet, this simply means a breaking of the law stated above. And then things will happen like this:

Two parries B1 and B2 will be associated with the same warning[Pg 426] A1 and with a large number of warnings which we shall range in the same category as A1 and which we shall make correspond to the same point of space. But we may find warnings A2 which will be associated with B2 without being associated with B1, and which in compensation will be associated with B3, which B3 was not associated with A1, and so forth, so that we may write the series

Two parries B1 and B2 will be linked to the same warning[Pg 426] A1, along with many other warnings that we’ll categorize with A1 and connect to the same point in space. However, we might come across warnings A2 that will be linked to B2 without being linked to B1, and which, in turn, will be associated with B3, a warning that B3 was not linked to A1, and so on, allowing us to write the series.

B1, A1, B2, A2, B3, A3, B4, A4,

B1, A1, B2, A2, B3, A3, B4, A4,

where each term is associated with the following and the preceding, but not with the terms several places away.

where each term is linked to the one before it and the one after it, but not to those that are several spots away.

Needless to add that each of the terms of these series is not isolated, but forms part of a very numerous category of other warnings or of other parries which have the same connections as it, and which may be regarded as belonging to the same point of space.

Needless to say, each term in these series is not isolated; it is part of a large category of other warnings or counteractions that are connected in the same way and can be considered as belonging to the same area.

The fundamental law, though admitting of exceptions, remains therefore almost always true. Only, in consequence of these exceptions, these categories, in place of being entirely separated, encroach partially one upon another and mutually penetrate in a certain measure, so that space becomes continuous.

The basic law, while allowing for some exceptions, is still almost always valid. However, because of these exceptions, these categories don’t completely separate; they partially overlap and intersect to some extent, making space continuous.

On the other hand, the order in which these categories are to be ranged is no longer arbitrary, and if we refer to the preceding series, we see it is necessary to put B2 between A1 and A2 and consequently between B1 and B3 and that we could not for instance put it between B3 and B4.

On the other hand, the order of these categories is no longer random, and if we look at the earlier series, we see it's essential to place B2 between A1 and A2 and, therefore, between B1 and B3. We also can't, for example, put it between B3 and B4.

There is therefore an order in which are naturally arranged our categories which correspond to the points of space, and experience teaches us that this order presents itself under the form of a table of triple entry, and this is why space has three dimensions.

There is, therefore, an order in which our categories are naturally arranged that corresponds to points in space, and experience shows us that this order appears as a table of three entries, which is why space has three dimensions.

V

So the characteristic property of space, that of having three dimensions, is only a property of our table of distribution, an internal property of the human intelligence, so to speak. It would suffice to destroy certain of these connections, that is to say of the associations of ideas to give a different table of distribution, and that might be enough for space to acquire a fourth dimension.[Pg 427]

So the defining feature of space, having three dimensions, is really just a property of how we perceive it, an internal aspect of human intelligence, so to speak. If we were to break some of these connections, meaning the associations of ideas, we could create a different perception, which might allow space to gain a fourth dimension.[Pg 427]

Some persons will be astonished at such a result. The external world, they will think, should count for something. If the number of dimensions comes from the way we are made, there might be thinking beings living in our world, but who might be made differently from us and who would believe space has more or less than three dimensions. Has not M. de Cyon said that the Japanese mice, having only two pair of semicircular canals, believe that space is two-dimensional? And then this thinking being, if he is capable of constructing a physics, would he not make a physics of two or of four dimensions, and which in a sense would still be the same as ours, since it would be the description of the same world in another language?

Some people will be surprised by such a result. They might think the external world should matter. If the number of dimensions comes from how we're built, there could be intelligent beings living in our world who are made differently and believe that space has more or less than three dimensions. Didn't M. de Cyon mention that Japanese mice, having only two pairs of semicircular canals, think space is two-dimensional? And then this intelligent being, if capable of creating a physics, wouldn’t they come up with a physics based on two or four dimensions, which would, in a way, still be similar to ours since it would describe the same world in a different language?

It seems in fact that it would be possible to translate our physics into the language of geometry of four dimensions; to attempt this translation would be to take great pains for little profit, and I shall confine myself to citing the mechanics of Hertz where we have something analogous. However, it seems that the translation would always be less simple than the text, and that it would always have the air of a translation, that the language of three dimensions seems the better fitted to the description of our world, although this description can be rigorously made in another idiom. Besides, our table of distribution was not made at random. There is connection between the warning A1 and the parry B1, this is an internal property of our intelligence; but why this connection? It is because the parry B1 affords means effectively to guard against the danger A1; and this is a fact exterior to us, this is a property of the exterior world. Our table of distribution is therefore only the translation of an aggregate of exterior facts; if it has three dimensions, this is because it has adapted itself to a world having certain properties; and the chief of these properties is that there exist natural solids whose displacements follow sensibly the laws we call laws of motion of rigid solids. If therefore the language of three dimensions is that which permits us most easily to describe our world, we should not be astonished; this language is copied from our table of distribution; and it is in order to be able to live in this world that this table has been established.

It seems that it would actually be possible to translate our physics into the language of four-dimensional geometry; trying to do this would require a lot of effort for little gain, so I will just refer to Hertz's mechanics where we have something similar. However, this translation would always be more complicated than the original text and would still feel like a translation. The three-dimensional language seems better suited to describe our world, even though we could rigorously express this in another language. Moreover, our distribution table wasn't created randomly. There’s a connection between the warning A1 and the defense B1, which is an inherent aspect of our intelligence; but why this connection? It’s because the defense B1 effectively protects against the threat A1; and that's a fact beyond us, a characteristic of the external world. Our distribution table, therefore, is just a reflection of a set of external facts; if it has three dimensions, it's because it has adapted to a world with certain properties; the main of these is that there are natural solids whose movements closely follow the principles we refer to as the laws of motion of rigid solids. So, if the three-dimensional language is the one that allows us to describe our world most easily, we shouldn’t be surprised; this language is derived from our distribution table; and it’s to be able to navigate this world that this table was created.

I have said we could conceive, living in our world, thinking[Pg 428] beings whose table of distribution would be four-dimensional and who consequently would think in hyperspace. It is not certain however that such beings, admitting they were born there, could live there and defend themselves against the thousand dangers by which they would there be assailed.

I’ve mentioned that we could imagine living in our world, thinking[Pg 428] about beings whose way of organizing things would be four-dimensional and who would consequently think in hyperspace. However, it’s not guaranteed that such beings, assuming they originated there, could survive and protect themselves from the countless dangers they would face there.

VI

A few remarks to end with. There is a striking contrast between the roughness of this primitive geometry, reducible to what I call a table of distribution, and the infinite precision of the geometers' geometry. And yet this is born of that; but not of that alone; it must be made fecund by the faculty we have of constructing mathematical concepts, such as that of group, for instance; it was needful to seek among the pure concepts that which best adapts itself to this rough space whose genesis I have sought to explain and which is common to us and the higher animals.

A few final thoughts. There’s a noticeable difference between the simplicity of this basic geometry, which I refer to as a distribution table, and the infinite precision of the geometry used by mathematicians. Yet, this emerges from that; but not solely from that; it needs to be enriched by our ability to create mathematical concepts, like the concept of a group, for example. We needed to look among the pure concepts to find the one that best fits this rough space whose origins I’ve tried to explain and which is shared by us and higher animals.

The evidence for certain geometric postulates, we have said, is only our repugnance to renouncing very old habits. But these postulates are infinitely precise, while these habits have something about them essentially pliant. When we wish to think, we need postulates infinitely precise, since this is the only way to avoid contradiction; but among all the possible systems of postulates, there are some we dislike to choose because they are not sufficiently in accord with our habits; however pliant, however elastic they may be, these have a limit of elasticity.

The proof for certain geometric principles, as we've mentioned, is just our reluctance to give up long-standing habits. However, these principles are incredibly precise, while our habits tend to be more flexible. When we want to think clearly, we need principles that are extremely precise, as that's the only way to avoid contradictions. Among all the possible systems of principles, there are some we hesitate to adopt because they don't align well with our habits; no matter how flexible or adaptable they may be, they still have their limits.

We see that if geometry is not an experimental science, it is a science born apropos of experience; that we have created the space it studies, but adapting it to the world wherein we live. We have selected the most convenient space, but experience has guided our choice; as this choice has been unconscious, we think it has been imposed upon us; some say experience imposes it, others that we are born with our space ready made; we see from the preceding considerations, what in these two opinions is the part of truth, what of error.

We can see that while geometry isn't an experimental science, it originates from experience; we've created the space it examines, yet we've tailored it to the world we inhabit. We've chosen the most suitable space, but our experiences have shaped that choice. Since this choice has been unconscious, we assume it's been forced upon us. Some believe that experience dictates it, while others argue that we are born with our space already defined. From the previous points, we can discern the truth and errors in these two views.

In this progressive education whose outcome has been the construction of space, it is very difficult to determine what is the[Pg 429] part of the individual, what the part of the race. How far could one of us, transported from birth to an entirely different world, where were dominant, for instance, bodies moving in conformity to the laws of motion of non-Euclidean solids, renounce the ancestral space to build a space completely new?

In this progressive education that focuses on creating space, it's hard to tell what part belongs to the individual and what part belongs to society. How far could one of us, taken from birth to a completely different world, where, for example, bodies moved according to the principles of non-Euclidean geometry, give up our inherited space to create an entirely new one?

The part of the race seems indeed preponderant; yet if to it we owe rough space, the soft space I have spoken of, the space of the higher animals, is it not to the unconscious experience of the individual we owe the infinitely precise space of the geometer? This is a question not easy to solve. Yet we cite a fact showing that the space our ancestors have bequeathed us still retains a certain plasticity. Some hunters learn to shoot fish under water, though the image of these fish be turned up by refraction. Besides they do it instinctively: they therefore have learned to modify their old instinct of direction; or, if you choose, to substitute for the association A1, B1, another association A1, B2, because experience showed them the first would not work.

The part of the race does seem really significant; however, if we owe rough space to it, the soft space I've mentioned—the space of higher animals—isn't it the unconscious experiences of individuals that give us the incredibly precise space of a geometer? This is a tough question to tackle. Yet, there's a fact that illustrates that the space passed down from our ancestors still has some flexibility. Some hunters can learn to shoot fish underwater, even though the image of the fish appears distorted due to refraction. Furthermore, they do this instinctively: they have learned to adjust their old sense of direction; or, if you prefer, to switch the association A1, B1 for a different association A1, B2, because experience has shown them that the first wouldn’t work.


CHAPTER II

Math Definitions and Teaching

1. I should speak here of general definitions in mathematics; at least that is the title, but it will be impossible to confine myself to the subject as strictly as the rule of unity of action would require; I shall not be able to treat it without touching upon a few other related questions, and if thus I am forced from time to time to walk on the bordering flower-beds on the right or left, I pray you bear with me.

1. I should talk about general definitions in mathematics; that's the title, but I won't be able to stick to the topic as strictly as the rule of unity of action would suggest. I can't discuss it without bringing up a few other related questions, and if I occasionally have to stray into the adjacent areas on either side, I ask you to be patient with me.

What is a good definition? For the philosopher or the scientist it is a definition which applies to all the objects defined, and only those; it is the one satisfying the rules of logic. But in teaching it is not that; a good definition is one understood by the scholars.

What’s a good definition? For a philosopher or a scientist, it’s a definition that applies to all the objects being defined and only those; it follows the rules of logic. But in teaching, it’s not just that; a good definition is one that the learners grasp.

How does it happen that so many refuse to understand mathematics? Is that not something of a paradox? Lo and behold! a science appealing only to the fundamental principles of logic, to the principle of contradiction, for instance, to that which is the skeleton, so to speak, of our intelligence, to that of which we can not divest ourselves without ceasing to think, and there are people who find it obscure! and they are even in the majority! That they are incapable of inventing may pass, but that they do not understand the demonstrations shown them, that they remain blind when we show them a light which seems to us flashing pure flame, this it is which is altogether prodigious.

How is it that so many people refuse to understand math? Isn’t that a bit of a paradox? Look! It’s a science that relies on basic logic principles, like the principle of contradiction, which forms the foundation of our intelligence—something we can’t shed without stopping our ability to think. Yet, there are people who find it confusing! And they are even in the majority! It might be acceptable that they can’t create anything, but the fact that they can’t grasp the explanations presented to them, that they stay oblivious when we show them a truth that seems to shine like pure fire—this is truly astonishing.

And yet there is no need of a wide experience with examinations to know that these blind men are in no wise exceptional beings. This is a problem not easy to solve, but which should engage the attention of all those wishing to devote themselves to teaching.

And yet, you don't need extensive experience with exams to know that these blind people are not exceptional in any way. This is a challenging issue to tackle, but it should capture the interest of anyone looking to get into teaching.

What is it, to understand? Has this word the same meaning for all the world? To understand the demonstration of a theorem, is that to examine successively each of the syllogisms composing it and to ascertain its correctness, its conformity to the rules of[Pg 431] the game? Likewise, to understand a definition, is this merely to recognize that one already knows the meaning of all the terms employed and to ascertain that it implies no contradiction?

What does it mean to understand? Does this word mean the same thing for everyone? When you understand the proof of a theorem, does that just mean going through each of the arguments step by step to check that they're correct and follow the rules of[Pg 431] the process? Similarly, when you understand a definition, is it just about realizing that you already know the meaning of all the words used and confirming that it doesn't contradict itself?

For some, yes; when they have done this, they will say: I understand.

For some, yes; after they’ve done this, they will say: I get it.

For the majority, no. Almost all are much more exacting; they wish to know not merely whether all the syllogisms of a demonstration are correct, but why they link together in this order rather than another. In so far as to them they seem engendered by caprice and not by an intelligence always conscious of the end to be attained, they do not believe they understand.

For most people, no. Almost everyone is much more demanding; they want to understand not just whether all the syllogisms in a demonstration are correct, but also why they connect in this order instead of another. As far as they can see, if the connections seem random rather than driven by an intelligence aware of the goal to be achieved, they feel like they don’t truly understand.

Doubtless they are not themselves just conscious of what they crave and they could not formulate their desire, but if they do not get satisfaction, they vaguely feel that something is lacking. Then what happens? In the beginning they still perceive the proofs one puts under their eyes; but as these are connected only by too slender a thread to those which precede and those which follow, they pass without leaving any trace in their head; they are soon forgotten; a moment bright, they quickly vanish in night eternal. When they are farther on, they will no longer see even this ephemeral light, since the theorems lean one upon another and those they would need are forgotten; thus it is they become incapable of understanding mathematics.

They probably aren't fully aware of what they want or able to express their desires, but when they don't get what they need, they have a vague sense that something is missing. So what happens next? At first, they can still see the evidence presented to them; however, since these pieces are only loosely connected to what came before and after, they slip away without leaving any mark on their minds. They are quickly forgotten; a bright moment fades rapidly into darkness. Later on, they won't even see this fleeting light, as the concepts rely on one another, and the ones they need to grasp are forgotten; this is how they become unable to understand mathematics.

This is not always the fault of their teacher; often their mind, which needs to perceive the guiding thread, is too lazy to seek and find it. But to come to their aid, we first must know just what hinders them.

This isn’t always the teacher’s fault; often, their mind, which needs to see the main idea, is too lazy to look for it and find it. But to help them, we first need to understand what’s holding them back.

Others will always ask of what use is it; they will not have understood if they do not find about them, in practise or in nature, the justification of such and such a mathematical concept. Under each word they wish to put a sensible image; the definition must evoke this image, so that at each stage of the demonstration they may see it transform and evolve. Only upon this condition do they comprehend and retain. Often these deceive themselves; they do not listen to the reasoning, they look at the figures; they think they have understood and they have only seen.

Others will always ask what it's useful for; they won't understand unless they find real-life examples or nature that justify a particular mathematical concept. They want to attach a clear image to every word; the definition needs to trigger that image so they can see it change and develop at each step of the demonstration. Only under these conditions do they truly understand and remember. Often, they mislead themselves; they don't pay attention to the reasoning, only the figures; they think they’ve understood when they’ve really just glanced at it.

2. How many different tendencies! Must we combat them? Must we use them? And if we wish to combat them, which should[Pg 432] be favored? Must we show those content with the pure logic that they have seen only one side of the matter? Or need we say to those not so cheaply satisfied that what they demand is not necessary?

2. How many different tendencies there are! Should we fight against them? Should we make use of them? And if we want to fight them, which ones should[Pg 432] we support? Should we point out to those who are satisfied with pure logic that they have only considered one side of the issue? Or should we tell those who aren’t so easily satisfied that what they want isn’t necessary?

In other words, should we constrain the young people to change the nature of their minds? Such an attempt would be vain; we do not possess the philosopher's stone which would enable us to transmute one into another the metals confided to us; all we can do is to work with them, adapting ourselves to their properties.

In other words, should we force young people to change how they think? Trying to do that would be pointless; we don’t have some magical way to transform what we’ve been given; all we can do is work with them, adapting to their qualities.

Many children are incapable of becoming mathematicians, to whom however it is necessary to teach mathematics; and the mathematicians themselves are not all cast in the same mold. To read their works suffices to distinguish among them two sorts of minds, the logicians like Weierstrass for example, the intuitives like Riemann. There is the same difference among our students. The one sort prefer to treat their problems 'by analysis' as they say, the others 'by geometry.'

Many children aren't able to become mathematicians, yet it's essential to teach them math; and even mathematicians come in different types. Reading their work shows that there are two main types of thinkers: the logicians, like Weierstrass, and the intuitives, like Riemann. The same variety exists among our students. Some prefer to tackle problems 'using analysis,' while others prefer 'using geometry.'

It is useless to seek to change anything of that, and besides would it be desirable? It is well that there are logicians and that there are intuitives; who would dare say whether he preferred that Weierstrass had never written or that there never had been a Riemann? We must therefore resign ourselves to the diversity of minds, or better we must rejoice in it.

It’s pointless to try to change any of that, and besides, would we really want to? It’s good that we have logicians and intuitives; who would dare say whether they’d prefer that Weierstrass had never written anything or that Riemann had never existed? So, we should accept the diversity of minds, or better yet, we should celebrate it.

3. Since the word understand has many meanings, the definitions which will be best understood by some will not be best suited to others. We have those which seek to produce an image, and those where we confine ourselves to combining empty forms, perfectly intelligible, but purely intelligible, which abstraction has deprived of all matter.

3. Since the word "understand" has many meanings, the definitions that make sense to some people won’t be the best fit for others. We have those that aim to create a vivid image and those where we limit ourselves to putting together empty forms—clear but abstract—which have been stripped of any substance by abstraction.

I know not whether it be necessary to cite examples. Let us cite them, anyhow, and first the definition of fractions will furnish us an extreme case. In the primary schools, to define a fraction, one cuts up an apple or a pie; it is cut up mentally of course and not in reality, because I do not suppose the budget of the primary instruction allows of such prodigality. At the Normal School, on the other hand, or at the college, it is said: a fraction is the combination of two whole numbers separated by[Pg 433] a horizontal bar; we define by conventions the operations to which these symbols may be submitted; it is proved that the rules of these operations are the same as in calculating with whole numbers, and we ascertain finally that multiplying the fraction, according to these rules, by the denominator gives the numerator. This is all very well because we are addressing young people long familiarized with the notion of fractions through having cut up apples or other objects, and whose mind, matured by a hard mathematical education, has come little by little to desire a purely logical definition. But the débutant to whom one should try to give it, how dumfounded!

I don't know if it's necessary to give examples. But let's give them anyway, and first, the definition of fractions will provide us with an extreme case. In elementary schools, to define a fraction, they might cut up an apple or a pie; it's done mentally, of course, not in reality, because I don't think the school budget allows for such extravagance. On the other hand, at the Normal School or in college, it's stated: a fraction is the combination of two whole numbers separated by a horizontal bar; we define the operations these symbols can undergo by convention; it's proven that the rules for these operations are the same as when calculating with whole numbers, and we ultimately find that multiplying the fraction by the denominator gives the numerator, according to these rules. This is all well and good because we're speaking to young people who are already familiar with the idea of fractions from cutting up apples and other objects, and whose minds, shaped by rigorous math education, have gradually come to seek a purely logical definition. But for a beginner who is just being introduced to it, what confusion!

Such also are the definitions found in a book justly admired and greatly honored, the Foundations of Geometry by Hilbert. See in fact how he begins: We think three systems of things which we shall call points, straights and planes. What are these 'things'?

Such are the definitions found in a book that is rightfully admired and greatly respected, the Foundations of Geometry by Hilbert. Look at how he starts: We think three systems of stuff which we shall call points, lines, and planes. What are these 'things'?

We know not, nor need we know; it would even be a pity to seek to know; all we have the right to know of them is what the assumptions tell us; this for example: Two distinct points always determine a straight, which is followed by this remark: in place of determine, we may say the two points are on the straight, or the straight goes through these two points or joins the two points.

We don’t know, and we don’t need to know; it would actually be a shame to try to find out. All we have the right to understand from them is what the assumptions tell us; for example: Two distinct points always determine a straight, followed by this note: instead of determine, we can say the two points are on the straight, or the straight goes through these two points, or connects the two points.

Thus 'to be on a straight' is simply defined as synonymous with 'determine a straight.' Behold a book of which I think much good, but which I should not recommend to a school boy. Yet I could do so without fear, he would not read much of it. I have taken extreme examples and no teacher would dream of going that far. But even stopping short of such models, does he not already expose himself to the same danger?

Thus, 'to be on a straight' is simply defined as the same as 'determine a straight.' Look at this book that I think is quite good, but I wouldn't recommend it to a schoolboy. Still, I could do so without worry; he wouldn't read much of it. I've used extreme examples, and no teacher would ever consider going that far. But even without such examples, doesn't he already put himself at the same risk?

Suppose we are in a class; the professor dictates: the circle is the locus of points of the plane equidistant from an interior point called the center. The good scholar writes this phrase in his note-book; the bad scholar draws faces; but neither understands; then the professor takes the chalk and draws a circle on the board. "Ah!" think the scholars, "why did he not say at once: a circle is a ring, we should have understood." Doubtless the professor is right. The scholars' definition would have been of no avail, since it could serve for no demonstration, since besides it would[Pg 434] not give them the salutary habit of analyzing their conceptions. But one should show them that they do not comprehend what they think they know, lead them to be conscious of the roughness of their primitive conception, and of themselves to wish it purified and made precise.

Imagine we're in a class; the professor says: a circle is the set of points in a plane that are all the same distance from a point called the center. The diligent student writes this down in their notebook; the careless student doodles; but neither really understands; then the professor grabs the chalk and draws a circle on the board. "Oh!" think the students, "why didn’t he just say: a circle is a ring? We would have understood." The professor is definitely correct. The students’ definition wouldn’t be helpful, as it wouldn’t support any explanation, and it also wouldn’t help them develop the important habit of examining their ideas. But it’s essential to show them that they don’t truly grasp what they think they know, to make them aware of the imperfections in their initial understanding, and to encourage them to seek a clearer and more refined definition.

4. I shall return to these examples; I only wished to show you the two opposed conceptions; they are in violent contrast. This contrast the history of science explains. If we read a book written fifty years ago, most of the reasoning we find there seems lacking in rigor. Then it was assumed a continuous function can change sign only by vanishing; to-day we prove it. It was assumed the ordinary rules of calculation are applicable to incommensurable numbers; to-day we prove it. Many other things were assumed which sometimes were false.

4. I will come back to these examples; I just wanted to show you the two conflicting ideas; they are in stark contrast. The history of science explains this contrast. If we read a book written fifty years ago, most of the reasoning it contains seems lacking in rigor. Back then, it was assumed that a continuous function could only change sign by becoming zero; today we can prove that. It was assumed that the usual rules of calculation apply to incommensurable numbers; today we can prove that too. Many other assumptions were made that sometimes turned out to be wrong.

We trusted to intuition; but intuition can not give rigor, nor even certainty; we see this more and more. It tells us for instance that every curve has a tangent, that is to say that every continuous function has a derivative, and that is false. And as we sought certainty, we had to make less and less the part of intuition.

We relied on intuition, but intuition can't provide precision or even certainty; we see this more and more. It suggests, for example, that every curve has a tangent, meaning that every continuous function has a derivative, which is not true. As we pursued certainty, we had to rely less and less on intuition.

What has made necessary this evolution? We have not been slow to perceive that rigor could not be established in the reasonings, if it were not first put into the definitions.

What has made this evolution necessary? We quickly realized that we couldn't establish rigor in our reasoning without first laying it out in the definitions.

The objects occupying mathematicians were long ill defined; we thought we knew them because we represented them with the senses or the imagination; but we had of them only a rough image and not a precise concept upon which reasoning could take hold. It is there that the logicians would have done well to direct their efforts.

The things that mathematicians focused on were unclear for a long time; we thought we understood them because we represented them through our senses or imagination; but we only had a vague picture and not a clear concept that could support reasoning. This is where logicians should have channeled their efforts.

So for the incommensurable number, the vague idea of continuity, which we owe to intuition, has resolved itself into a complicated system of inequalities bearing on whole numbers. Thus have finally vanished all those difficulties which frightened our fathers when they reflected upon the foundations of the infinitesimal calculus. To-day only whole numbers are left in analysis, or systems finite or infinite of whole numbers, bound by a plexus of equalities and inequalities. Mathematics we say is arithmetized.[Pg 435]

So, regarding the incommensurable numbers, the vague idea of continuity that we got from intuition has turned into a complex system of inequalities related to whole numbers. This has finally eliminated all the challenges that worried our predecessors when they thought about the basics of infinitesimal calculus. Today, only whole numbers remain in analysis, or finite or infinite systems of whole numbers, connected by a web of equalities and inequalities. We say that mathematics has been arithmetized.[Pg 435]

5. But do you think mathematics has attained absolute rigor without making any sacrifice? Not at all; what it has gained in rigor it has lost in objectivity. It is by separating itself from reality that it has acquired this perfect purity. We may freely run over its whole domain, formerly bristling with obstacles, but these obstacles have not disappeared. They have only been moved to the frontier, and it would be necessary to vanquish them anew if we wished to break over this frontier to enter the realm of the practical.

5. But do you think mathematics has reached complete precision without giving anything up? Not at all; what it has gained in precision, it has lost in objectivity. By distancing itself from reality, it has achieved this perfect clarity. We can easily navigate its entire landscape, which used to be full of challenges, but those challenges haven't vanished. They've just been pushed to the edge, and we would need to overcome them again if we wanted to cross that border into the practical world.

We had a vague notion, formed of incongruous elements, some a priori, others coming from experiences more or less digested; we thought we knew, by intuition, its principal properties. To-day we reject the empiric elements, retaining only the a priori; one of the properties serves as definition and all the others are deduced from it by rigorous reasoning. This is all very well, but it remains to be proved that this property, which has become a definition, pertains to the real objects which experience had made known to us and whence we drew our vague intuitive notion. To prove that, it would be necessary to appeal to experience, or to make an effort of intuition, and if we could not prove it, our theorems would be perfectly rigorous, but perfectly useless.

We had a vague idea, made up of mismatched parts, some a priori, and others from experiences that were somewhat processed; we thought we knew its main properties intuitively. Today, we dismiss the empirical parts and keep only the a priori; one of the properties acts as a definition, and all the others are derived from it through strict reasoning. This is all well and good, but we still need to prove that this property, which has become the definition, applies to the real objects that experience had revealed to us and from which we developed our vague intuitive idea. To prove that, we would need to rely on experience or make an intuitive effort, and if we can't prove it, our theorems would be completely rigorous, but totally useless.

Logic sometimes makes monsters. Since half a century we have seen arise a crowd of bizarre functions which seem to try to resemble as little as possible the honest functions which serve some purpose. No longer continuity, or perhaps continuity, but no derivatives, etc. Nay more, from the logical point of view, it is these strange functions which are the most general, those one meets without seeking no longer appear except as particular case. There remains for them only a very small corner.

Logic sometimes creates monsters. For the past fifty years, we've seen a bunch of weird functions emerge that seem to try to look as different as possible from the straightforward functions that actually serve a purpose. There's no continuity, or maybe there is continuity, but no derivatives, etc. Furthermore, from a logical perspective, it's these strange functions that are the most general; those we encounter without searching for them now seem to only appear as special cases. They are left with only a very small space.

Heretofore when a new function was invented, it was for some practical end; to-day they are invented expressly to put at fault the reasonings of our fathers, and one never will get from them anything more than that.

Up until now, when a new function was created, it served some practical purpose; today, they are designed specifically to challenge the reasoning of our predecessors, and you won't get anything more from them than that.

If logic were the sole guide of the teacher, it would be necessary to begin with the most general functions, that is to say with the most bizarre. It is the beginner that would have to be set[Pg 436] grappling with this teratologic museum. If you do not do it, the logicians might say, you will achieve rigor only by stages.

If logic were the only guide for the teacher, we would have to start with the broadest concepts, which means the most unusual ones. It would be the novice who would need to wrestle with this strange collection. If you don’t do it, the logicians might argue, you will only gain precision gradually.

6. Yes, perhaps, but we can not make so cheap of reality, and I mean not only the reality of the sensible world, which however has its worth, since it is to combat against it that nine tenths of your students ask of you weapons. There is a reality more subtile, which makes the very life of the mathematical beings, and which is quite other than logic.

6. Yes, maybe, but we can't underestimate reality, and I’m not just talking about the reality of the physical world, which does have its value, since that's what most of your students come to you for—to gain tools to fight against it. There’s a deeper reality that underpins the very essence of mathematical concepts, and it’s quite different from logic.

Our body is formed of cells, and the cells of atoms; are these cells and these atoms then all the reality of the human body? The way these cells are arranged, whence results the unity of the individual, is it not also a reality and much more interesting?

Our body is made up of cells, and those cells are made of atoms; are these cells and atoms all there is to the reality of the human body? The way these cells are organized, which creates the unity of an individual, isn't that also a reality and much more fascinating?

A naturalist who never had studied the elephant except in the microscope, would he think he knew the animal adequately? It is the same in mathematics. When the logician shall have broken up each demonstration into a multitude of elementary operations, all correct, he still will not possess the whole reality; this I know not what which makes the unity of the demonstration will completely escape him.

A naturalist who has only studied the elephant under a microscope, would he really think he knows the animal well enough? It's the same in mathematics. When the logician breaks each proof down into a bunch of elementary operations, all correct, he still won’t grasp the entire reality; that indescribable quality that brings the proof together will completely elude him.

In the edifices built up by our masters, of what use to admire the work of the mason if we can not comprehend the plan of the architect? Now pure logic can not give us this appreciation of the total effect; this we must ask of intuition.

In the buildings created by our masters, what's the point of admiring the mason's work if we can't understand the architect's design? Pure logic can't give us this understanding of the overall impact; we need to rely on intuition for that.

Take for instance the idea of continuous function. This is at first only a sensible image, a mark traced by the chalk on the blackboard. Little by little it is refined; we use it to construct a complicated system of inequalities, which reproduces all the features of the primitive image; when all is done, we have removed the centering, as after the construction of an arch; this rough representation, support thenceforth useless, has disappeared and there remains only the edifice itself, irreproachable in the eyes of the logician. And yet, if the professor did not recall the primitive image, if he did not restore momentarily the centering, how could the student divine by what caprice all these inequalities have been scaffolded in this fashion one upon another? The definition would be logically correct, but it would not show him the veritable reality.

Take, for example, the concept of a continuous function. Initially, it's just a clear image, like a line drawn with chalk on a blackboard. Gradually, it gets refined; we use it to build a complex system of inequalities that captures all the aspects of the original image. Once we finish, we’ve removed the centering, similar to what happens after constructing an arch; this rough representation, no longer necessary, vanishes, leaving only the structure itself, flawless in the eyes of the logician. Yet, if the professor doesn’t bring back the original image, if he doesn't momentarily restore the centering, how would the student understand how all these inequalities were stacked together in this way? The definition might be logically accurate, but it wouldn’t reveal the true reality to him.

7. So back we must return; doubtless it is hard for a master[Pg 437] to teach what does not entirely satisfy him; but the satisfaction of the master is not the unique object of teaching; we should first give attention to what the mind of the pupil is and to what we wish it to become.

7. So we have to go back; it's definitely tough for a teacher[Pg 437] to teach something that doesn’t fully satisfy them. But the teacher’s satisfaction isn’t the only goal of teaching; we should first focus on where the student is at and where we want them to be.

Zoologists maintain that the embryonic development of an animal recapitulates in brief the whole history of its ancestors throughout geologic time. It seems it is the same in the development of minds. The teacher should make the child go over the path his fathers trod; more rapidly, but without skipping stations. For this reason, the history of science should be our first guide.

Zoologists argue that an animal's embryonic development briefly mirrors the complete history of its ancestors over geological time. It appears the same applies to the development of minds. Teachers should guide children through the experiences of their forebears; faster, but without skipping important stages. For this reason, the history of science should be our primary guide.

Our fathers thought they knew what a fraction was, or continuity, or the area of a curved surface; we have found they did not know it. Just so our scholars think they know it when they begin the serious study of mathematics. If without warning I tell them: "No, you do not know it; what you think you understand, you do not understand; I must prove to you what seems to you evident," and if in the demonstration I support myself upon premises which to them seem less evident than the conclusion, what shall the unfortunates think? They will think that the science of mathematics is only an arbitrary mass of useless subtilities; either they will be disgusted with it, or they will play it as a game and will reach a state of mind like that of the Greek sophists.

Our fathers thought they understood fractions, continuity, or the area of a curved surface; we've discovered they didn't. Similarly, our scholars believe they know these concepts when they start studying mathematics seriously. If I were to suddenly tell them, "No, you don’t truly understand; what you think you comprehend isn’t real understanding; I need to show you what seems obvious to you," and in my explanation, I base my argument on premises that seem less obvious to them than the conclusion, what will those poor souls think? They might believe that the field of mathematics is just a confusing set of pointless complexities; either they'll feel turned off by it, or they'll treat it like a game and end up with a mindset similar to that of the Greek sophists.

Later, on the contrary, when the mind of the scholar, familiarized with mathematical reasoning, has been matured by this long frequentation, the doubts will arise of themselves and then your demonstration will be welcome. It will awaken new doubts, and the questions will arise successively to the child, as they arose successively to our fathers, until perfect rigor alone can satisfy him. To doubt everything does not suffice, one must know why he doubts.

Later, on the other hand, when the scholar's mind, accustomed to mathematical reasoning, has developed through this long engagement, doubts will naturally come up, and then your proof will be appreciated. It will spark new doubts, and questions will come up one after another for the child, just as they did for our ancestors, until only perfect rigor can satisfy him. Simply doubting everything isn’t enough; one must understand the reason behind the doubt.

8. The principal aim of mathematical teaching is to develop certain faculties of the mind, and among them intuition is not the least precious. It is through it that the mathematical world remains in contact with the real world, and if pure mathematics could do without it, it would always be necessary to have recourse to it to fill up the chasm which separates the symbol from reality.[Pg 438] The practician will always have need of it, and for one pure geometer there should be a hundred practicians.

8. The main goal of teaching math is to develop certain mental skills, and intuition is definitely one of the most valuable. It’s through intuition that the world of mathematics stays connected to the real world, and even though pure math might not rely on it, we will always need it to bridge the gap between symbols and reality.[Pg 438] Practitioners will always need it, and for every pure geometer, there should be a hundred practitioners.

The engineer should receive a complete mathematical education, but for what should it serve him?

The engineer should get a thorough education in math, but what is it for?

To see the different aspects of things and see them quickly; he has no time to hunt mice. It is necessary that, in the complex physical objects presented to him, he should promptly recognize the point where the mathematical tools we have put in his hands can take hold. How could he do it if we should leave between instruments and objects the deep chasm hollowed out by the logicians?

To recognize the various aspects of things quickly; he has no time to waste. It's essential that, among the complex physical objects in front of him, he can swiftly identify where the mathematical tools we've given him can be applied. How could he do this if we leave a significant gap between the instruments and the objects, created by the logicians?

9. Besides the engineers, other scholars, less numerous, are in their turn to become teachers; they therefore must go to the very bottom; a knowledge deep and rigorous of the first principles is for them before all indispensable. But this is no reason not to cultivate in them intuition; for they would get a false idea of the science if they never looked at it except from a single side, and besides they could not develop in their students a quality they did not themselves possess.

9. In addition to the engineers, there are other scholars, though fewer in number, who are also set to become teachers; they need to start from the ground up. A deep and thorough understanding of the fundamental principles is essential for them. However, this doesn’t mean they shouldn’t also develop their intuition; if they only view the science from one perspective, they’ll end up with a distorted understanding, and they won’t be able to instill a quality in their students that they don’t have themselves.

For the pure geometer himself, this faculty is necessary; it is by logic one demonstrates, by intuition one invents. To know how to criticize is good, to know how to create is better. You know how to recognize if a combination is correct; what a predicament if you have not the art of choosing among all the possible combinations. Logic tells us that on such and such a way we are sure not to meet any obstacle; it does not say which way leads to the end. For that it is necessary to see the end from afar, and the faculty which teaches us to see is intuition. Without it the geometer would be like a writer who should be versed in grammar but had no ideas. Now how could this faculty develop if, as soon as it showed itself, we chase it away and proscribe it, if we learn to set it at naught before knowing the good of it.

For the pure geometer, this ability is essential; logic helps us prove, while intuition helps us create. It’s good to know how to critique, but it's even better to know how to invent. You can tell if a combination is correct; but what a dilemma if you lack the skill to choose from all the possible combinations. Logic tells us that on a particular path, we won’t hit any obstacles; it doesn’t indicate which path leads to the end. For that, we need to see the end from a distance, and the ability that helps us see is intuition. Without it, a geometer would be like a writer who knows grammar but has no ideas. How can this ability grow if we dismiss it and push it away every time it appears, if we learn to ignore it before understanding its value?

And here permit a parenthesis to insist upon the importance of written exercises. Written compositions are perhaps not sufficiently emphasized in certain examinations, at the polytechnic school, for instance. I am told they would close the door[Pg 439] against very good scholars who have mastered the course, thoroughly understanding it, and who nevertheless are incapable of making the slightest application. I have just said the word understand has several meanings: such students only understand in the first way, and we have seen that suffices neither to make an engineer nor a geometer. Well, since choice must be made, I prefer those who understand completely.

And here, let me take a moment to stress the importance of writing exercises. Written assignments might not be emphasized enough in certain assessments, like at the polytechnic school, for example. I've heard that they might turn away very good students who have mastered the material, fully grasping it, yet are unable to apply it at all. I just mentioned that "understand" has several meanings: these students only understand in a basic sense, and we know that’s not enough to make a good engineer or geometer. So, since we have to choose, I prefer those who truly understand it all.

10. But is the art of sound reasoning not also a precious thing, which the professor of mathematics ought before all to cultivate? I take good care not to forget that. It should occupy our attention and from the very beginning. I should be distressed to see geometry degenerate into I know not what tachymetry of low grade and I by no means subscribe to the extreme doctrines of certain German Oberlehrer. But there are occasions enough to exercise the scholars in correct reasoning in the parts of mathematics where the inconveniences I have pointed out do not present themselves. There are long chains of theorems where absolute logic has reigned from the very first and, so to speak, quite naturally, where the first geometers have given us models we should constantly imitate and admire.

10. But isn't the skill of sound reasoning also something valuable that mathematics professors should prioritize? I make sure not to overlook that. It should be our focus right from the start. It would upset me to see geometry turn into some sort of low-quality tachymetry. I don’t agree with the extreme views of certain German teachers. However, there are plenty of opportunities to train students in correct reasoning within the areas of mathematics where the issues I've mentioned don’t arise. There are extensive chains of theorems where pure logic has been present from the beginning, and where the earliest geometers have provided us with models we should continually strive to imitate and appreciate.

It is in the exposition of first principles that it is necessary to avoid too much subtility; there it would be most discouraging and moreover useless. We can not prove everything and we can not define everything; and it will always be necessary to borrow from intuition; what does it matter whether it be done a little sooner or a little later, provided that in using correctly premises it has furnished us, we learn to reason soundly.

It’s important to keep things straightforward when explaining basic concepts; being overly complicated can be discouraging and pointless. We can’t prove or define everything, and we always have to rely on our intuition. It doesn’t really matter if we do this a bit earlier or later, as long as we use the premises correctly and learn to reason properly.

11. Is it possible to fulfill so many opposing conditions? Is this possible in particular when it is a question of giving a definition? How find a concise statement satisfying at once the uncompromising rules of logic, our desire to grasp the place of the new notion in the totality of the science, our need of thinking with images? Usually it will not be found, and this is why it is not enough to state a definition; it must be prepared for and justified.

11. Is it really possible to meet so many conflicting conditions? Is this feasible, especially when it comes to providing a definition? How can we come up with a clear statement that satisfies the strict rules of logic, our need to understand where the new concept fits within the entire field of science, and our desire to think visually? Typically, we won’t find such a statement, and that’s why it’s not enough to just give a definition; it needs to be properly prepared and justified.

What does that mean? You know it has often been said: every definition implies an assumption, since it affirms the existence of the object defined. The definition then will not be[Pg 440] justified, from the purely logical point of view, until one shall have proved that it involves no contradiction, neither in the terms, nor with the verities previously admitted.

What does that mean? It’s often said that every definition carries an assumption because it confirms the existence of the object being defined. Therefore, the definition won’t be[Pg 440] justified, from a purely logical standpoint, until it can be proven that it doesn’t involve any contradictions, neither in the terms nor with the truths already accepted.

But this is not enough; the definition is stated to us as a convention; but most minds will revolt if we wish to impose it upon them as an arbitrary convention. They will be satisfied only when you have answered numerous questions.

But this isn't enough; the definition is presented to us as a convention; however, most people will resist if we try to force it on them as an arbitrary convention. They will only be satisfied after you’ve answered a lot of questions.

Usually mathematical definitions, as M. Liard has shown, are veritable constructions built up wholly of more simple notions. But why assemble these elements in this way when a thousand other combinations were possible?

Usually, mathematical definitions, as M. Liard has shown, are genuine constructions made entirely of simpler ideas. But why put these elements together this way when countless other combinations could have been chosen?

Is it by caprice? If not, why had this combination more right to exist than all the others? To what need does it respond? How was it foreseen that it would play an important rôle in the development of the science, that it would abridge our reasonings and our calculations? Is there in nature some familiar object which is so to speak the rough and vague image of it?

Is it just random? If not, why does this combination have more legitimacy than all the others? What need does it fulfill? How was it predicted that it would play an important role in the development of the science, that it would simplify our reasoning and calculations? Is there something in nature that, in a way, serves as a rough and vague representation of it?

This is not all; if you answer all these questions in a satisfactory manner, we shall see indeed that the new-born had the right to be baptized; but neither is the choice of a name arbitrary; it is needful to explain by what analogies one has been guided and that if analogous names have been given to different things, these things at least differ only in material and are allied in form; that their properties are analogous and so to say parallel.

This isn't everything; if you answer all these questions satisfactorily, we will clearly see that the newborn has the right to be baptized. However, choosing a name is not random; it’s important to explain the reasoning behind your choice and that if similar names have been assigned to different things, those things at least differ only in material and are connected in form; their characteristics are similar and, you could say, parallel.

At this cost we may satisfy all inclinations. If the statement is correct enough to please the logician, the justification will satisfy the intuitive. But there is still a better procedure; wherever possible, the justification should precede the statement and prepare for it; one should be led on to the general statement by the study of some particular examples.

At this cost, we can meet all interests. If the claim is logical enough to satisfy the logician, the reasoning will satisfy those with intuition. But there's a better approach; whenever possible, the reasoning should come before the claim and set the stage for it. One should arrive at the general claim by looking at specific examples first.

Still another thing: each of the parts of the statement of a definition has as aim to distinguish the thing to be defined from a class of other neighboring objects. The definition will be understood only when you have shown, not merely the object defined, but the neighboring objects from which it is proper to distinguish it, when you have given a grasp of the difference and when you have added explicitly: this is why in stating the definition I have said this or that.[Pg 441]

Another thing to note: each part of a definition aims to set the thing being defined apart from similar objects. A definition is only clear when you show not just the object itself, but also the nearby objects that it should be distinguished from. You need to highlight the differences, and explicitly explain why you chose to frame the definition the way you did.[Pg 441]

But it is time to leave generalities and examine how the somewhat abstract principles I have expounded may be applied in arithmetic, geometry, analysis and mechanics.

But it's time to move away from general ideas and look at how the somewhat abstract principles I've discussed can be applied in arithmetic, geometry, analysis, and mechanics.

Math

12. The whole number is not to be defined; in return, one ordinarily defines the operations upon whole numbers; I believe the scholars learn these definitions by heart and attach no meaning to them. For that there are two reasons: first they are made to learn them too soon, when their mind as yet feels no need of them; then these definitions are not satisfactory from the logical point of view. A good definition for addition is not to be found just simply because we must stop and can not define everything. It is not defining addition to say it consists in adding. All that can be done is to start from a certain number of concrete examples and say: the operation we have performed is called addition.

12. The whole number isn't meant to be defined; instead, we typically define the operations on whole numbers. I think students memorize these definitions without understanding them. There are two reasons for this: first, they're made to learn them too early, when their minds don't yet see the need for them; second, these definitions often don't hold up logically. A good definition for addition just can't be found, simply because we have to draw the line somewhere and can't define everything. Saying that addition consists of adding doesn't really define it. All we can do is start from a few concrete examples and say, "the operation we've done is called addition."

For subtraction it is quite otherwise; it may be logically defined as the operation inverse to addition; but should we begin in that way? Here also start with examples, show on these examples the reciprocity of the two operations; thus the definition will be prepared for and justified.

For subtraction, it’s different; it can be logically defined as the operation opposite to addition. But should we start that way? Instead, we should begin with examples, demonstrating the relationship between the two operations. This way, the definition will be set up and validated.

Just so again for multiplication; take a particular problem; show that it may be solved by adding several equal numbers; then show that we reach the result more quickly by a multiplication, an operation the scholars already know how to do by routine and out of that the logical definition will issue naturally.

Just to clarify for multiplication: take a specific problem; demonstrate that it can be solved by adding several equal numbers; then show that we get to the answer more quickly with multiplication, an operation the students already know how to do by habit, and from that, the logical definition will naturally emerge.

Division is defined as the operation inverse to multiplication; but begin by an example taken from the familiar notion of partition and show on this example that multiplication reproduces the dividend.

Division is defined as the operation that reverses multiplication; let’s start with an example from the common idea of partitioning and demonstrate with this example that multiplication reproduces the dividend.

There still remain the operations on fractions. The only difficulty is for multiplication. It is best to expound first the theory of proportion; from it alone can come a logical definition; but to make acceptable the definitions met at the beginning of this theory, it is necessary to prepare for them by numerous examples taken from classic problems of the rule of three, taking pains to introduce fractional data.

There are still operations involving fractions to cover. The main challenge is multiplication. It’s best to first explain the theory of proportion; a logical definition can only come from that. However, to make the definitions found at the start of this theory easier to understand, it’s important to set the stage with various examples taken from classic problems related to the rule of three, making sure to include fractional values.

Neither should we fear to familiarize the scholars with the[Pg 442] notion of proportion by geometric images, either by appealing to what they remember if they have already studied geometry, or in having recourse to direct intuition, if they have not studied it, which besides will prepare them to study it. Finally I shall add that after defining multiplication of fractions, it is needful to justify this definition by showing that it is commutative, associative and distributive, and calling to the attention of the auditors that this is established to justify the definition.

Neither should we hesitate to introduce scholars to the[Pg 442] concept of proportion through geometric images, either by referencing what they remember if they've already studied geometry or through direct intuitions if they haven't studied it yet, which will also prepare them for that study. Finally, I’d like to add that after defining multiplication of fractions, it’s necessary to justify this definition by demonstrating that it is commutative, associative, and distributive, and to point out to the audience that this is done to validate the definition.

One sees what a rôle geometric images play in all this; and this rôle is justified by the philosophy and the history of the science. If arithmetic had remained free from all admixture of geometry, it would have known only the whole number; it is to adapt itself to the needs of geometry that it invented anything else.

One can see how important geometric images are in all of this, and this importance is backed by the philosophy and the history of the science. If arithmetic had stayed completely separate from geometry, it would have only dealt with whole numbers; it was to meet the needs of geometry that it created anything beyond that.

Geometry

In geometry we meet forthwith the notion of the straight line. Can the straight line be defined? The well-known definition, the shortest path from one point to another, scarcely satisfies me. I should start simply with the ruler and show at first to the scholar how one may verify a ruler by turning; this verification is the true definition of the straight line; the straight line is an axis of rotation. Next he should be shown how to verify the ruler by sliding and he would have one of the most important properties of the straight line.

In geometry, we immediately encounter the concept of the straight line. Can we define the straight line? The common definition, the shortest distance between two points, doesn’t really satisfy me. I would start with the ruler and first demonstrate to the student how to verify a ruler by rotating it; this verification is the true definition of the straight line—the straight line is an axis of rotation. Next, they should see how to verify the ruler by sliding, which will reveal one of the most important properties of the straight line.

As to this other property of being the shortest path from one point to another, it is a theorem which can be demonstrated apodictically, but the demonstration is too delicate to find a place in secondary teaching. It will be worth more to show that a ruler previously verified fits on a stretched thread. In presence of difficulties like these one need not dread to multiply assumptions, justifying them by rough experiments.

As for this other quality of being the shortest route from one point to another, it's a theorem that can be proven definitively, but the proof is too intricate to include in secondary education. It would be more useful to show that a previously checked ruler aligns with a stretched thread. When faced with challenges like these, there's no need to be afraid of making additional assumptions, supported by simple experiments.

It is needful to grant these assumptions, and if one admits a few more of them than is strictly necessary, the evil is not very great; the essential thing is to learn to reason soundly on the assumptions admitted. Uncle Sarcey, who loved to repeat, often said that at the theater the spectator accepts willingly all the postulates imposed upon him at the beginning, but the curtain[Pg 443] once raised, he becomes uncompromising on the logic. Well, it is just the same in mathematics.

It’s important to accept these assumptions, and if you accept a few more than strictly necessary, it’s not a huge deal; the key is to learn how to think critically about the accepted assumptions. Uncle Sarcey, who liked to repeat it often, said that at the theater, the audience willingly accepts all the premises set before them at the start, but once the curtain[Pg 443] is up, they become strict about the logic. The same thing applies in mathematics.

For the circle, we may start with the compasses; the scholars will recognize at the first glance the curve traced; then make them observe that the distance of the two points of the instrument remains constant, that one of these points is fixed and the other movable, and so we shall be led naturally to the logical definition.

For the circle, we can begin with a compass; the students will easily recognize the curve it creates. Then, have them notice that the distance between the two points of the instrument stays the same, that one of these points is fixed while the other can move, and this will naturally take us to the logical definition.

The definition of the plane implies an axiom and this need not be hidden. Take a drawing board and show that a moving ruler may be kept constantly in complete contact with this plane and yet retain three degrees of freedom. Compare with the cylinder and the cone, surfaces on which an applied straight retains only two degrees of freedom; next take three drawing boards; show first that they will glide while remaining applied to one another and this with three degrees of freedom; and finally to distinguish the plane from the sphere, show that two of these boards which fit a third will fit each other.

The definition of a plane includes an axiom that doesn’t need to be concealed. Take a drawing board and demonstrate that a moving ruler can stay in full contact with this plane while still having three degrees of freedom. Compare this to the cylinder and the cone, where a straight edge only has two degrees of freedom. Next, take three drawing boards; first, show that they can slide against each other while still being aligned, which also has three degrees of freedom. Finally, to differentiate the plane from the sphere, show that two of these boards that fit with a third will also fit with each other.

Perhaps you are surprised at this incessant employment of moving things; this is not a rough artifice; it is much more philosophic than one would at first think. What is geometry for the philosopher? It is the study of a group. And what group? That of the motions of solid bodies. How define this group then without moving some solids?

Perhaps you’re surprised by this constant use of moving things; this isn’t a simple trick; it’s much more philosophical than you might initially think. What does geometry mean to a philosopher? It’s the study of a set. And what set? The motions of solid objects. So how do we define this set without moving some solids?

Should we retain the classic definition of parallels and say parallels are two coplanar straights which do not meet, however far they be prolonged? No, since this definition is negative, since it is unverifiable by experiment, and consequently can not be regarded as an immediate datum of intuition. No, above all because it is wholly strange to the notion of group, to the consideration of the motion of solid bodies which is, as I have said, the true source of geometry. Would it not be better to define first the rectilinear translation of an invariable figure, as a motion wherein all the points of this figure have rectilinear trajectories; to show that such a translation is possible by making a square glide on a ruler?

Should we keep the classic definition of parallels and say that parallels are two straight lines in the same plane that never meet, no matter how far they're extended? No, because this definition is negative, it's not something we can confirm through experiments, and therefore we can't consider it an immediate fact of intuition. No, especially because it completely clashes with the idea of groups and the movement of solid objects, which is, as I've mentioned, the true foundation of geometry. Wouldn’t it be better to first define the straight-line movement of an unchanging shape as a motion in which all the points of this shape follow straight-line paths? We could demonstrate that such a movement is possible by making a square slide along a ruler?

From this experimental ascertainment, set up as an assumption, it would be easy to derive the notion of parallel and Euclid's postulate itself.

From this experimental finding, assumed as a basis, it would be straightforward to derive the concept of parallel lines and Euclid's postulate itself.

Mechanics

I need not return to the definition of velocity, or acceleration, or other kinematic notions; they may be advantageously connected with that of the derivative.

I don't need to revisit the definitions of velocity, acceleration, or other kinematic concepts; they can be effectively linked to the concept of the derivative.

I shall insist, on the other hand, upon the dynamic notions of force and mass.

I will emphasize, on the other hand, the dynamic ideas of force and mass.

I am struck by one thing: how very far the young people who have received a high-school education are from applying to the real world the mechanical laws they have been taught. It is not only that they are incapable of it; they do not even think of it. For them the world of science and the world of reality are separated by an impervious partition wall.

I’m amazed by one thing: how far off the young people who have a high school education are from applying the mechanical principles they’ve learned to the real world. It’s not just that they can’t do it; they don’t even consider it. For them, the world of science and the world of reality are divided by an unbreakable wall.

If we try to analyze the state of mind of our scholars, this will astonish us less. What is for them the real definition of force? Not that which they recite, but that which, crouching in a nook of their mind, from there directs it wholly. Here is the definition: forces are arrows with which one makes parallelograms. These arrows are imaginary things which have nothing to do with anything existing in nature. This would not happen if they had been shown forces in reality before representing them by arrows.

If we look closely at how our scholars think, it’ll be less surprising. What do they really mean by force? It’s not just the definition they repeat, but what lurks in the back of their minds, guiding their thoughts completely. Here’s the definition: forces are like arrows used to create parallelograms. These arrows are just imaginary concepts that don’t connect to anything in the natural world. This wouldn’t be the case if they had seen real forces in action before they started representing them as arrows.

How shall we define force?

How do we define force?

I think I have elsewhere sufficiently shown there is no good logical definition. There is the anthropomorphic definition, the sensation of muscular effort; this is really too rough and nothing useful can be drawn from it.

I believe I've shown in other places that there is no solid logical definition. There's the human-centered definition, which refers to the feeling of muscle strain; however, that's really too vague and doesn't provide any useful insights.

Here is how we should go: first, to make known the genus force, we must show one after the other all the species of this genus; they are very numerous and very different; there is the pressure of fluids on the insides of the vases wherein they are contained; the tension of threads; the elasticity of a spring; the gravity working on all the molecules of a body; friction; the normal mutual action and reaction of two solids in contact.

Here’s the plan: first, to explain the concept of force, we need to illustrate all the different types of this concept one by one; there are many and they vary greatly. These include the pressure of fluids inside the containers they’re in, the tension in threads, the elasticity of a spring, the force of gravity acting on all the molecules in a body, friction, and the regular interactions and reactions between two solid objects in contact.

This is only a qualitative definition; it is necessary to learn to measure force. For that begin by showing that one force may be replaced by another without destroying equilibrium; we may find the first example of this substitution in the balance and Borda's double weighing.

This is just a qualitative definition; we need to learn how to measure force. To start, show that one force can be replaced by another without disrupting equilibrium; we can find the first example of this substitution in the balance and Borda's double weighing.

Then show that a weight may be replaced, not only by another[Pg 445] weight, but by force of a different nature; for instance, Prony's brake permits replacing weight by friction.

Then demonstrate that a weight can be substituted, not just with another[Pg 445] weight, but also by a force of a different kind; for example, Prony's brake allows for weight to be replaced by friction.

From all this arises the notion of the equivalence of two forces.

From all this comes the idea of the equivalence of two forces.

The direction of a force must be defined. If a force F is equivalent to another force applied to the body considered by means of a stretched string, so that F may be replaced by without affecting the equilibrium, then the point of attachment of the string will be by definition the point of application of the force , and that of the equivalent force F; the direction of the string will be the direction of the force and that of the equivalent force F.

The direction of a force needs to be defined. If a force F is equivalent to another force acting on the body through a stretched string, allowing F to be replaced by without impacting the equilibrium, then the point where the string is attached is, by definition, the point where the force is applied, as well as the equivalent force F; the direction of the string will represent the direction of the force and the equivalent force F.

From that, pass to the comparison of the magnitude of forces. If a force can replace two others with the same direction, it equals their sum; show for example that a weight of 20 grams may replace two 10-gram weights.

From that, move on to comparing the strength of forces. If one force can take the place of two others that are in the same direction, it is the same as their total; for example, demonstrate that a weight of 20 grams can replace two weights of 10 grams each.

Is this enough? Not yet. We now know how to compare the intensity of two forces which have the same direction and same point of application; we must learn to do it when the directions are different. For that, imagine a string stretched by a weight and passing over a pulley; we shall say that the tensor of the two legs of the string is the same and equal to the tension weight.

Is this enough? Not yet. We now understand how to compare the strength of two forces that have the same direction and act at the same point; we need to learn how to do it when the directions are different. To do this, picture a string that is pulled by a weight and goes over a pulley; we’ll say that the tension in both sides of the string is the same and equal to the weight creating the tension.

This definition of ours enables us to compare the tensions of the two pieces of our string, and, using the preceding definitions, to compare any two forces having the same direction as these two pieces. It should be justified by showing that the tension of the last piece of the string remains the same for the same tensor weight, whatever be the number and the disposition of the reflecting pulleys. It has still to be completed by showing this is only true if the pulleys are frictionless.

This definition allows us to compare the tensions of the two sections of our string, and, using the earlier definitions, to compare any two forces that have the same direction as these two sections. We need to justify this by demonstrating that the tension of the last section of the string remains constant for the same weight, regardless of the number and arrangement of the reflecting pulleys. We also need to clarify that this is only true if the pulleys are frictionless.

Once master of these definitions, it is to be shown that the point of application, the direction and the intensity suffice to determine a force; that two forces for which these three elements are the same are always equivalent and may always be replaced by one another, whether in equilibrium or in movement, and this whatever be the other forces acting.

Once you understand these definitions, it will be demonstrated that the point of application, the direction, and the intensity are enough to determine a force; that two forces for which these three elements are the same are always equivalent and can always replace each other, whether in equilibrium or in motion, regardless of any other forces acting.

It must be shown that two concurrent forces may always be replaced by a unique resultant; and that this resultant remains[Pg 446] the same, whether the body be at rest or in motion and whatever be the other forces applied to it.

It must be shown that two concurrent forces can always be replaced by a single, unique resultant; and that this resultant remains[Pg 446] the same, whether the body is at rest or in motion and regardless of any other forces acting on it.

Finally it must be shown that forces thus defined satisfy the principle of the equality of action and reaction.

Finally, it must be demonstrated that the forces defined in this way meet the principle of action and reaction being equal.

Experiment it is, and experiment alone, which can teach us all that. It will suffice to cite certain common experiments, which the scholars make daily without suspecting it, and to perform before them a few experiments, simple and well chosen.

Experimenting is what teaches us all of that. It’s enough to mention some common experiments that students do every day without even realizing it, and to show them a few simple and carefully selected experiments.

It is after having passed through all these meanders that one may represent forces by arrows, and I should even wish that in the development of the reasonings return were made from time to time from the symbol to the reality. For instance it would not be difficult to illustrate the parallelogram of forces by aid of an apparatus formed of three strings, passing over pulleys, stretched by weights and in equilibrium while pulling on the same point.

It is only after navigating through all these twists and turns that we can represent forces with arrows. I would even prefer that during our reasoning, we occasionally return from the symbols to the actual concepts. For example, it wouldn't be hard to demonstrate the parallelogram of forces using a setup made of three strings, running over pulleys, pulled by weights, and in balance while pulling at the same point.

Knowing force, it is easy to define mass; this time the definition should be borrowed from dynamics; there is no way of doing otherwise, since the end to be attained is to give understanding of the distinction between mass and weight. Here again, the definition should be led up to by experiments; there is in fact a machine which seems made expressly to show what mass is, Atwood's machine; recall also the laws of the fall of bodies, that the acceleration of gravity is the same for heavy as for light bodies, and that it varies with the latitude, etc.

Knowing force makes it easy to define mass; this time, the definition should be taken from dynamics. There's no other way to do it, because the goal is to clarify the difference between mass and weight. Again, this definition should be supported by experiments. There's actually a machine designed specifically to demonstrate what mass is: Atwood's machine. Also, remember the laws of falling bodies, which state that the acceleration due to gravity is the same for both heavy and light objects, and that it changes with latitude, etc.

Now, if you tell me that all the methods I extol have long been applied in the schools, I shall rejoice over it more than be surprised at it. I know that on the whole our mathematical teaching is good. I do not wish it overturned; that would even distress me. I only desire betterments slowly progressive. This teaching should not be subjected to brusque oscillations under the capricious blast of ephemeral fads. In such tempests its high educative value would soon founder. A good and sound logic should continue to be its basis. The definition by example is always necessary, but it should prepare the way for the logical definition, it should not replace it; it should at least make this wished for, in the cases where the true logical definition can be advantageously given only in advanced teaching.[Pg 447]

Now, if you tell me that all the methods I praise have long been used in schools, I’ll be more pleased than surprised. I know that overall, our math teaching is good. I don’t want it completely changed; that would actually upset me. I only want gradual improvements. This teaching shouldn’t be subject to sudden swings due to the whims of temporary trends. In such conditions, its valuable educational impact would quickly decline. A solid and sound logic should remain its foundation. Defining concepts through examples is always important, but it should lead to logical definitions, not replace them; at the very least, it should make logical definitions desirable, especially in cases where they can only be effectively given in advanced classes.[Pg 447]

Understand that what I have here said does not imply giving up what I have written elsewhere. I have often had occasion to criticize certain definitions I extol to-day. These criticisms hold good completely. These definitions can only be provisory. But it is by way of them that we must pass.

Understand that what I’ve said here doesn’t mean I'm abandoning what I’ve written before. I’ve often had the chance to criticize certain definitions I promote today. Those criticisms are still valid. These definitions can only be temporary. But we need to go through them to move forward.


CHAPTER III

Math and Logic

Intro

Can mathematics be reduced to logic without having to appeal to principles peculiar to mathematics? There is a whole school, abounding in ardor and full of faith, striving to prove it. They have their own special language, which is without words, using only signs. This language is understood only by the initiates, so that commoners are disposed to bow to the trenchant affirmations of the adepts. It is perhaps not unprofitable to examine these affirmations somewhat closely, to see if they justify the peremptory tone with which they are presented.

Can math be simplified to logic without relying on principles unique to math? There's an entire movement, full of passion and belief, trying to prove this. They have their own special language, which doesn’t use words, only symbols. This language is only understood by those in the know, so regular people tend to accept the strong claims made by the experts. It might be helpful to take a closer look at these claims to see if they really support the authoritative way they're presented.

But to make clear the nature of the question it is necessary to enter upon certain historical details and in particular to recall the character of the works of Cantor.

But to clarify the nature of the question, it's important to go into some historical details and specifically to remember the character of Cantor's works.

Since long ago the notion of infinity had been introduced into mathematics; but this infinite was what philosophers call a becoming. The mathematical infinite was only a quantity capable of increasing beyond all limit: it was a variable quantity of which it could not be said that it had passed all limits, but only that it could pass them.

Since long ago, the concept of infinity has been introduced into mathematics; however, this infinity is what philosophers refer to as a becoming. The mathematical infinity was just a quantity that could grow beyond any limit: it was a variable quantity where it couldn't be said that it had passed all limits, but only that it could pass them.

Cantor has undertaken to introduce into mathematics an actual infinite, that is to say a quantity which not only is capable of passing all limits, but which is regarded as having already passed them. He has set himself questions like these: Are there more points in space than whole numbers? Are there more points in space than points in a plane? etc.

Cantor has taken on the challenge of bringing an actual infinite into mathematics, meaning a quantity that not only can exceed all limits but is also seen as having already done so. He has posed questions like these: Are there more points in space than whole numbers? Are there more points in space than points on a plane? etc.

And then the number of whole numbers, that of the points of space, etc., constitutes what he calls a transfinite cardinal number, that is to say a cardinal number greater than all the ordinary cardinal numbers. And he has occupied himself in comparing these transfinite cardinal numbers. In arranging in a proper order the elements of an aggregate containing an infinity of[Pg 449] them, he has also imagined what he calls transfinite ordinal numbers upon which I shall not dwell.

And then the count of whole numbers, along with the points in space, makes up what he calls a transfinite cardinal number, which means a cardinal number greater than all the regular cardinal numbers. He has focused on comparing these transfinite cardinal numbers. While organizing the elements of a set that contains an infinite number of[Pg 449] them, he also came up with what he calls transfinite ordinal numbers, which I won't discuss here.

Many mathematicians followed his lead and set a series of questions of the sort. They so familiarized themselves with transfinite numbers that they have come to make the theory of finite numbers depend upon that of Cantor's cardinal numbers. In their eyes, to teach arithmetic in a way truly logical, one should begin by establishing the general properties of transfinite cardinal numbers, then distinguish among them a very small class, that of the ordinary whole numbers. Thanks to this détour, one might succeed in proving all the propositions relative to this little class (that is to say all our arithmetic and our algebra) without using any principle foreign to logic. This method is evidently contrary to all sane psychology; it is certainly not in this way that the human mind proceeded in constructing mathematics; so its authors do not dream, I think, of introducing it into secondary teaching. But is it at least logic, or, better, is it correct? It may be doubted.

Many mathematicians followed his example and posed a series of questions like that. They became so familiar with transfinite numbers that they started to make the theory of finite numbers depend on Cantor's cardinal numbers. In their view, to teach arithmetic in a truly logical way, one should first establish the general properties of transfinite cardinal numbers, and then highlight a very small class among them, which is the ordinary whole numbers. Thanks to this detour, it might be possible to prove all the propositions related to this small class (that is, all our arithmetic and algebra) without using any principle that is outside of logic. This method clearly goes against all reasonable psychology; it certainly isn't how the human mind developed mathematics, so its authors likely don't intend to introduce it into secondary education. But is it at least logical, or better yet, is it correct? That is open to doubt.

The geometers who have employed it are however very numerous. They have accumulated formulas and they have thought to free themselves from what was not pure logic by writing memoirs where the formulas no longer alternate with explanatory discourse as in the books of ordinary mathematics, but where this discourse has completely disappeared.

The geometers who have used it are, however, very numerous. They've gathered formulas and believed they could free themselves from anything that wasn't pure logic by writing papers where the formulas no longer alternate with explanatory text as in regular math books, but where this text has completely disappeared.

Unfortunately they have reached contradictory results, what are called the cantorian antinomies, to which we shall have occasion to return. These contradictions have not discouraged them and they have tried to modify their rules so as to make those disappear which had already shown themselves, without being sure, for all that, that new ones would not manifest themselves.

Unfortunately, they have come to contradictory conclusions, known as the cantorian antinomies, which we will return to later. These contradictions haven’t discouraged them, and they’ve attempted to adjust their rules to eliminate those that had already appeared, without being sure that new ones wouldn’t arise.

It is time to administer justice on these exaggerations. I do not hope to convince them; for they have lived too long in this atmosphere. Besides, when one of their demonstrations has been refuted, we are sure to see it resurrected with insignificant alterations, and some of them have already risen several times from their ashes. Such long ago was the Lernæan hydra with its famous heads which always grew again. Hercules got through,[Pg 450] since his hydra had only nine heads, or eleven; but here there are too many, some in England, some in Germany, in Italy, in France, and he would have to give up the struggle. So I appeal only to men of good judgment unprejudiced.

It’s time to bring justice to these exaggerations. I don’t expect to change their minds; they’ve been in this environment for too long. Besides, whenever one of their claims is debunked, we can always expect to see it reappear with minor changes, and some have already come back several times. It reminds me of the Lernaean hydra with its famous heads that always grew back. Hercules managed to handle it since his hydra had only nine heads, or maybe eleven; but here, there are too many—some in England, some in Germany, some in Italy, and some in France—and he would have to give up the fight. So I’m appealing only to fair-minded people who are unbiased.

I

In these latter years numerous works have been published on pure mathematics and the philosophy of mathematics, trying to separate and isolate the logical elements of mathematical reasoning. These works have been analyzed and expounded very clearly by M. Couturat in a book entitled: The Principles of Mathematics.

In recent years, many works have been published on pure mathematics and the philosophy of mathematics, aiming to separate and clarify the logical components of mathematical reasoning. M. Couturat has analyzed and explained these works very clearly in a book titled: The Principles of Mathematics.

For M. Couturat, the new works, and in particular those of Russell and Peano, have finally settled the controversy, so long pending between Leibnitz and Kant. They have shown that there are no synthetic judgments a priori (Kant's phrase to designate judgments which can neither be demonstrated analytically, nor reduced to identities, nor established experimentally), they have shown that mathematics is entirely reducible to logic and that intuition here plays no rôle.

For M. Couturat, the new works, especially those by Russell and Peano, have finally resolved the long-standing debate between Leibnitz and Kant. They have demonstrated that there are no synthetic judgments a priori (Kant's term for judgments that can't be proven analytically, reduced to identities, or established through experiments). They have shown that mathematics can be completely reduced to logic and that intuition does not play a role here.

This is what M. Couturat has set forth in the work just cited; this he says still more explicitly in his Kant jubilee discourse, so that I heard my neighbor whisper: "I well see this is the centenary of Kant's death."

This is what M. Couturat has presented in the previously mentioned work; he expresses this even more clearly in his Kant jubilee speech, leading me to overhear my neighbor whisper: "I can see this is the centenary of Kant's death."

Can we subscribe to this conclusive condemnation? I think not, and I shall try to show why.

Can we agree with this definitive judgment? I don't think so, and I'll explain why.

II

What strikes us first in the new mathematics is its purely formal character: "We think," says Hilbert, "three sorts of things, which we shall call points, straights and planes. We convene that a straight shall be determined by two points, and that in place of saying this straight is determined by these two points, we may say it passes through these two points, or that these two points are situated on this straight." What these things are, not only we do not know, but we should not seek to know. We have no need to, and one who never had seen either point or straight or plane could geometrize as well as we. That[Pg 451] the phrase to pass through, or the phrase to be situated upon may arouse in us no image, the first is simply a synonym of to be determined and the second of to determine.

What stands out to us first in the new mathematics is its purely formal nature: "We think," says Hilbert, "about three types of things, which we’ll call points, lines, and planes. We agree that a line is defined by two points, and instead of saying this line is defined by these two points, we can say it goes through these two points, or that these two points are on this line." What these things are, we don’t know, and we shouldn’t try to figure it out. We don’t need to, and someone who has never seen a point, line, or plane could do geometry just as well as we do. That[Pg 451] the phrase to go through, or the phrase to be on might not conjure any image in us, the first is simply another way of saying to be defined and the second of to define.

Thus, be it understood, to demonstrate a theorem, it is neither necessary nor even advantageous to know what it means. The geometer might be replaced by the logic piano imagined by Stanley Jevons; or, if you choose, a machine might be imagined where the assumptions were put in at one end, while the theorems came out at the other, like the legendary Chicago machine where the pigs go in alive and come out transformed into hams and sausages. No more than these machines need the mathematician know what he does.

Thus, let it be clear that to prove a theorem, it is neither necessary nor even beneficial to understand what it means. The geometer could be substituted by the logic piano envisioned by Stanley Jevons; or, if you prefer, you could imagine a machine where the assumptions are input at one end, while the theorems come out at the other, like the famous Chicago machine where the pigs go in alive and come out turned into hams and sausages. Just like these machines, the mathematician does not need to understand what he is doing.

I do not make this formal character of his geometry a reproach to Hilbert. This is the way he should go, given the problem he set himself. He wished to reduce to a minimum the number of the fundamental assumptions of geometry and completely enumerate them; now, in reasonings where our mind remains active, in those where intuition still plays a part, in living reasonings, so to speak, it is difficult not to introduce an assumption or a postulate which passes unperceived. It is therefore only after having carried back all the geometric reasonings to a form purely mechanical that he could be sure of having accomplished his design and finished his work.

I don't hold Hilbert's formal approach to geometry against him. This was the right path for him to take, considering the challenge he set for himself. He wanted to minimize the number of fundamental assumptions in geometry and list them completely. However, in arguments where our thinking remains engaged, where intuition still plays a role—essentially, in dynamic reasoning—it's hard not to introduce an assumption or a postulate that goes unnoticed. So, it was only after he reduced all geometric reasoning to a purely mechanical form that he could be confident he had achieved his goal and completed his work.

What Hilbert did for geometry, others have tried to do for arithmetic and analysis. Even if they had entirely succeeded, would the Kantians be finally condemned to silence? Perhaps not, for in reducing mathematical thought to an empty form, it is certainly mutilated.

What Hilbert did for geometry, others have tried to do for arithmetic and analysis. Even if they had completely succeeded, would the Kantians be ultimately silenced? Maybe not, because by turning mathematical thinking into a hollow framework, it definitely loses its essence.

Even admitting it were established that all the theorems could be deduced by procedures purely analytic, by simple logical combinations of a finite number of assumptions, and that these assumptions are only conventions; the philosopher would still have the right to investigate the origins of these conventions, to see why they have been judged preferable to the contrary conventions.

Even if it were proven that all theorems could be derived from purely analytical methods, through straightforward logical combinations of a limited number of assumptions, and that these assumptions are just conventions, the philosopher would still have the right to explore the origins of these conventions and understand why they have been considered preferable to opposing conventions.

And then the logical correctness of the reasonings leading from the assumptions to the theorems is not the only thing which should occupy us. The rules of perfect logic, are they[Pg 452] the whole of mathematics? As well say the whole art of playing chess reduces to the rules of the moves of the pieces. Among all the constructs which can be built up of the materials furnished by logic, choice must be made; the true geometer makes this choice judiciously because he is guided by a sure instinct, or by some vague consciousness of I know not what more profound and more hidden geometry, which alone gives value to the edifice constructed.

And the logical accuracy of the arguments that connect the assumptions to the theorems isn't the only thing we should focus on. Are the rules of perfect logic[Pg 452] all there is to mathematics? That would be like saying that the entire game of chess comes down to just the rules for how the pieces move. Out of all the structures we can create using the materials provided by logic, we have to make choices; a true geometer makes these choices wisely because they are guided by a reliable instinct or by some vague awareness of a deeper, hidden geometry that gives real value to the structure they build.

To seek the origin of this instinct, to study the laws of this deep geometry, felt, not stated, would also be a fine employment for the philosophers who do not want logic to be all. But it is not at this point of view I wish to put myself, it is not thus I wish to consider the question. The instinct mentioned is necessary for the inventor, but it would seem at first we might do without it in studying the science once created. Well, what I wish to investigate is if it be true that, the principles of logic once admitted, one can, I do not say discover, but demonstrate, all the mathematical verities without making a new appeal to intuition.

To explore the origin of this instinct, to examine the principles of this deeper geometry, felt but not articulated, would also be an excellent task for philosophers who prefer not to limit everything to logic. However, that’s not the perspective I want to adopt; that’s not how I want to approach the issue. The instinct mentioned is essential for the inventor, but initially, it seems we could manage without it when studying established science. What I aim to investigate is whether it’s true that, once the principles of logic are accepted, one can—I'm not saying discover but demonstrate—every mathematical truth without resorting to intuition again.

III

I once said no to this question:[12] should our reply be modified by the recent works? My saying no was because "the principle of complete induction" seemed to me at once necessary to the mathematician and irreducible to logic. The statement of this principle is: "If a property be true of the number 1, and if we establish that it is true of n + 1 provided it be of n, it will be true of all the whole numbers." Therein I see the mathematical reasoning par excellence. I did not mean to say, as has been supposed, that all mathematical reasonings can be reduced to an application of this principle. Examining these reasonings closely, we there should see applied many other analogous principles, presenting the same essential characteristics. In this category of principles, that of complete induction is only the simplest of all and this is why I have chosen it as type.

I once said no to this question:[12] should we change our response based on recent works? My no came from the belief that "the principle of complete induction" is essential for mathematicians and cannot be simplified to just logic. This principle states: "If a property is true for the number 1, and if we prove it’s true for n + 1 whenever it’s true for n, then it will be true for all whole numbers." I see this as the pinnacle of mathematical reasoning. I didn't mean to imply, as has been assumed, that all mathematical reasoning can be boiled down to this principle. If we examine these reasonings closely, we would see many other similar principles with the same key features at play. In this group of principles, complete induction is just the simplest one, which is why I chose it as an example.

The current name, principle of complete induction, is not justified. This mode of reasoning is none the less a true[Pg 453] mathematical induction which differs from ordinary induction only by its certitude.

The current name, principle of complete induction, isn't justified. This way of reasoning is still a valid[Pg 453] mathematical induction that differs from regular induction only in its certainty.

IV

Definitions and Assumptions

Definitions and Assumptions

The existence of such principles is a difficulty for the uncompromising logicians; how do they pretend to get out of it? The principle of complete induction, they say, is not an assumption properly so called or a synthetic judgment a priori; it is just simply the definition of whole number. It is therefore a simple convention. To discuss this way of looking at it, we must examine a little closely the relations between definitions and assumptions.

The existence of such principles poses a challenge for strict logicians; how do they plan to address it? They argue that the principle of complete induction isn’t an assumption in the traditional sense or a synthetic judgment a priori; it is merely the definition of whole numbers. Thus, it is simply a convention. To discuss this perspective, we need to take a closer look at the relationship between definitions and assumptions.

Let us go back first to an article by M. Couturat on mathematical definitions which appeared in l'Enseignement mathématique, a magazine published by Gauthier-Villars and by Georg at Geneva. We shall see there a distinction between the direct definition and the definition by postulates.

Let’s first revisit an article by M. Couturat about mathematical definitions that was published in l'Enseignement mathématique, a magazine released by Gauthier-Villars and Georg in Geneva. There, we will find a distinction between the direct definition and the definition by postulates.

"The definition by postulates," says M. Couturat, "applies not to a single notion, but to a system of notions; it consists in enumerating the fundamental relations which unite them and which enable us to demonstrate all their other properties; these relations are postulates."

"The definition by postulates," says M. Couturat, "doesn't apply to just one idea, but to a system of ideas; it involves listing the basic relationships that connect them and that allow us to prove all their other characteristics; these relationships are postulates."

If previously have been defined all these notions but one, then this last will be by definition the thing which verifies these postulates. Thus certain indemonstrable assumptions of mathematics would be only disguised definitions. This point of view is often legitimate; and I have myself admitted it in regard for instance to Euclid's postulate.

If all these concepts have been defined before except for one, then this last one will, by definition, be the thing that confirms these principles. Therefore, some unprovable assumptions in mathematics would simply be rephrased definitions. This perspective is often valid; and I have personally accepted it in relation to Euclid's postulate, for example.

The other assumptions of geometry do not suffice to completely define distance; the distance then will be, by definition, among all the magnitudes which satisfy these other assumptions, that which is such as to make Euclid's postulate true.

The other assumptions of geometry aren't enough to fully define distance; so distance will, by definition, be the one among all the magnitudes that meet these other assumptions that makes Euclid's postulate true.

Well the logicians suppose true for the principle of complete induction what I admit for Euclid's postulate; they want to see in it only a disguised definition.

Well, the logicians believe that true for the principle of complete induction what I accept for Euclid's postulate; they only want to see a hidden definition in it.

But to give them this right, two conditions must be fulfilled. Stuart Mill says every definition implies an assumption, that by which the existence of the defined object is affirmed. According[Pg 454] to that, it would no longer be the assumption which might be a disguised definition, it would on the contrary be the definition which would be a disguised assumption. Stuart Mill meant the word existence in a material and empirical sense; he meant to say that in defining the circle we affirm there are round things in nature.

But to give them this right, two conditions must be met. Stuart Mill says every definition carries an assumption that affirms the existence of the defined object. According[Pg 454] to that, it would no longer be an assumption that could be a hidden definition; instead, it would be the definition that acts as a hidden assumption. Stuart Mill was referring to existence in a tangible and empirical sense; he meant that by defining a circle, we affirm that there are round things in nature.

Under this form, his opinion is inadmissible. Mathematics is independent of the existence of material objects; in mathematics the word exist can have only one meaning, it means free from contradiction. Thus rectified, Stuart Mill's thought becomes exact; in defining a thing, we affirm that the definition implies no contradiction.

Under this form, his opinion is not acceptable. Mathematics is separate from the existence of physical objects; in mathematics, the word exist can only mean one thing: free from contradiction. With this clarification, Stuart Mill's idea becomes accurate; when we define something, we confirm that the definition does not imply any contradiction.

If therefore we have a system of postulates, and if we can demonstrate that these postulates imply no contradiction, we shall have the right to consider them as representing the definition of one of the notions entering therein. If we can not demonstrate that, it must be admitted without proof, and that then will be an assumption; so that, seeking the definition under the postulate, we should find the assumption under the definition.

If we have a system of statements, and if we can show that these statements don’t contradict each other, we can treat them as defining one of the concepts involved. If we can’t prove that, we have to accept it without proof, and that will then be an assumption; therefore, while looking for the definition based on the statement, we would actually discover the assumption based on the definition.

Usually, to show that a definition implies no contradiction, we proceed by example, we try to make an example of a thing satisfying the definition. Take the case of a definition by postulates; we wish to define a notion A, and we say that, by definition, an A is anything for which certain postulates are true. If we can prove directly that all these postulates are true of a certain object B, the definition will be justified; the object B will be an example of an A. We shall be certain that the postulates are not contradictory, since there are cases where they are all true at the same time.

Usually, to show that a definition doesn't lead to any contradictions, we move forward by example, trying to find something that fits the definition. Take the situation of a definition by postulates; we want to define a concept A, and we say that, by definition, an A is anything for which certain postulates hold true. If we can directly prove that all these postulates are true for a specific object B, then the definition will be validated; the object B will serve as an example of an A. We can be confident that the postulates aren't contradictory since there are instances where they are all true at the same time.

But such a direct demonstration by example is not always possible.

But a direct demonstration by example isn't always possible.

To establish that the postulates imply no contradiction, it is then necessary to consider all the propositions deducible from these postulates considered as premises, and to show that, among these propositions, no two are contradictory. If these propositions are finite in number, a direct verification is possible. This case is infrequent and uninteresting. If these propositions are infinite in number, this direct verification can no longer be made;[Pg 455] recourse must be had to procedures where in general it is necessary to invoke just this principle of complete induction which is precisely the thing to be proved.

To prove that the postulates don’t contradict each other, we need to look at all the propositions that can be derived from these postulates treated as premises, and show that none of them are contradictory. If there’s a finite number of these propositions, we can check them directly. However, this situation is rare and not very interesting. If there’s an infinite number of propositions, direct verification isn’t possible anymore;[Pg 455] and we must turn to methods that generally require using this principle of complete induction, which is exactly what we are trying to prove.

This is an explanation of one of the conditions the logicians should satisfy, and further on we shall see they have not done it.

This is an explanation of one of the conditions the logicians should meet, and later we will see that they haven't done it.

V

There is a second. When we give a definition, it is to use it.

There is a second. When we define something, it’s to use it.

We therefore shall find in the sequel of the exposition the word defined; have we the right to affirm, of the thing represented by this word, the postulate which has served for definition? Yes, evidently, if the word has retained its meaning, if we do not attribute to it implicitly a different meaning. Now this is what sometimes happens and it is usually difficult to perceive it; it is needful to see how this word comes into our discourse, and if the gate by which it has entered does not imply in reality a definition other than that stated.

We will therefore find the defined word in the following part of the explanation. Can we say that the concept represented by this word meets the criteria we used for definition? Yes, clearly, if the word still holds its meaning and we aren't secretly giving it a different one. However, this can happen sometimes, and it’s often hard to notice; we need to examine how this word enters our conversation and whether the way it came in suggests a definition different from what has been stated.

This difficulty presents itself in all the applications of mathematics. The mathematical notion has been given a definition very refined and very rigorous; and for the pure mathematician all doubt has disappeared; but if one wishes to apply it to the physical sciences for instance, it is no longer a question of this pure notion, but of a concrete object which is often only a rough image of it. To say that this object satisfies, at least approximately, the definition, is to state a new truth, which experience alone can put beyond doubt, and which no longer has the character of a conventional postulate.

This difficulty appears in all areas of mathematics. The mathematical concept has been defined in a highly refined and rigorous way; for the pure mathematician, there’s no question left. However, when trying to apply it to the physical sciences, what we deal with is not this pure concept but a concrete object that often only loosely reflects it. To claim that this object meets the definition, at least somewhat, is to express a new truth that only experience can confirm and that no longer has the nature of a mere conventional assumption.

But without going beyond pure mathematics, we also meet the same difficulty.

But even when sticking to pure mathematics, we still encounter the same challenge.

You give a subtile definition of numbers; then, once this definition given, you think no more of it; because, in reality, it is not it which has taught you what number is; you long ago knew that, and when the word number further on is found under your pen, you give it the same sense as the first comer. To know what is this meaning and whether it is the same in this phrase or that, it is needful to see how you have been led to speak of number and to introduce this word into these two phrases. I shall not for the moment dilate upon this point, because we shall have occasion to return to it.[Pg 456]

You give a subtle definition of numbers, but once that definition is provided, you don’t think about it anymore. In reality, it’s not that definition that taught you what a number is; you already knew that long ago. So, when you write the word "number" later on, you use it with the same meaning as anyone else would. To understand what this meaning is and whether it's the same in this phrase or that one, you need to consider how you came to talk about numbers and how you introduced this word into these two phrases. I won’t go into detail on this point right now, as we will have the chance to revisit it later.[Pg 456]

Thus consider a word of which we have given explicitly a definition A; afterwards in the discourse we make a use of it which implicitly supposes another definition B. It is possible that these two definitions designate the same thing. But that this is so is a new truth which must either be demonstrated or admitted as an independent assumption.

Thus, think about a word for which we have clearly defined A; later in the conversation, we use it in a way that implicitly assumes another definition B. It's possible that these two definitions refer to the same thing. However, whether this is true is a new truth that must be either proven or accepted as a separate assumption.

We shall see farther on that the logicians have not fulfilled the second condition any better than the first.

We will see later that the logicians haven't met the second condition any better than the first.

VI

The definitions of number are very numerous and very different; I forego the enumeration even of the names of their authors. We should not be astonished that there are so many. If one among them was satisfactory, no new one would be given. If each new philosopher occupying himself with this question has thought he must invent another one, this was because he was not satisfied with those of his predecessors, and he was not satisfied with them because he thought he saw a petitio principii.

The definitions of number are countless and vary widely; I won't even list the names of their authors. We shouldn't be surprised that there are so many. If any one of them were satisfactory, no new definitions would emerge. Each new philosopher tackling this issue believes they need to come up with a new one because they're not happy with the ones from their predecessors, and they're not happy with those because they think they see a circular argument.

I have always felt, in reading the writings devoted to this problem, a profound feeling of discomfort; I was always expecting to run against a petitio principii, and when I did not immediately perceive it, I feared I had overlooked it.

I have always felt a deep sense of unease when reading the writings about this issue; I was always waiting to encounter a petitio principii, and when I didn’t see it right away, I worried I had missed it.

This is because it is impossible to give a definition without using a sentence, and difficult to make a sentence without using a number word, or at least the word several, or at least a word in the plural. And then the declivity is slippery and at each instant there is risk of a fall into petitio principii.

This is because it's impossible to define something without using a sentence, and it's hard to create a sentence without using a number word, or at least the word several, or at least a plural word. And then the slope is slippery, and at every moment, there's a risk of falling into circular reasoning.

I shall devote my attention in what follows only to those of these definitions where the petitio principii is most ably concealed.

I will focus in the following sections only on those definitions where the begging the question is most cleverly hidden.

VII

Symbolic writing

The symbolic language created by Peano plays a very grand rôle in these new researches. It is capable of rendering some service, but I think M. Couturat attaches to it an exaggerated importance which must astonish Peano himself.

The symbolic language created by Peano plays a very important role in these new studies. It can be useful, but I think M. Couturat gives it more importance than it deserves, which would probably surprise Peano himself.

The essential element of this language is certain algebraic[Pg 457] signs which represent the different conjunctions: if, and, or, therefore. That these signs may be convenient is possible; but that they are destined to revolutionize all philosophy is a different matter. It is difficult to admit that the word if acquires, when written C, a virtue it had not when written if. This invention of Peano was first called pasigraphy, that is to say the art of writing a treatise on mathematics without using a single word of ordinary language. This name defined its range very exactly. Later, it was raised to a more eminent dignity by conferring on it the title of logistic. This word is, it appears, employed at the Military Academy, to designate the art of the quartermaster of cavalry, the art of marching and cantoning troops; but here no confusion need be feared, and it is at once seen that this new name implies the design of revolutionizing logic.

The key part of this language is certain algebraic[Pg 457] symbols that represent different conjunctions: if, and, or, therefore. While these symbols might be useful, the idea that they will completely change all of philosophy is another story. It's hard to believe that the word if gains, when written as C, a quality it doesn’t have when written as if. Peano originally called this invention pasigraphy, meaning the art of writing a mathematical treatise without using any ordinary language. This name precisely defines its scope. Later, it was given a more prestigious title: logistic. This term is used at the Military Academy to refer to the art of the cavalry quartermaster, managing the marching and placement of troops; however, there's no risk of confusion here, and it's immediately clear that this new name conveys the intention to transform logic.

We may see the new method at work in a mathematical memoir by Burali-Forti, entitled: Una Questione sui numeri transfiniti, inserted in Volume XI of the Rendiconti del circolo matematico di Palermo.

We can see the new method in action in a mathematical paper by Burali-Forti called: Una Questione sui numeri transfiniti, published in Volume XI of the Rendiconti del circolo matematico di Palermo.

I begin by saying this memoir is very interesting, and my taking it here as example is precisely because it is the most important of all those written in the new language. Besides, the uninitiated may read it, thanks to an Italian interlinear translation.

I want to start by saying this memoir is really interesting, and I'm using it as an example because it's the most significant of all the works written in the new language. Plus, people unfamiliar with it can read it, thanks to an Italian interlinear translation.

What constitutes the importance of this memoir is that it has given the first example of those antinomies met in the study of transfinite numbers and making since some years the despair of mathematicians. The aim, says Burali-Forti, of this note is to show there may be two transfinite numbers (ordinals), a and b, such that a is neither equal to, greater than, nor less than b.

What makes this memoir important is that it provides the first example of the contradictions encountered in the study of transfinite numbers, which have frustrated mathematicians for years. The goal, according to Burali-Forti, of this note is to demonstrate that there can be two transfinite numbers (ordinals), a and b, such that a is neither equal to, greater than, nor less than b.

To reassure the reader, to comprehend the considerations which follow, he has no need of knowing what a transfinite ordinal number is.

To reassure the reader and understand the following points, there's no need to know what a transfinite ordinal number is.

Now, Cantor had precisely proved that between two transfinite numbers as between two finite, there can be no other relation than equality or inequality in one sense or the other. But it is not of the substance of this memoir that I wish to speak here; that would carry me much too far from my subject; I only wish to consider the form, and just to ask if this form makes it gain[Pg 458] much in rigor and whether it thus compensates for the efforts it imposes upon the writer and the reader.

Now, Cantor had definitively shown that between two transfinite numbers, just like between two finite numbers, the only relationship possible is equality or inequality in one way or another. However, that’s not the main focus of this paper; discussing it would distract me from my topic. I just want to examine the format and ask whether this format significantly enhances rigor and if it justifies the effort it demands from both the writer and the reader.

First we see Burali-Forti define the number 1 as follows:

First we see Burali-Forti define the number 1 like this:

a definition eminently fitted to give an idea of the number 1 to persons who had never heard speak of it.

a definition perfectly suited to give an idea of the number 1 to people who had never heard it mentioned.

I understand Peanian too ill to dare risk a critique, but still I fear this definition contains a petitio principii, considering that I see the figure 1 in the first member and Un in letters in the second.

I get that Peanian is too sick to take the chance on a critique, but I still worry that this definition has a petitio principii, since I see the number 1 in the first part and "Un" in letters in the second.

However that may be, Burali-Forti starts from this definition and, after a short calculation, reaches the equation:

However that may be, Burali-Forti begins with this definition and, after a brief calculation, arrives at the equation:

which tells us that One is a number.

which tells us that One is a number.

And since we are on these definitions of the first numbers, we recall that M. Couturat has also defined 0 and 1.

And since we're discussing these definitions of the first numbers, we remember that M. Couturat also defined 0 and 1.

What is zero? It is the number of elements of the null class. And what is the null class? It is that containing no element.

What is zero? It is the number of elements in the empty set. And what is the empty set? It is the one that contains no elements.

To define zero by null, and null by no, is really to abuse the wealth of language; so M. Couturat has introduced an improvement in his definition, by writing:

To define zero as null, and null as no, is really to misuse the richness of language; so M. Couturat has made an improvement in his definition by writing:

which means: zero is the number of things satisfying a condition never satisfied.

which means: zero is the number of things that meet a condition never fulfilled.

But as never means in no case I do not see that the progress is great.

But since never means in no case, I don't see that the progress is significant.

I hasten to add that the definition M. Couturat gives of the number 1 is more satisfactory.

I want to add that M. Couturat's definition of the number 1 is more satisfying.

One, says he in substance, is the number of elements in a class in which any two elements are identical.

One, he says in essence, is the number of elements in a class where any two elements are the same.

It is more satisfactory, I have said, in this sense that to define 1, he does not use the word one; in compensation, he uses the word two. But I fear, if asked what is two, M. Couturat would have to use the word one.

It is more satisfying, as I’ve said, in this way that to define 1, he doesn’t use the word one; instead, he uses the word two. But I worry that if asked what two is, M. Couturat would have to use the word one.

VIII

But to return to the memoir of Burali-Forti; I have said his conclusions are in direct opposition to those of Cantor. Now, one day M. Hadamard came to see me and the talk fell upon this antinomy.

But let's get back to Burali-Forti's memoir; I mentioned that his conclusions contradict Cantor's. One day, M. Hadamard came to visit me, and we started discussing this contradiction.

"Burali-Forti's reasoning," I said, "does it not seem to you irreproachable?" "No, and on the contrary I find nothing to object to in that of Cantor. Besides, Burali-Forti had no right to speak of the aggregate of all the ordinal numbers."

"Burali-Forti's reasoning," I said, "doesn't it seem flawless to you?" "No, and actually, I find nothing wrong with Cantor's reasoning. Furthermore, Burali-Forti had no right to refer to the total of all the ordinal numbers."

"Pardon, he had the right, since he could always put

"Pardon, he had the right, since he could always put

I should like to know who was to prevent him, and can it be said a thing does not exist, when we have called it Ω?"

I would like to know who was supposed to stop him, and can we really say something doesn't exist when we have named it Ω?

It was in vain, I could not convince him (which besides would have been sad, since he was right). Was it merely because I do not speak the Peanian with enough eloquence? Perhaps; but between ourselves I do not think so.

It was pointless; I couldn’t change his mind (which would have been unfortunate since he was right). Was it just because I don’t speak Peanian well enough? Maybe; but honestly, I don’t think so.

Thus, despite all this pasigraphic apparatus, the question was not solved. What does that prove? In so far as it is a question only of proving one a number, pasigraphy suffices, but if a difficulty presents itself, if there is an antinomy to solve, pasigraphy becomes impotent.

Thus, even with all this writing system in place, the question remained unresolved. What does that show? If it’s just about assigning a number, the writing system is enough, but if a problem arises, if there’s a contradiction to address, the writing system becomes useless.


CHAPTER IV

The New Logic

I

The Russell Logic

To justify its pretensions, logic had to change. We have seen new logics arise of which the most interesting is that of Russell. It seems he has nothing new to write about formal logic, as if Aristotle there had touched bottom. But the domain Russell attributes to logic is infinitely more extended than that of the classic logic, and he has put forth on the subject views which are original and at times well warranted.

To justify its claims, logic had to evolve. We’ve witnessed the emergence of new logics, the most intriguing of which belongs to Russell. It appears he has nothing new to say about formal logic, as if Aristotle had already covered everything. However, the scope that Russell assigns to logic is vastly broader than that of classical logic, and he has presented ideas on the topic that are original and occasionally well-supported.

First, Russell subordinates the logic of classes to that of propositions, while the logic of Aristotle was above all the logic of classes and took as its point of departure the relation of subject to predicate. The classic syllogism, "Socrates is a man," etc., gives place to the hypothetical syllogism: "If A is true, B is true; now if B is true, C is true," etc. And this is, I think, a most happy idea, because the classic syllogism is easy to carry back to the hypothetical syllogism, while the inverse transformation is not without difficulty.

First, Russell prioritizes the logic of propositions over that of classes, while Aristotle's logic primarily focused on classes and started from the relationship between the subject and the predicate. The classic syllogism, "Socrates is a man," etc., is replaced by the hypothetical syllogism: "If A is true, B is true; now if B is true, C is true," etc. I think this is a really great idea because it's easy to convert the classic syllogism back into the hypothetical syllogism, but doing the reverse transformation is not without its challenges.

And then this is not all. Russell's logic of propositions is the study of the laws of combination of the conjunctions if, and, or, and the negation not.

And that's not all. Russell's logic of propositions examines the rules for combining the conjunctions if, and, or, and the negation not.

In adding here two other conjunctions, and and or, Russell opens to logic a new field. The symbols and, or follow the same laws as the two signs × and +, that is to say the commutative associative and distributive laws. Thus and represents logical multiplication, while or represents logical addition. This also is very interesting.

In adding two more conjunctions, and and or, Russell introduces a new area to logic. The symbols and and or follow the same rules as the two signs × and +, meaning the commutative, associative, and distributive laws apply. Therefore, and represents logical multiplication, while or represents logical addition. This is also quite interesting.

Russell reaches the conclusion that any false proposition implies all other propositions true or false. M. Couturat says this conclusion will at first seem paradoxical. It is sufficient however to have corrected a bad thesis in mathematics to recognize[Pg 461] how right Russell is. The candidate often is at great pains to get the first false equation; but that once obtained, it is only sport then for him to accumulate the most surprising results, some of which even may be true.

Russell concludes that any false statement implies all other statements, whether they are true or false. M. Couturat mentions that this conclusion might initially seem paradoxical. However, it's enough to have corrected a flawed thesis in mathematics to understand how accurate Russell is. The candidate often struggles to find the first false equation; but once they have it, it's merely a game for them to gather the most shocking results, some of which may even be true.

II

We see how much richer the new logic is than the classic logic; the symbols are multiplied and allow of varied combinations which are no longer limited in number. Has one the right to give this extension to the meaning of the word logic? It would be useless to examine this question and to seek with Russell a mere quarrel about words. Grant him what he demands; but be not astonished if certain verities declared irreducible to logic in the old sense of the word find themselves now reducible to logic in the new sense—something very different.

We can see how much richer the new logic is compared to the old logic; the symbols are more numerous and allow for different combinations that are no longer limited in number. Is it acceptable to expand the meaning of the word logic this way? There's no point in debating this and getting into a mere argument over semantics with Russell. Let’s give him what he wants; but don’t be surprised if some truths that were once considered irreducible to logic in the traditional sense now turn out to be reducible to logic in the new sense— which is something quite different.

A great number of new notions have been introduced, and these are not simply combinations of the old. Russell knows this, and not only at the beginning of the first chapter, 'The Logic of Propositions,' but at the beginning of the second and third, 'The Logic of Classes' and 'The Logic of Relations,' he introduces new words that he declares indefinable.

A lot of new ideas have been introduced, and these aren't just mixes of the old ones. Russell understands this, and not only at the start of the first chapter, 'The Logic of Propositions,' but also at the beginning of the second and third chapters, 'The Logic of Classes' and 'The Logic of Relations,' he introduces new terms that he says can't be defined.

And this is not all; he likewise introduces principles he declares indemonstrable. But these indemonstrable principles are appeals to intuition, synthetic judgments a priori. We regard them as intuitive when we meet them more or less explicitly enunciated in mathematical treatises; have they changed character because the meaning of the word logic has been enlarged and we now find them in a book entitled Treatise on Logic? They have not changed nature; they have only changed place.

And that's not all; he also introduces principles that he claims can't be proven. But these unprovable principles are based on intuition and synthetic judgments a priori. We see them as intuitive when they are stated more or less clearly in mathematical works; have they changed just because the meaning of the term logic has broadened and we now see them in a book called Treatise on Logic? They haven’t changed in nature; they’ve only changed in location.

III

Could these principles be considered as disguised definitions? It would then be necessary to have some way of proving that they imply no contradiction. It would be necessary to establish that, however far one followed the series of deductions, he would never be exposed to contradicting himself.

Could these principles be seen as hidden definitions? It would then be essential to find a way to prove that they don’t lead to any contradictions. It’s important to show that no matter how far one goes along the series of deductions, they would never end up contradicting themselves.

We might attempt to reason as follows: We can verify that[Pg 462] the operations of the new logic applied to premises exempt from contradiction can only give consequences equally exempt from contradiction. If therefore after n operations we have not met contradiction, we shall not encounter it after n + 1. Thus it is impossible that there should be a moment when contradiction begins, which shows we shall never meet it. Have we the right to reason in this way? No, for this would be to make use of complete induction; and remember, we do not yet know the principle of complete induction.

We might try to reason like this: We can confirm that[Pg 462] the actions of the new logic applied to premises that are free from contradiction can only produce results that are also free from contradiction. So, if after n operations we haven’t come across a contradiction, we won’t run into one after n + 1. Therefore, it’s impossible for there to be a moment when a contradiction starts, which means we will never encounter it. Are we justified in reasoning this way? No, because that would mean using complete induction; and remember, we don’t yet understand the principle of complete induction.

We therefore have not the right to regard these assumptions as disguised definitions and only one resource remains for us, to admit a new act of intuition for each of them. Moreover I believe this is indeed the thought of Russell and M. Couturat.

We don't have the right to see these assumptions as hidden definitions, and the only option left for us is to accept a new act of intuition for each one. Furthermore, I believe this is exactly what Russell and M. Couturat are suggesting.

Thus each of the nine indefinable notions and of the twenty indemonstrable propositions (I believe if it were I that did the counting, I should have found some more) which are the foundation of the new logic, logic in the broad sense, presupposes a new and independent act of our intuition and (why not say it?) a veritable synthetic judgment a priori. On this point all seem agreed, but what Russell claims, and what seems to me doubtful, is that after these appeals to intuition, that will be the end of it; we need make no others and can build all mathematics without the intervention of any new element.

Thus, each of the nine elusive concepts and the twenty unprovable statements (if I were the one counting, I think I would have found a few more) that form the foundation of the new logic, logic in the broad sense, relies on a new and independent act of our intuition and (why not say it?) a true synthetic judgment a priori. Everyone seems to agree on this point, but what Russell argues, and what I find questionable, is that after these appeals to intuition, that will be the end of it; we don’t need to make any further appeals and can construct all of mathematics without involving any new element.

IV

M. Couturat often repeats that this new logic is altogether independent of the idea of number. I shall not amuse myself by counting how many numeral adjectives his exposition contains, both cardinal and ordinal, or indefinite adjectives such as several. We may cite, however, some examples:

M. Couturat often says that this new logic is completely independent of the concept of number. I won’t bother counting how many numeral adjectives are in his explanation, whether they are cardinal, ordinal, or indefinite adjectives like several. However, we can provide some examples:

"The logical product of two or more propositions is....";

"The logical result of two or more propositions is...."

"All propositions are capable only of two values, true and false";

"All statements can only be two values, true and false."

"The relative product of two relations is a relation";

"The relative product of two relations is a relation."

"A relation exists between two terms," etc., etc.

"A connection exists between two terms," etc., etc.

Sometimes this inconvenience would not be unavoidable, but sometimes also it is essential. A relation is incomprehensible[Pg 463] without two terms; it is impossible to have the intuition of the relation, without having at the same time that of its two terms, and without noticing they are two, because, if the relation is to be conceivable, it is necessary that there be two and only two.

Sometimes this inconvenience isn't unavoidable, but sometimes it's essential. A relationship is hard to understand[Pg 463] without two parts; you can't grasp the relationship without also understanding both parts and recognizing that there are two. To conceptualize the relationship, there must be exactly two.

V

Arithmetic

I reach what M. Couturat calls the ordinal theory which is the foundation of arithmetic properly so called. M. Couturat begins by stating Peano's five assumptions, which are independent, as has been proved by Peano and Padoa.

I arrive at what M. Couturat refers to as the ordinal theory, which is the basis of true arithmetic. M. Couturat starts by outlining Peano's five assumptions, which are independent, as demonstrated by Peano and Padoa.

1. Zero is an integer.

Zero is an integer.

2. Zero is not the successor of any integer.

2. Zero isn’t the successor of any integer.

3. The successor of an integer is an integer.

3. The successor of an integer is still an integer.

To this it would be proper to add,

To this, it would be appropriate to add,

Every integer has a successor.

Every integer has a next number.

4. Two integers are equal if their successors are.

4. Two whole numbers are the same if their next numbers are.

The fifth assumption is the principle of complete induction.

The fifth assumption is the principle of complete induction.

M. Couturat considers these assumptions as disguised definitions; they constitute the definition by postulates of zero, of successor, and of integer.

M. Couturat views these assumptions as hidden definitions; they make up the definition by postulates of zero, successor, and integer.

But we have seen that for a definition by postulates to be acceptable we must be able to prove that it implies no contradiction.

But we've seen that for a definition based on postulates to be acceptable, we need to be able to prove that it doesn't imply any contradictions.

Is this the case here? Not at all.

Is that true here? Not at all.

The demonstration can not be made by example. We can not take a part of the integers, for instance the first three, and prove they satisfy the definition.

The demonstration can't be done by example. We can't take a subset of the integers, like the first three, and prove they meet the definition.

If I take the series 0, 1, 2, I see it fulfils the assumptions 1, 2, 4 and 5; but to satisfy assumption 3 it still is necessary that 3 be an integer, and consequently that the series 0, 1, 2, 3, fulfil the assumptions; we might prove that it satisfies assumptions 1, 2, 4, 5, but assumption 3 requires besides that 4 be an integer and that the series 0, 1, 2, 3, 4 fulfil the assumptions, and so on.

If I consider the series 0, 1, 2, it meets the assumptions 1, 2, 4, and 5. However, to meet assumption 3, it's necessary for 3 to be an integer. Therefore, the series 0, 1, 2, 3 also needs to fulfill the assumptions. We can prove that it meets assumptions 1, 2, 4, and 5, but assumption 3 also requires that 4 be an integer, meaning the series 0, 1, 2, 3, 4 must fulfill the assumptions, and this pattern continues.

It is therefore impossible to demonstrate the assumptions for certain integers without proving them for all; we must give up proof by example.[Pg 464]

It is therefore impossible to show the assumptions for certain integers without proving them for all; we have to give up proof by example.[Pg 464]

It is necessary then to take all the consequences of our assumptions and see if they contain no contradiction.

It’s essential to consider all the implications of our assumptions and check if they contradict each other.

If these consequences were finite in number, this would be easy; but they are infinite in number; they are the whole of mathematics, or at least all arithmetic.

If these consequences had a limited number, it would be simple; but they are limitless; they encompass all of mathematics, or at least all arithmetic.

What then is to be done? Perhaps strictly we could repeat the reasoning of number III.

What should we do now? Maybe, to be precise, we could go over the logic from number III again.

But as we have said, this reasoning is complete induction, and it is precisely the principle of complete induction whose justification would be the point in question.

But as we mentioned, this reasoning is complete induction, and it's exactly the principle of complete induction that would need to be justified.

VI

The Logic of Hilbert

I come now to the capital work of Hilbert which he communicated to the Congress of Mathematicians at Heidelberg, and of which a French translation by M. Pierre Boutroux appeared in l'Enseignement mathématique, while an English translation due to Halsted appeared in The Monist.[13] In this work, which contains profound thoughts, the author's aim is analogous to that of Russell, but on many points he diverges from his predecessor.

I now turn to Hilbert's main work, which he presented to the Congress of Mathematicians in Heidelberg. A French translation by M. Pierre Boutroux was published in l'Enseignement mathématique, and an English version by Halsted appeared in The Monist.[13] This work, filled with deep insights, has an aim similar to that of Russell, but on several points, it takes a different direction from his predecessor.

"But," he says (Monist, p. 340), "on attentive consideration we become aware that in the usual exposition of the laws of logic certain fundamental concepts of arithmetic are already employed; for example, the concept of the aggregate, in part also the concept of number.

"But," he says (Monist, p. 340), "upon closer examination, we realize that the typical explanation of logical laws already incorporates some basic concepts of arithmetic; for instance, the idea of a set, as well as the concept of number."

"We fall thus into a vicious circle and therefore to avoid paradoxes a partly simultaneous development of the laws of logic and arithmetic is requisite."

"We end up in a vicious cycle, so to avoid contradictions, we need a partially simultaneous development of the laws of logic and arithmetic."

We have seen above that what Hilbert says of the principles of logic in the usual exposition applies likewise to the logic of Russell. So for Russell logic is prior to arithmetic; for Hilbert they are 'simultaneous.' We shall find further on other differences still greater, but we shall point them out as we come to them. I prefer to follow step by step the development of Hilbert's thought, quoting textually the most important passages.[Pg 465]

We’ve already established that what Hilbert describes about the principles of logic in the usual exposition also applies to Russell's logic. So, for Russell, logic comes before arithmetic; for Hilbert, they're 'simultaneous.' We’ll uncover even greater differences later on, but we’ll highlight them as we encounter them. I prefer to track the evolution of Hilbert's ideas closely, quoting the most significant passages directly.[Pg 465]

"Let us take as the basis of our consideration first of all a thought-thing 1 (one)" (p. 341). Notice that in so doing we in no wise imply the notion of number, because it is understood that 1 is here only a symbol and that we do not at all seek to know its meaning. "The taking of this thing together with itself respectively two, three or more times...." Ah! this time it is no longer the same; if we introduce the words 'two,' 'three,' and above all 'more,' 'several,' we introduce the notion of number; and then the definition of finite whole number which we shall presently find, will come too late. Our author was too circumspect not to perceive this begging of the question. So at the end of his work he tries to proceed to a truly patching-up process.

"Let’s start by considering a thought-thing 1 (one)" (p. 341). Notice that in doing so, we don’t imply the concept of number, since it’s understood that 1 is just a symbol and we’re not trying to figure out its meaning. "Taking this thing along with itself two, three, or more times...." Ah! Now it’s different; when we use the words 'two,' 'three,' and especially 'more,' or 'several,' we’re introducing the concept of number. At that point, the definition of finite whole number that we’re about to find will come too late. Our author was too careful not to notice this issue. So at the end of his work, he tries to apply a real patching-up process.

Hilbert then introduces two simple objects 1 and =, and considers all the combinations of these two objects, all the combinations of their combinations, etc. It goes without saying that we must forget the ordinary meaning of these two signs and not attribute any to them.

Hilbert then introduces two basic symbols 1 and =, and looks at all the combinations of these two symbols, as well as all the combinations of their combinations, and so on. It's clear that we need to set aside the usual meanings of these two symbols and not assign any significance to them.

Afterwards he separates these combinations into two classes, the class of the existent and the class of the non-existent, and till further orders this separation is entirely arbitrary. Every affirmative statement tells us that a certain combination belongs to the class of the existent; every negative statement tells us that a certain combination belongs to the class of the non-existent.

After that, he divides these combinations into two categories: the category of the existent and the category of the non-existent, and until he states otherwise, this separation is completely arbitrary. Every positive statement indicates that a certain combination is part of the category of the existent; every negative statement indicates that a certain combination is part of the category of the non-existent.

VII

Note now a difference of the highest importance. For Russell any object whatsoever, which he designates by x, is an object absolutely undetermined and about which he supposes nothing; for Hilbert it is one of the combinations formed with the symbols 1 and =; he could not conceive of the introduction of anything other than combinations of objects already defined. Moreover Hilbert formulates his thought in the neatest way, and I think I must reproduce in extenso his statement (p. 348):

Note now a difference of the utmost importance. For Russell, any object he refers to as x is an entirely undefined object about which he assumes nothing; for Hilbert, it’s one of the combinations made with the symbols 1 and =; he could not imagine introducing anything other than combinations of already defined objects. Moreover, Hilbert expresses his ideas in the clearest way, and I believe I should reproduce in extenso his statement (p. 348):

"In the assumptions the arbitraries (as equivalent for the concept 'every' and 'all' in the customary logic) represent only those thought-things and their combinations with one another, which at this stage are laid down as fundamental or are to be[Pg 466] newly defined. Therefore in the deduction of inferences from the assumptions, the arbitraries, which occur in the assumptions, can be replaced only by such thought-things and their combinations.

"In the assumptions, the arbitraries (which are equivalent to the concepts 'every' and 'all' in standard logic) only represent those concepts and their combinations that are established as fundamental or are to be[Pg 466]redefined at this stage. Therefore, when deriving inferences from the assumptions, the arbitraries present in the assumptions can only be substituted with those concepts and their combinations."

"Also we must duly remember, that through the super-addition and making fundamental of a new thought-thing the preceding assumptions undergo an enlargement of their validity, and where necessary, are to be subjected to a change in conformity with the sense."

"Also, we need to remember that by adding a new thought or idea, the earlier assumptions can expand in their validity, and when needed, should be changed to align with the context."

The contrast with Russell's view-point is complete. For this philosopher we may substitute for x not only objects already known, but anything.

The contrast with Russell's viewpoint is total. For this philosopher, we can substitute anything for x, not just objects we already know.

Russell is faithful to his point of view, which is that of comprehension. He starts from the general idea of being, and enriches it more and more while restricting it, by adding new qualities. Hilbert on the contrary recognizes as possible beings only combinations of objects already known; so that (looking at only one side of his thought) we might say he takes the view-point of extension.

Russell is loyal to his perspective, which focuses on understanding. He begins with a broad concept of existence and gradually refines it by incorporating new qualities. In contrast, Hilbert acknowledges as possible beings only combinations of already known objects; thus, if we look at just one aspect of his thought, we could say he adopts the viewpoint of extension.

VIII

Let us continue with the exposition of Hilbert's ideas. He introduces two assumptions which he states in his symbolic language but which signify, in the language of the uninitiated, that every quality is equal to itself and that every operation performed upon two identical quantities gives identical results.

Let’s keep going with explaining Hilbert's ideas. He presents two assumptions in his symbolic language, but in simpler terms, these mean that every quality is the same as itself and that any operation done on two identical quantities will produce the same results.

So stated, they are evident, but thus to present them would be to misrepresent Hilbert's thought. For him mathematics has to combine only pure symbols, and a true mathematician should reason upon them without preconceptions as to their meaning. So his assumptions are not for him what they are for the common people.

So stated, they are clear, but presenting them this way would misrepresent Hilbert's ideas. For him, mathematics should only involve pure symbols, and a true mathematician should work with them without any preconceived notions about their meaning. So his assumptions mean something different to him than they do to the general public.

He considers them as representing the definition by postulates of the symbol (=) heretofore void of all signification. But to justify this definition we must show that these two assumptions lead to no contradiction. For this Hilbert used the reasoning of our number III, without appearing to perceive that he is using complete induction.

He views them as defining the symbol (=), which had previously been meaningless. However, to validate this definition, we need to demonstrate that these two assumptions cause no contradictions. To do this, Hilbert employed the reasoning from our number III, seemingly unaware that he is utilizing complete induction.

IX

The end of Hilbert's memoir is altogether enigmatic and I shall not lay stress upon it. Contradictions accumulate; we feel that the author is dimly conscious of the petitio principii he has committed, and that he seeks vainly to patch up the holes in his argument.

The ending of Hilbert's memoir is completely mysterious, and I won’t emphasize it. Contradictions pile up; we sense that the author is vaguely aware of the petitio principii he's made, and that he’s futilely trying to fix the gaps in his argument.

What does this mean? At the point of proving that the definition of the whole number by the assumption of complete induction implies no contradiction, Hilbert withdraws as Russell and Couturat withdrew, because the difficulty is too great.

What does this mean? At the moment of demonstrating that the definition of whole numbers through the assumption of complete induction leads to no contradictions, Hilbert steps back just like Russell and Couturat did because the challenge is too overwhelming.

X

Geometry

Geometry, says M. Couturat, is a vast body of doctrine wherein the principle of complete induction does not enter. That is true in a certain measure; we can not say it is entirely absent, but it enters very slightly. If we refer to the Rational Geometry of Dr. Halsted (New York, John Wiley and Sons, 1904) built up in accordance with the principles of Hilbert, we see the principle of induction enter for the first time on page 114 (unless I have made an oversight, which is quite possible).[14]

Geometry, according to M. Couturat, is a large body of knowledge where the principle of complete induction isn’t really involved. That's somewhat true; we can't say it’s completely missing, but it plays a very minor role. If we look at Dr. Halsted's Rational Geometry (New York, John Wiley and Sons, 1904), which is based on Hilbert’s principles, we see the principle of induction make an appearance for the first time on page 114 (unless I’ve overlooked something, which is definitely possible).[14]

So geometry, which only a few years ago seemed the domain where the reign of intuition was uncontested, is to-day the realm where the logicians seem to triumph. Nothing could better measure the importance of the geometric works of Hilbert and the profound impress they have left on our conceptions.

So geometry, which just a few years ago seemed to be a field where intuition ruled without question, is now the area where logicians appear to prevail. Nothing better highlights the significance of Hilbert's geometric works and the deep impact they have had on our understanding.

But be not deceived. What is after all the fundamental theorem of geometry? It is that the assumptions of geometry imply no contradiction, and this we can not prove without the principle of induction.

But don’t be fooled. What is, after all, the basic principle of geometry? It’s that the assumptions of geometry imply no contradictions, and we can’t prove this without the principle of induction.

How does Hilbert demonstrate this essential point? By leaning upon analysis and through it upon arithmetic and through it upon the principle of induction.

How does Hilbert show this key point? By relying on analysis, and through that, on arithmetic, and through that, on the principle of induction.

And if ever one invents another demonstration, it will still be necessary to lean upon this principle, since the possible consequences of the assumptions, of which it is necessary to show that they are not contradictory, are infinite in number.

And if someone ever comes up with another demonstration, it will still be important to rely on this principle, because the possible outcomes of the assumptions that need to be shown as non-contradictory are countless.

XI

Conclusion

Our conclusion straightway is that the principle of induction can not be regarded as the disguised definition of the entire world.

Our conclusion right away is that the principle of induction cannot be seen as a hidden definition of the whole world.

Here are three truths: (1) The principle of complete induction; (2) Euclid's postulate; (3) the physical law according to which phosphorus melts at 44° (cited by M. Le Roy).

Here are three truths: (1) The principle of complete induction; (2) Euclid's postulate; (3) the physical law that states phosphorus melts at 44° (cited by M. Le Roy).

These are said to be three disguised definitions: the first, that of the whole number; the second, that of the straight line; the third, that of phosphorus.

These are considered to be three disguised definitions: the first, that of the whole number; the second, that of the straight line; the third, that of phosphorus.

I grant it for the second; I do not admit it for the other two. I must explain the reason for this apparent inconsistency.

I agree with the second one; I don't accept the other two. I need to explain why this seems inconsistent.

First, we have seen that a definition is acceptable only on condition that it implies no contradiction. We have shown likewise that for the first definition this demonstration is impossible; on the other hand, we have just recalled that for the second Hilbert has given a complete proof.

First, we have seen that a definition is acceptable only if it implies no contradiction. We have also shown that for the first definition, this proof is impossible; on the other hand, we have just noted that for the second, Hilbert has provided a complete proof.

As to the third, evidently it implies no contradiction. Does this mean that the definition guarantees, as it should, the existence of the object defined? We are here no longer in the mathematical sciences, but in the physical, and the word existence has no longer the same meaning. It no longer signifies absence of contradiction; it means objective existence.

As for the third point, it clearly doesn't suggest any contradiction. Does this mean that the definition guarantees, as it should, the existence of the defined object? We are no longer in the realm of mathematical sciences but in the physical world, and the term existence doesn't carry the same meaning. It no longer just means lack of contradiction; it refers to objective existence.

You already see a first reason for the distinction I made between the three cases; there is a second. In the applications we have to make of these three concepts, do they present themselves to us as defined by these three postulates?

You already see the first reason for the distinction I made between the three cases; there’s a second. In the applications we need to make of these three concepts, do they appear to us as defined by these three postulates?

The possible applications of the principle of induction are innumerable; take, for example, one of those we have expounded above, and where it is sought to prove that an aggregate of assumptions can lead to no contradiction. For this we consider one of the series of syllogisms we may go on with in starting from these assumptions as premises. When we have finished the nth syllogism, we see we can make still another and this is the n + 1th. Thus the number n serves to count a series of successive operations; it is a number obtainable by successive additions. [Pg 469]This therefore is a number from which we may go back to unity by successive subtractions. Evidently we could not do this if we had n = n − 1, since then by subtraction we should always obtain again the same number. So the way we have been led to consider this number n implies a definition of the finite whole number and this definition is the following: A finite whole number is that which can be obtained by successive additions; it is such that n is not equal to n − 1.

The possible applications of the principle of induction are countless; for example, consider one of the cases we've discussed earlier, where we aim to prove that a combination of assumptions can lead to no contradictions. To demonstrate this, we look at a series of syllogisms that can be built from these assumptions as premises. Once we complete the nth syllogism, we can add another one, which would be the n + 1th. Thus, the number n acts as a count for a series of consecutive operations; it’s a number reached through repeated additions. [Pg 469]This means it's a number that we can trace back to one by successive subtractions. Clearly, we wouldn’t be able to do this if we had n = n − 1, because then subtraction would always yield the same number. Therefore, the way we’ve been led to consider this number n suggests a definition of finite whole numbers, which is: A finite whole number is one that can be reached through repeated additions; it is defined such that n is not equal to n − 1.

That granted, what do we do? We show that if there has been no contradiction up to the nth syllogism, no more will there be up to the n + 1th, and we conclude there never will be. You say: I have the right to draw this conclusion, since the whole numbers are by definition those for which a like reasoning is legitimate. But that implies another definition of the whole number, which is as follows: A whole number is that on which we may reason by recurrence. In the particular case it is that of which we may say that, if the absence of contradiction up to the time of a syllogism of which the number is an integer carries with it the absence of contradiction up to the time of the syllogism whose number is the following integer, we need fear no contradiction for any of the syllogisms whose number is an integer.

That being said, what do we do? We show that if there hasn't been any contradiction up to the nth syllogism, there won't be one up to the n + 1th either, and we conclude that there never will be. You say: I have the right to make this conclusion since whole numbers are, by definition, those for which such reasoning is valid. But that suggests another definition of whole numbers, which is as follows: A whole number is one that we can reason about by recursion. Specifically, it means that if there's been no contradiction up to a syllogism numbered by an integer, then that implies no contradiction up to the syllogism numbered by the next integer, and we need not worry about contradicting any of the syllogisms that fall under integer numbers.

The two definitions are not identical; they are doubtless equivalent, but only in virtue of a synthetic judgment a priori; we can not pass from one to the other by a purely logical procedure. Consequently we have no right to adopt the second, after having introduced the whole number by a way that presupposes the first.

The two definitions aren’t the same; they’re definitely equivalent, but only because of a synthetic judgment a priori; we can’t move from one to the other through purely logical reasoning. Therefore, we can't just choose the second definition after introducing the whole number in a way that assumes the first.

On the other hand, what happens with regard to the straight line? I have already explained this so often that I hesitate to repeat it again, and shall confine myself to a brief recapitulation of my thought. We have not, as in the preceding case, two equivalent definitions logically irreducible one to the other. We have only one expressible in words. Will it be said there is another which we feel without being able to word it, since we have the intuition of the straight line or since we represent to ourselves the straight line? First of all, we can not represent it to ourselves in geometric space, but only in representative space, and then we can represent to ourselves just as well the objects[Pg 470] which possess the other properties of the straight line, save that of satisfying Euclid's postulate. These objects are 'the non-Euclidean straights,' which from a certain point of view are not entities void of sense, but circles (true circles of true space) orthogonal to a certain sphere. If, among these objects equally capable of representation, it is the first (the Euclidean straights) which we call straights, and not the latter (the non-Euclidean straights), this is properly by definition.

On the other hand, what about the straight line? I’ve explained this so many times that I’m hesitant to go over it again, so I’ll keep it brief. We don’t have, like in the previous case, two equivalent definitions that can’t be reduced to each other. We have only one that can be put into words. Could it be said that there’s another version we feel but can’t articulate, since we have an intuition about the straight line or visualize it? First off, we can’t really visualize it in geometric space, only in a representative space. Plus, we can just as easily imagine the objects[Pg 470] that have the other properties of the straight line, except for satisfying Euclid's postulate. These objects are the 'non-Euclidean straights,' which, from a certain perspective, are not meaningless entities, but circles (real circles of real space) that are orthogonal to a specific sphere. If we refer to the first type (the Euclidean straights) as straight lines and not the second (the non-Euclidean straights), that’s simply by definition.

And arriving finally at the third example, the definition of phosphorus, we see the true definition would be: Phosphorus is the bit of matter I see in yonder flask.

And finally getting to the third example, the definition of phosphorus, we see that the true definition is: Phosphorus is the piece of matter I see in that flask over there.

XII

And since I am on this subject, still another word. Of the phosphorus example I said: "This proposition is a real verifiable physical law, because it means that all bodies having all the other properties of phosphorus, save its point of fusion, melt like it at 44°." And it was answered: "No, this law is not verifiable, because if it were shown that two bodies resembling phosphorus melt one at 44° and the other at 50°, it might always be said that doubtless, besides the point of fusion, there is some other unknown property by which they differ."

And since I'm on this topic, just one more thing. About the phosphorus example, I said: "This statement is a real, testable physical law because it means that all substances with all the other characteristics of phosphorus, except for its melting point, will melt at 44°." And the response was: "No, this law can't be tested because if we show that two substances similar to phosphorus melt at 44° and 50° respectively, it could always be argued that aside from the melting point, there is some other unknown characteristic that makes them different."

That was not quite what I meant to say. I should have written, "All bodies possessing such and such properties finite in number (to wit, the properties of phosphorus stated in the books on chemistry, the fusion-point excepted) melt at 44°."

That’s not exactly what I meant to say. I should have written, "All bodies with certain properties that are finite in number (specifically, the properties of phosphorus mentioned in chemistry books, excluding the fusion point) melt at 44°."

And the better to make evident the difference between the case of the straight and that of phosphorus, one more remark. The straight has in nature many images more or less imperfect, of which the chief are the light rays and the rotation axis of the solid. Suppose we find the ray of light does not satisfy Euclid's postulate (for example by showing that a star has a negative parallax), what shall we do? Shall we conclude that the straight being by definition the trajectory of light does not satisfy the postulate, or, on the other hand, that the straight by definition satisfying the postulate, the ray of light is not straight?

And to make the difference between a straight line and phosphorus more clear, let's add one more point. There are many imperfect representations of a straight line in nature, with the main ones being light rays and the axis of rotation of solid objects. If we find that a light ray doesn’t meet Euclid’s postulate (for instance, if we see that a star has a negative parallax), what should we do? Should we conclude that since a straight line is defined as the path of light, it doesn’t meet the postulate? Or should we say that since a straight line must meet the postulate, the light ray isn’t straight?

Assuredly we are free to adopt the one or the other definition and consequently the one or the other conclusion; but to adopt[Pg 471] the first would be stupid, because the ray of light probably satisfies only imperfectly not merely Euclid's postulate, but the other properties of the straight line, so that if it deviates from the Euclidean straight, it deviates no less from the rotation axis of solids which is another imperfect image of the straight line; while finally it is doubtless subject to change, so that such a line which yesterday was straight will cease to be straight to-morrow if some physical circumstance has changed.

Sure, we can choose to accept one definition or the other, and as a result, come to different conclusions. However, going with the first definition would be foolish because a ray of light likely only partially meets not just Euclid's postulate, but also other properties of a straight line. Therefore, if it strays from a Euclidean straight line, it also strays from the axis of rotation of solids, which is another imperfect representation of a straight line. Additionally, it's definitely subject to change, meaning a line that was straight yesterday may no longer be straight tomorrow if some physical condition shifts.

Suppose now we find that phosphorus does not melt at 44°, but at 43.9°. Shall we conclude that phosphorus being by definition that which melts at 44°, this body that we did call phosphorus is not true phosphorus, or, on the other hand, that phosphorous melts at 43.9°? Here again we are free to adopt the one or the other definition and consequently the one or the other conclusion; but to adopt the first would be stupid because we can not be changing the name of a substance every time we determine a new decimal of its fusion-point.

Suppose we find that phosphorus doesn't melt at 44° but at 43.9°. Should we conclude that since phosphorus is defined as the substance that melts at 44°, this substance we called phosphorus isn't true phosphorus, or should we say that phosphorus melts at 43.9°? Again, we can choose one definition or the other and draw a conclusion from that; however, choosing the first would be foolish because we can't keep renaming a substance every time we pinpoint a new decimal place in its melting point.

XIII

To sum up, Russell and Hilbert have each made a vigorous effort; they have each written a work full of original views, profound and often well warranted. These two works give us much to think about and we have much to learn from them. Among their results, some, many even, are solid and destined to live.

To sum up, Russell and Hilbert have both put in strong efforts; they have each authored works filled with fresh ideas that are deep and often well-supported. These two works provide a lot for us to contemplate and there is much we can learn from them. Among their findings, some, many even, are solid and will endure.

But to say that they have finally settled the debate between Kant and Leibnitz and ruined the Kantian theory of mathematics is evidently incorrect. I do not know whether they really believed they had done it, but if they believed so, they deceived themselves.

But saying that they've finally resolved the debate between Kant and Leibnitz and destroyed the Kantian theory of mathematics is clearly wrong. I don't know if they truly thought they had accomplished this, but if they did, they were fooling themselves.


CHAPTER V

The Latest Efforts of the Logisticians

I

The logicians have attempted to answer the preceding considerations. For that, a transformation of logistic was necessary, and Russell in particular has modified on certain points his original views. Without entering into the details of the debate, I should like to return to the two questions to my mind most important: Have the rules of logistic demonstrated their fruitfulness and infallibility? Is it true they afford means of proving the principle of complete induction without any appeal to intuition?

The logicians have tried to address the earlier points. To do this, a shift in logic was needed, and Russell, in particular, has altered some of his original opinions. Without delving into the specifics of the discussion, I want to focus on the two questions I believe are most significant: Have the rules of logic shown their effectiveness and certainty? Is it true that they provide a way to prove the principle of complete induction without relying on intuition?

II

The Infallibility of Logistic

On the question of fertility, it seems M. Couturat has naïve illusions. Logistic, according to him, lends invention 'stilts and wings,' and on the next page: "Ten years ago, Peano published the first edition of his Formulaire." How is that, ten years of wings and not to have flown!

On the topic of fertility, it appears that M. Couturat has some naive misconceptions. According to him, logic gives invention "stilts and wings," and on the next page: "Ten years ago, Peano published the first edition of his Formulaire." How is it that after ten years of wings, there hasn’t been any flight!

I have the highest esteem for Peano, who has done very pretty things (for instance his 'space-filling curve,' a phrase now discarded); but after all he has not gone further nor higher nor quicker than the majority of wingless mathematicians, and would have done just as well with his legs.

I have the greatest respect for Peano, who has created some really impressive work (like his 'space-filling curve,' a term that's fallen out of use); however, in the end, he hasn't achieved more, gone further, or moved faster than most other mathematicians without wings, and he would have done just as well on his own two feet.

On the contrary I see in logistic only shackles for the inventor. It is no aid to conciseness—far from it, and if twenty-seven equations were necessary to establish that 1 is a number, how many would be needed to prove a real theorem? If we distinguish, with Whitehead, the individual x, the class of which the only member is x and which shall be called ιx, then the class of which the only member is the class of which the only member is x and which shall be called μx, do you think these distinctions, useful as they may be, go far to quicken our pace?[Pg 473]

On the contrary, I see logistics as just chains for the inventor. It doesn’t help with being concise—quite the opposite. If twenty-seven equations are needed to prove that 1 is a number, how many would it take to prove a real theorem? If we differentiate, as Whitehead does, between the individual x and the class that only contains x, which we call ιx, then the class that contains only the class that contains x, which we call μx, do you think these distinctions, as useful as they might be, really speed things up?[Pg 473]

Logistic forces us to say all that is ordinarily left to be understood; it makes us advance step by step; this is perhaps surer but not quicker.

Logistics requires us to explain everything that is usually taken for granted; it makes us move forward one step at a time; this might be more reliable but not faster.

It is not wings you logisticians give us, but leading-strings. And then we have the right to require that these leading-strings prevent our falling. This will be their only excuse. When a bond does not bear much interest, it should at least be an investment for a father of a family.

It’s not wings you logistics experts give us, but guiding strings. And then we have the right to expect that these guiding strings keep us from falling. That should be their only justification. When a bond doesn’t earn much interest, it should at least be a good investment for a family man.

Should your rules be followed blindly? Yes, else only intuition could enable us to distinguish among them; but then they must be infallible; for only in an infallible authority can one have a blind confidence. This, therefore, is for you a necessity. Infallible you shall be, or not at all.

Should your rules be followed without question? Yes, otherwise only intuition could help us tell them apart; but they must be foolproof; because only in an infallible authority can we have total trust. Therefore, this is essential for you. You must be infallible, or not at all.

You have no right to say to us: "It is true we make mistakes, but so do you." For us to blunder is a misfortune, a very great misfortune; for you it is death.

You have no right to say to us: "It's true we make mistakes, but so do you." For us to mess up is unfortunate, a very big misfortune; for you, it's a matter of life and death.

Nor may you ask: Does the infallibility of arithmetic prevent errors in addition? The rules of calculation are infallible, and yet we see those blunder who do not apply these rules; but in checking their calculation it is at once seen where they went wrong. Here it is not at all the case; the logicians have applied their rules, and they have fallen into contradiction; and so true is this, that they are preparing to change these rules and to "sacrifice the notion of class." Why change them if they were infallible?

Nor can you ask: Does the infallibility of arithmetic prevent mistakes in addition? The rules of calculation are infallible, yet we see those who make errors when they don’t follow these rules; but when checking their calculations, it's immediately clear where they went wrong. This isn’t the case here; the logicians have applied their rules, and they’ve fallen into contradiction. It’s so true that they’re getting ready to change these rules and "sacrifice the notion of class." Why change them if they were infallible?

"We are not obliged," you say, "to solve hic et nunc all possible problems." Oh, we do not ask so much of you. If, in face of a problem, you would give no solution, we should have nothing to say; but on the contrary you give us two of them and those contradictory, and consequently at least one false; this it is which is failure.

"We're not required," you say, "to solve all possible problems right here and now." Oh, we don’t expect that much from you. If, when faced with a problem, you offered no solution, we wouldn’t have anything to complain about; but instead, you give us two solutions that contradict each other, which means at least one of them is wrong; that's what we consider failure.

Russell seeks to reconcile these contradictions, which can only be done, according to him, "by restricting or even sacrificing the notion of class." And M. Couturat, discovering the success of his attempt, adds: "If the logicians succeed where others have failed, M. Poincaré will remember this phrase, and give the honor of the solution to logistic."

Russell is trying to resolve these contradictions, which he believes can only be achieved "by limiting or even giving up the concept of class." And M. Couturat, noting the success of his effort, adds: "If the logicians succeed where others have not, M. Poincaré will recall this statement and credit the solution to logistic."

But no! Logistic exists, it has its code which has already had[Pg 474] four editions; or rather this code is logistic itself. Is Mr. Russell preparing to show that one at least of the two contradictory reasonings has transgressed the code? Not at all; he is preparing to change these laws and to abrogate a certain number of them. If he succeeds, I shall give the honor of it to Russell's intuition and not to the Peanian logistic which he will have destroyed.

But no! Logic exists; it has its code, which has already gone through[Pg 474] four editions. Or rather, this code is logic itself. Is Mr. Russell getting ready to demonstrate that at least one of the two conflicting arguments has violated the code? Not at all; he is preparing to alter these laws and repeal a number of them. If he succeeds, I will credit it to Russell's intuition rather than the Peanian logic that he will have dismantled.

III

The Liberty of Contradiction

I made two principal objections to the definition of whole number adopted in logistic. What says M. Couturat to the first of these objections?

I had two main objections to the definition of whole numbers used in logic. What does M. Couturat say about the first objection?

What does the word exist mean in mathematics? It means, I said, to be free from contradiction. This M. Couturat contests. "Logical existence," says he, "is quite another thing from the absence of contradiction. It consists in the fact that a class is not empty." To say: a's exist, is, by definition, to affirm that the class a is not null.

What does the word exist mean in mathematics? It means, I said, to be free from contradiction. M. Couturat disagrees. "Logical existence," he says, "is quite different from the absence of contradiction. It means that a class is not empty." To say: a exists, is, by definition, to affirm that the class a is not null.

And doubtless to affirm that the class a is not null, is, by definition, to affirm that a's exist. But one of the two affirmations is as denuded of meaning as the other, if they do not both signify, either that one may see or touch a's which is the meaning physicists or naturalists give them, or that one may conceive an a without being drawn into contradictions, which is the meaning given them by logicians and mathematicians.

And of course, to assert that the class a is not empty is, by definition, to assert that a exists. However, one of these assertions is just as meaningless as the other if they don't both indicate either that one can see or touch a's, which is what physicists or naturalists mean, or that one can imagine an a without falling into contradictions, which is the meaning provided by logicians and mathematicians.

For M. Couturat, "it is not non-contradiction that proves existence, but it is existence that proves non-contradiction." To establish the existence of a class, it is necessary therefore to establish, by an example, that there is an individual belonging to this class: "But, it will be said, how is the existence of this individual proved? Must not this existence be established, in order that the existence of the class of which it is a part may be deduced? Well, no; however paradoxical may appear the assertion, we never demonstrate the existence of an individual. Individuals, just because they are individuals, are always considered as existent.... We never have to express that an individual exists, absolutely speaking, but only that it exists in a class." M.[Pg 475] Couturat finds his own assertion paradoxical, and he will certainly not be the only one. Yet it must have a meaning. It doubtless means that the existence of an individual, alone in the world, and of which nothing is affirmed, can not involve contradiction; in so far as it is all alone it evidently will not embarrass any one. Well, so let it be; we shall admit the existence of the individual, 'absolutely speaking,' but nothing more. It remains to prove the existence of the individual 'in a class,' and for that it will always be necessary to prove that the affirmation, "Such an individual belongs to such a class," is neither contradictory in itself, nor to the other postulates adopted.

For M. Couturat, "it's not non-contradiction that proves existence, but it's existence that proves non-contradiction." To establish that a class exists, you need to show, through an example, that there is at least one individual in that class: "But, someone might ask, how do we prove this individual's existence? Doesn't this existence need to be established so that we can deduce the existence of the class it belongs to? Well, no; no matter how paradoxical this may seem, we never actually demonstrate the existence of an individual. Individuals, simply because they are individuals, are always assumed to exist.... We never have to state that an individual exists, in absolute terms, but only that it exists within a class." M.[Pg 475] Couturat finds his own statement paradoxical, and he certainly won’t be the only one. But it must have some meaning. It likely means that the existence of an individual, alone in the world, about which nothing is stated, can't lead to contradiction; since it is completely alone, it obviously won't cause any issues. Fine, we’ll acknowledge the existence of the individual, 'in absolute terms,' but nothing more. Now we have to prove the individual's existence 'in a class,' and to do that, we must always show that the claim, "Such an individual belongs to such a class," is neither contradictory by itself nor in conflict with the other accepted postulates.

"It is then," continues M. Couturat, "arbitrary and misleading to maintain that a definition is valid only if we first prove it is not contradictory." One could not claim in prouder and more energetic terms the liberty of contradiction. "In any case, the onus probandi rests upon those who believe that these principles are contradictory." Postulates are presumed to be compatible until the contrary is proved, just as the accused person is presumed innocent. Needless to add that I do not assent to this claim. But, you say, the demonstration you require of us is impossible, and you can not ask us to jump over the moon. Pardon me; that is impossible for you, but not for us, who admit the principle of induction as a synthetic judgment a priori. And that would be necessary for you, as for us.

"It is then," continues M. Couturat, "arbitrary and misleading to say that a definition is valid only if we first prove it's not contradictory." One couldn't express the freedom of contradiction in a prouder and more assertive way. "In any case, the onus probandi is on those who believe these principles are contradictory." We assume postulates are compatible until proven otherwise, just like the accused is presumed innocent. It goes without saying that I do not agree with this claim. But, you say, the proof you want from us is impossible, and you can't ask us to jump over the moon. Forgive me; that may be impossible for you, but not for us, who accept the principle of induction as a synthetic judgment a priori. And that is necessary for you, just as it is for us.

To demonstrate that a system of postulates implies no contradiction, it is necessary to apply the principle of complete induction; this mode of reasoning not only has nothing 'bizarre' about it, but it is the only correct one. It is not 'unlikely' that it has ever been employed; and it is not hard to find 'examples and precedents' of it. I have cited two such instances borrowed from Hilbert's article. He is not the only one to have used it, and those who have not done so have been wrong. What I have blamed Hilbert for is not his having recourse to it (a born mathematician such as he could not fail to see a demonstration was necessary and this the only one possible), but his having recourse without recognizing the reasoning by recurrence.

To show that a system of postulates doesn’t imply any contradictions, it's essential to use the principle of complete induction. This way of reasoning isn’t at all 'weird'; in fact, it’s the only correct method. It’s not 'unlikely' that it has been used, and there are plenty of 'examples and precedents' to support it. I’ve mentioned two examples taken from Hilbert’s article. He isn’t the only one who has used it, and those who haven’t are mistaken. What I criticize Hilbert for isn’t that he used this method (a natural mathematician like him would realize a demonstration was needed and that this was the only possible one), but that he employed it without acknowledging the reasoning involved in recurrence.

IV

The Second Objection

I pointed out a second error of logistic in Hilbert's article. To-day Hilbert is excommunicated and M. Couturat no longer regards him as of the logistic cult; so he asks if I have found the same fault among the orthodox. No, I have not seen it in the pages I have read; I know not whether I should find it in the three hundred pages they have written which I have no desire to read.

I pointed out a second mistake in logic in Hilbert's article. Today, Hilbert is no longer part of the community, and M. Couturat doesn’t consider him a member of the logical group anymore, so he asks if I’ve come across the same mistake among the mainstream thinkers. No, I haven't seen it in the pages I've looked at; I don't know if I would find it in the three hundred pages they've written, which I have no interest in reading.

Only, they must commit it the day they wish to make any application of mathematics. This science has not as sole object the eternal contemplation of its own navel; it has to do with nature and some day it will touch it. Then it will be necessary to shake off purely verbal definitions and to stop paying oneself with words.

Only, they must commit to it on the day they want to apply any mathematics. This science isn't just about endlessly analyzing itself; it relates to nature and will eventually interact with it. Then, it will be necessary to move beyond just verbal definitions and stop fooling ourselves with words.

To go back to the example of Hilbert: always the point at issue is reasoning by recurrence and the question of knowing whether a system of postulates is not contradictory. M. Couturat will doubtless say that then this does not touch him, but it perhaps will interest those who do not claim, as he does, the liberty of contradiction.

To revisit Hilbert's example: the main focus is reasoning by recurrence and whether a system of postulates is contradictory. M. Couturat will probably argue that this doesn’t concern him, but it might interest those who, unlike him, do not advocate for the freedom of contradiction.

We wish to establish, as above, that we shall never encounter contradiction after any number of deductions whatever, provided this number be finite. For that, it is necessary to apply the principle of induction. Should we here understand by finite number every number to which by definition the principle of induction applies? Evidently not, else we should be led to most embarrassing consequences. To have the right to lay down a system of postulates, we must be sure they are not contradictory. This is a truth admitted by most scientists; I should have written by all before reading M. Couturat's last article. But what does this signify? Does it mean that we must be sure of not meeting contradiction after a finite number of propositions, the finite number being by definition that which has all properties of recurrent nature, so that if one of these properties fails—if, for instance, we come upon a contradiction—we shall agree to say that the number in question is not finite? In other words, do[Pg 477] we mean that we must be sure not to meet contradictions, on condition of agreeing to stop just when we are about to encounter one? To state such a proposition is enough to condemn it.

We want to make it clear, as mentioned earlier, that we will never come across a contradiction after any finite number of deductions. To achieve this, we need to use the principle of induction. Should we define a finite number as any number that is covered by the principle of induction? Clearly not, because that would lead to some awkward situations. To be able to establish a set of postulates, we must ensure they are not contradictory. This is a truth accepted by most scientists; I would have written all before reading M. Couturat's latest article. But what does this really mean? Does it imply that we must be certain we won’t face a contradiction after a finite number of propositions, with the finite number defined as having all the properties of recurrent nature? So, if one of these properties fails—like if we come across a contradiction—we agree to say that the number in question is not finite? In other words, do[Pg 477] we mean we need to be sure we won't encounter contradictions, provided we agree to stop right before we hit one? To assert such a proposition is enough to invalidate it.

So, Hilbert's reasoning not only assumes the principle of induction, but it supposes that this principle is given us not as a simple definition, but as a synthetic judgment a priori.

So, Hilbert's reasoning not only relies on the principle of induction, but it assumes that this principle is provided to us not just as a simple definition, but as a synthetic judgment a priori.

To sum up:

In summary:

A demonstration is necessary.

A demonstration is needed.

The only demonstration possible is the proof by recurrence.

The only demonstration possible is the proof by recursion.

This is legitimate only if we admit the principle of induction and if we regard it not as a definition but as a synthetic judgment.

This is valid only if we accept the principle of induction and see it not as a definition but as a synthetic judgment.

V

The Cantor Antinomies

Now to examine Russell's new memoir. This memoir was written with the view to conquer the difficulties raised by those Cantor antinomies to which frequent allusion has already been made. Cantor thought he could construct a science of the infinite; others went on in the way he opened, but they soon ran foul of strange contradictions. These antinomies are already numerous, but the most celebrated are:

Now let’s take a look at Russell's new memoir. This memoir was written to tackle the challenges posed by the Cantor antinomies that have been frequently mentioned. Cantor believed he could create a science of the infinite; others followed in his footsteps, but they quickly encountered bizarre contradictions. There are already many of these antinomies, but the most famous ones are:

1. The Burali-Forti antinomy;

The Burali-Forti paradox;

2. The Zermelo-König antinomy;

2. The Zermelo-König paradox;

3. The Richard antinomy.

The Richard paradox.

Cantor proved that the ordinal numbers (the question is of transfinite ordinal numbers, a new notion introduced by him) can be ranged in a linear series; that is to say that of two unequal ordinals one is always less than the other. Burali-Forti proves the contrary; and in fact he says in substance that if one could range all the ordinals in a linear series, this series would define an ordinal greater than all the others; we could afterwards adjoin 1 and would obtain again an ordinal which would be still greater, and this is contradictory.

Cantor showed that ordinal numbers (specifically, transfinite ordinal numbers, a new concept he introduced) can be arranged in a linear sequence; meaning that between any two different ordinals, one is always lesser than the other. Burali-Forti argues the opposite; he essentially states that if we could arrange all the ordinals in a linear sequence, this sequence would create an ordinal greater than all the others. We could then add 1 to it and get an ordinal that is even greater, and this creates a contradiction.

We shall return later to the Zermelo-König antinomy which is of a slightly different nature. The Richard antinomy[15] is as follows: Consider all the decimal numbers definable by a finite[Pg 478] number of words; these decimal numbers form an aggregate E, and it is easy to see that this aggregate is countable, that is to say we can number the different decimal numbers of this assemblage from 1 to infinity. Suppose the numbering effected, and define a number N as follows: If the nth decimal of the nth number of the assemblage E is

We’ll come back later to the Zermelo-König antinomy, which is a bit different. The Richard antinomy[15] goes like this: Consider all the decimal numbers that can be defined using a finite[Pg 478] number of words; these decimal numbers form a set E, and it’s clear that this set is countable, meaning we can list the different decimal numbers in this group from 1 to infinity. Let’s assume that the listing is done, and define a number N as follows: If the nth decimal of the nth number in the set E is

0, 1, 2, 3, 4, 5, 6, 7, 8, 9

0, 1, 2, 3, 4, 5, 6, 7, 8, 9

the nth decimal of N shall be:

the nth decimal of N will be:

1, 2, 3, 4, 5, 6, 7, 8, 1, 1

1, 2, 3, 4, 5, 6, 7, 8, 1, 1

As we see, N is not equal to the nth number of E, and as n is arbitrary, N does not appertain to E and yet N should belong to this assemblage since we have defined it with a finite number of words.

As we can see, N is not equal to the nth number of E, and since n is arbitrary, N doesn’t belong to E yet N should be part of this group because we defined it using a finite number of words.

We shall later see that M. Richard has himself given with much sagacity the explanation of his paradox and that this extends, mutatis mutandis, to the other like paradoxes. Again, Russell cites another quite amusing paradox: What is the least whole number which can not be defined by a phrase composed of less than a hundred English words?

We will later see that M. Richard has wisely provided the explanation for his paradox, and that this also applies, mutatis mutandis, to other similar paradoxes. Additionally, Russell mentions another quite amusing paradox: What is the smallest whole number that cannot be defined by a phrase made up of fewer than a hundred English words?

This number exists; and in fact the numbers capable of being defined by a like phrase are evidently finite in number since the words of the English language are not infinite in number. Therefore among them will be one less than all the others. And, on the other hand, this number does not exist, because its definition implies contradiction. This number, in fact, is defined by the phrase in italics which is composed of less than a hundred English words; and by definition this number should not be capable of definition by a like phrase.

This number exists; and in fact, the numbers that can be defined by a similar phrase are obviously finite since the words in the English language are not endless. Therefore, there will be one that is less than all the others. On the other hand, this number doesn't exist because its definition is contradictory. This number is defined by the italicized phrase, which uses fewer than a hundred English words; and by definition, this number shouldn’t be able to be defined by a similar phrase.

VI

Zigzag Theory and No-class Theory

What is Mr. Russell's attitude in presence of these contradictions? After having analyzed those of which we have just spoken, and cited still others, after having given them a form recalling Epimenides, he does not hesitate to conclude: "A[Pg 479] propositional function of one variable does not always determine a class." A propositional function (that is to say a definition) does not always determine a class. A 'propositional function' or 'norm' may be 'non-predicative.' And this does not mean that these non-predicative propositions determine an empty class, a null class; this does not mean that there is no value of x satisfying the definition and capable of being one of the elements of the class. The elements exist, but they have no right to unite in a syndicate to form a class.

What is Mr. Russell's attitude in the face of these contradictions? After analyzing the ones we've just discussed and mentioning even more, and after shaping them in a way that recalls Epimenides, he confidently concludes: "A[Pg 479] propositional function of one variable does not always define a class." A propositional function (meaning a definition) doesn't always define a class. A 'propositional function' or 'norm' can be 'non-predicative.' And this doesn't mean that these non-predicative propositions define an empty class, or a null class; it doesn't mean there's no value of x that satisfies the definition and can be part of the class. The elements exist, but they aren’t allowed to come together to form a class.

But this is only the beginning and it is needful to know how to recognize whether a definition is or is not predicative. To solve this problem Russell hesitates between three theories which he calls

But this is just the start, and it's important to know how to identify whether a definition is predicative or not. To address this issue, Russell is torn between three theories that he refers to as

A. The zigzag theory;

The zigzag theory;

B. The theory of limitation of size;

B. The theory of limiting size;

C. The no-class theory.

C. The no-class theory.

According to the zigzag theory "definitions (propositional functions) determine a class when they are very simple and cease to do so only when they are complicated and obscure." Who, now, is to decide whether a definition may be regarded as simple enough to be acceptable? To this question there is no answer, if it be not the loyal avowal of a complete inability: "The rules which enable us to recognize whether these definitions are predicative would be extremely complicated and can not commend themselves by any plausible reason. This is a fault which might be remedied by greater ingenuity or by using distinctions not yet pointed out. But hitherto in seeking these rules, I have not been able to find any other directing principle than the absence of contradiction."

According to the zigzag theory, "definitions (propositional functions) determine a class when they are very simple and stop doing so only when they become complicated and unclear." Who, then, gets to decide if a definition is simple enough to be acceptable? There’s really no answer to this question, other than an honest admission of complete inability: "The rules that would help us determine if these definitions are predicative would be extremely complicated and can't be justified by any reasonable argument. This is a problem that might be fixed with more creativity or by using distinctions that haven't been identified yet. But so far, in looking for these rules, I haven't been able to find any guiding principle other than the absence of contradiction."

This theory therefore remains very obscure; in this night a single light—the word zigzag. What Russell calls the 'zigzaginess' is doubtless the particular characteristic which distinguishes the argument of Epimenides.

This theory is still quite unclear; in this darkness, there's just one light—the word zigzag. What Russell refers to as the 'zigzaginess' is definitely the specific trait that sets Epimenides' argument apart.

According to the theory of limitation of size, a class would cease to have the right to exist if it were too extended. Perhaps it might be infinite, but it should not be too much so. But we always meet again the same difficulty; at what precise moment[Pg 480] does it begin to be too much so? Of course this difficulty is not solved and Russell passes on to the third theory.

According to the theory of size limitation, a class would lose its right to exist if it became too large. It might be infinite, but it shouldn't be excessively so. However, we always encounter the same issue: at what exact point does it become overly large?[Pg 480] Naturally, this problem remains unresolved and Russell moves on to the third theory.

In the no-classes theory it is forbidden to speak the word 'class' and this word must be replaced by various periphrases. What a change for logistic which talks only of classes and classes of classes! It becomes necessary to remake the whole of logistic. Imagine how a page of logistic would look upon suppressing all the propositions where it is a question of class. There would only be some scattered survivors in the midst of a blank page. Apparent rari nantes in gurgite vasto.

In the no-classes theory, saying the word "class" is not allowed, and it needs to be replaced with different phrases. What a shift for logic, which only talks about classes and categories of classes! It becomes essential to completely redo the entire field of logic. Imagine what a page of logic would look like if we eliminated all the statements about class. There would only be a few scattered remnants on an otherwise blank page. Apparent rari nantes in gurgite vasto.

Be that as it may, we see how Russell hesitates and the modifications to which he submits the fundamental principles he has hitherto adopted. Criteria are needed to decide whether a definition is too complex or too extended, and these criteria can only be justified by an appeal to intuition.

Be that as it may, we see how Russell hesitates and the changes he makes to the basic principles he has adopted so far. We need criteria to determine if a definition is too complex or too broad, and these criteria can only be justified by relying on intuition.

It is toward the no-classes theory that Russell finally inclines. Be that as it may, logistic is to be remade and it is not clear how much of it can be saved. Needless to add that Cantorism and logistic are alone under consideration; real mathematics, that which is good for something, may continue to develop in accordance with its own principles without bothering about the storms which rage outside it, and go on step by step with its usual conquests which are final and which it never has to abandon.

It is toward the no-classes theory that Russell ultimately leans. Regardless, logic needs to be reworked, and it’s unclear how much of it can be retained. It's worth mentioning that only Cantorism and logic are being considered; real mathematics, which is practical, can keep developing based on its own principles without being affected by the turbulence outside, and continue making its usual, definitive advancements that it never has to relinquish.

VII

The True Solution

What choice ought we to make among these different theories? It seems to me that the solution is contained in a letter of M. Richard of which I have spoken above, to be found in the Revue générale des sciences of June 30, 1905. After having set forth the antinomy we have called Richard's antinomy, he gives its explanation. Recall what has already been said of this antinomy. E is the aggregate of all the numbers definable by a finite number of words, without introducing the notion of the aggregate E itself. Else the definition of E would contain a vicious circle; we must not define E by the aggregate E itself.

What choice should we make among these different theories? It seems to me that the answer is in a letter from M. Richard that I mentioned earlier, found in the Revue générale des sciences from June 30, 1905. After explaining the antinomy we refer to as Richard's antinomy, he provides its explanation. Remember what has already been said about this antinomy. E is the collection of all the numbers that can be defined by a finite number of words, without introducing the concept of the collection E itself. Otherwise, the definition of E would involve a circular reasoning; we must not define E by the collection E itself.

Now we have defined N with a finite number of words, it is[Pg 481] true, but with the aid of the notion of the aggregate E. And this is why N is not part of E. In the example selected by M. Richard, the conclusion presents itself with complete evidence and the evidence will appear still stronger on consulting the text of the letter itself. But the same explanation holds good for the other antinomies, as is easily verified. Thus the definitions which should be regarded as not predicative are those which contain a vicious circle. And the preceding examples sufficiently show what I mean by that. Is it this which Russell calls the 'zigzaginess'? I put the question without answering it.

Now that we've defined N with a specific number of words, it is[Pg 481] true, especially when considering the concept of the aggregate E. This is why N isn't part of E. In the example chosen by M. Richard, the conclusion becomes clear, and the evidence will seem even stronger when you look at the text of the letter itself. The same reasoning applies to the other contradictions, which is easy to verify. So the definitions we should consider as non-predicative are those that involve a circular argument. The previous examples clearly illustrate what I mean by that. Is this what Russell refers to as 'zigzaginess'? I ask the question without providing an answer.

VIII

The Demonstrations of the Principle of Induction

Let us now examine the pretended demonstrations of the principle of induction and in particular those of Whitehead and of Burali-Forti.

Let’s now look at the supposed proofs of the principle of induction, especially those by Whitehead and Burali-Forti.

We shall speak of Whitehead's first, and take advantage of certain new terms happily introduced by Russell in his recent memoir. Call recurrent class every class containing zero, and containing n + 1 if it contains n. Call inductive number every number which is a part of all the recurrent classes. Upon what condition will this latter definition, which plays an essential rôle in Whitehead's proof, be 'predicative' and consequently acceptable?

We will talk about Whitehead's first concept and use some new terms that Russell recently introduced in his memoir. Let's define a recurrent class as any class that contains zero and, if it contains n, also contains n + 1. We'll refer to an inductive number as any number that is part of all the recurrent classes. Under what condition will this second definition, which is crucial to Whitehead's proof, be 'predicative' and therefore acceptable?

In accordance with what has been said, it is necessary to understand by all the recurrent classes, all those in whose definition the notion of inductive number does not enter. Else we fall again upon the vicious circle which has engendered the antinomies.

In line with what has been mentioned, we need to understand by all the recurring classes, all those that do not involve the concept of inductive number in their definition. Otherwise, we end up back in the vicious circle that has created the contradictions.

Now Whitehead has not taken this precaution. Whitehead's reasoning is therefore fallacious; it is the same which led to the antinomies. It was illegitimate when it gave false results; it remains illegitimate when by chance it leads to a true result.

Now Whitehead has not taken this precaution. Whitehead's reasoning is therefore flawed; it is the same reasoning that led to the contradictions. It was invalid when it produced false results; it remains invalid even when, by chance, it leads to a true result.

A definition containing a vicious circle defines nothing. It is of no use to say, we are sure, whatever meaning we may give to our definition, zero at least belongs to the class of inductive numbers; it is not a question of knowing whether this class is void, but whether it can be rigorously deliminated. A[Pg 482] 'non-predicative' class is not an empty class, it is a class whose boundary is undetermined. Needless to add that this particular objection leaves in force the general objections applicable to all the demonstrations.

A definition that involves a circular argument doesn’t define anything. It’s pointless to say, we’re certain, whatever meaning we assign to our definition, that zero at least fits into the category of inductive numbers; the issue isn’t whether this category is empty, but whether it can be clearly defined. A[Pg 482] 'non-predicative' category isn’t an empty category; it’s a category with unclear boundaries. It’s also worth mentioning that this specific objection doesn’t eliminate the general objections that apply to all the arguments.

IX

Burali-Forti has given another demonstration.[16] But he is obliged to assume two postulates: First, there always exists at least one infinite class. The second is thus expressed:

Burali-Forti has shown us something else.[16] But he has to accept two assumptions: First, there is always at least one infinite class. The second is stated as follows:

The first postulate is not more evident than the principle to be proved. The second not only is not evident, but it is false, as Whitehead has shown; as moreover any recruit would see at the first glance, if the axiom had been stated in intelligible language, since it means that the number of combinations which can be formed with several objects is less than the number of these objects.

The first assumption isn't clearer than the principle being proven. The second assumption is not only unclear but also false, as Whitehead pointed out; any newcomer would recognize this right away if the axiom had been explained in simple terms, since it suggests that the number of combinations that can be made with several objects is fewer than the number of those objects.

X

Zermelo's Assumption

A famous demonstration by Zermelo rests upon the following assumption: In any aggregate (or the same in each aggregate of an assemblage of aggregates) we can always choose at random an element (even if this assemblage of aggregates should contain an infinity of aggregates). This assumption had been applied a thousand times without being stated, but, once stated, it aroused doubts. Some mathematicians, for instance M. Borel, resolutely reject it; others admire it. Let us see what, according to his last article, Russell thinks of it. He does not speak out, but his reflections are very suggestive.

A well-known demonstration by Zermelo is based on the following assumption: In any collection (or equally in each collection of a bunch of collections), we can always randomly choose an element (even if this bunch of collections contains an infinite number of collections). This assumption has been used countless times without being explicitly mentioned, but once it was articulated, it raised questions. Some mathematicians, like M. Borel, strongly dismiss it; others appreciate it. Let’s look at what Russell thinks about it according to his latest article. He doesn’t express his thoughts directly, but his reflections are quite suggestive.

And first a picturesque example: Suppose we have as many pairs of shoes as there are whole numbers, and so that we can number the pairs from one to infinity, how many shoes shall we have? Will the number of shoes be equal to the number of pairs? Yes, if in each pair the right shoe is distinguishable from the left; it will in fact suffice to give the number 2n − 1 to the right shoe of the nth pair, and the number 2n to the left[Pg 483] shoe of the nth pair. No, if the right shoe is just like the left, because a similar operation would become impossible—unless we admit Zermelo's assumption, since then we could choose at random in each pair the shoe to be regarded as the right.

And first, let’s consider a colorful example: Imagine we have as many pairs of shoes as there are whole numbers, and we can label the pairs from one to infinity. How many shoes do we end up with? Will the total number of shoes match the number of pairs? Yes, if we can tell the right shoe apart from the left; in fact, we can simply assign the number 2n − 1 to the right shoe of the nth pair and the number 2n to the left shoe of the nth pair. No, if the right shoe is identical to the left, because a similar process wouldn’t work—unless we accept Zermelo's assumption, as that would allow us to choose at random which shoe in each pair is considered the right one.[Pg 483]

XI

Conclusions

A demonstration truly founded upon the principles of analytic logic will be composed of a series of propositions. Some, serving as premises, will be identities or definitions; the others will be deduced from the premises step by step. But though the bond between each proposition and the following is immediately evident, it will not at first sight appear how we get from the first to the last, which we may be tempted to regard as a new truth. But if we replace successively the different expressions therein by their definition and if this operation be carried as far as possible, there will finally remain only identities, so that all will reduce to an immense tautology. Logic therefore remains sterile unless made fruitful by intuition.

A demonstration based on the principles of analytical logic consists of a series of statements. Some of these will act as premises, serving as identities or definitions; the others will be logically derived from these premises step by step. While the connection between each statement and the next is clear, it might not be obvious at first how we move from the first to the last, which we might mistakenly think of as a new truth. However, if we systematically replace the various expressions with their definitions, and continue doing this as much as we can, we will ultimately be left with just identities, resulting in a massive tautology. Therefore, logic remains unproductive if not complemented by intuition.

This I wrote long ago; logistic professes the contrary and thinks it has proved it by actually proving new truths. By what mechanism? Why in applying to their reasonings the procedure just described—namely, replacing the terms defined by their definitions—do we not see them dissolve into identities like ordinary reasonings? It is because this procedure is not applicable to them. And why? Because their definitions are not predicative and present this sort of hidden vicious circle which I have pointed out above; non-predicative definitions can not be substituted for the terms defined. Under these conditions logistic is not sterile, it engenders antinomies.

This was written a long time ago; logic claims the opposite and believes it has proven this by actually demonstrating new truths. But how does that work? When we apply the process I just described—specifically, replacing the defined terms with their definitions—why don’t we see them break down into identities like regular reasoning does? It’s because this process doesn’t apply to them. And why not? Because their definitions aren’t predicative and have a sort of hidden vicious circle that I mentioned earlier; non-predicative definitions can’t be substituted for the terms they define. Under these conditions, logic isn't useless; it creates contradictions.

It is the belief in the existence of the actual infinite which has given birth to those non-predicative definitions. Let me explain. In these definitions the word 'all' figures, as is seen in the examples cited above. The word 'all' has a very precise meaning when it is a question of a finite number of objects; to have another one, when the objects are infinite in number, would require there being an actual (given complete) infinity. Otherwise[Pg 484] all these objects could not be conceived as postulated anteriorly to their definition, and then if the definition of a notion N depends upon all the objects A, it may be infected with a vicious circle, if among the objects A are some indefinable without the intervention of the notion N itself.

It’s the belief in the existence of a true infinite that has led to those non-predicative definitions. Let me clarify. In these definitions, the word 'all' is involved, as seen in the examples mentioned earlier. The word 'all' has a very specific meaning when it comes to a finite number of items; having a different meaning when the items are infinite would require the existence of a complete infinity. Otherwise[Pg 484] all of these items couldn't be imagined as existing prior to their definition, and if the definition of a concept N relies on all the items A, it could fall into a circular reasoning problem if some of the items A can’t be defined without referring to the concept N itself.

The rules of formal logic express simply the properties of all possible classifications. But for them to be applicable it is necessary that these classifications be immutable and that we have no need to modify them in the course of the reasoning. If we have to classify only a finite number of objects, it is easy to keep our classifications without change. If the objects are indefinite in number, that is to say if one is constantly exposed to seeing new and unforeseen objects arise, it may happen that the appearance of a new object may require the classification to be modified, and thus it is we are exposed to antinomies. There is no actual (given complete) infinity. The Cantorians have forgotten this, and they have fallen into contradiction. It is true that Cantorism has been of service, but this was when applied to a real problem whose terms were precisely defined, and then we could advance without fear.

The rules of formal logic simply describe the properties of all possible classifications. However, for these rules to be applicable, the classifications must be unchanging, and we shouldn't need to modify them during reasoning. If we are only classifying a finite number of objects, it's easy to keep our classifications constant. But if the number of objects is indefinite, meaning if we constantly encounter new and unexpected objects, a new object's appearance might require us to change our classification, leading us into contradictions. There is no actual (given complete) infinity. The Cantorians have overlooked this and fallen into contradiction. It's true that Cantorism has been useful, but only when applied to a real problem with clearly defined terms, allowing us to proceed without concern.

Logistic also forgot it, like the Cantorians, and encountered the same difficulties. But the question is to know whether they went this way by accident or whether it was a necessity for them. For me, the question is not doubtful; belief in an actual infinity is essential in the Russell logic. It is just this which distinguishes it from the Hilbert logic. Hilbert takes the view-point of extension, precisely in order to avoid the Cantorian antinomies. Russell takes the view-point of comprehension. Consequently for him the genus is anterior to the species, and the summum genus is anterior to all. That would not be inconvenient if the summum genus was finite; but if it is infinite, it is necessary to postulate the infinite, that is to say to regard the infinite as actual (given complete). And we have not only infinite classes; when we pass from the genus to the species in restricting the concept by new conditions, these conditions are still infinite in number. Because they express generally that the envisaged object presents such or such a relation with all the objects of an infinite class.[Pg 485]

Logistic also forgot about it, like the Cantorians, and faced the same challenges. But the real question is whether they took this route by chance or if it was necessary for them. For me, there’s no doubt; belief in an actual infinity is crucial in Russell's logic. This is what sets it apart from Hilbert's logic. Hilbert adopts the perspective of extension to avoid the Cantorian paradoxes. Russell, on the other hand, takes the perspective of comprehension. Therefore, for him, the genus comes before the species, and the summum genus comes before everything else. This wouldn’t be an issue if the summum genus were finite; however, if it is infinite, it’s essential to assume the infinite, meaning to consider the infinite as actual (completely given). And we not only have infinite classes; when we move from the genus to the species by limiting the concept with new conditions, those conditions are still infinite in number. They generally indicate that the object in question has a specific relationship with all the objects in an infinite class.[Pg 485]

But that is ancient history. Russell has perceived the peril and takes counsel. He is about to change everything, and, what is easily understood, he is preparing not only to introduce new principles which shall allow of operations formerly forbidden, but he is preparing to forbid operations he formerly thought legitimate. Not content to adore what he burned, he is about to burn what he adored, which is more serious. He does not add a new wing to the building, he saps its foundation.

But that’s ancient history. Russell has recognized the danger and is seeking advice. He’s about to change everything, and it’s easy to see that he’s getting ready to introduce new principles that will permit actions that were previously off-limits, but he’s also preparing to ban actions he once believed were acceptable. Not satisfied with just rejecting what he used to embrace, he’s about to reject what he once cherished, which is even more serious. He isn’t just adding another part to the building; he’s undermining its foundation.

The old logistic is dead, so much so that already the zigzag theory and the no-classes theory are disputing over the succession. To judge of the new, we shall await its coming.

The old logistics are gone, to the point that the zigzag theory and the no-classes theory are already competing for their place. To evaluate the new one, we will wait for its arrival.


BOOK III

THE NEW MECHANICS


CHAPTER I

Mechanics and Radium

I

Introduction

The general principles of Dynamics, which have, since Newton, served as foundation for physical science, and which appeared immovable, are they on the point of being abandoned or at least profoundly modified? This is what many people have been asking themselves for some years. According to them, the discovery of radium has overturned the scientific dogmas we believed the most solid: on the one hand, the impossibility of the transmutation of metals; on the other hand, the fundamental postulates of mechanics.

The basic principles of Dynamics, which have been the cornerstone of physical science since Newton and seemed unshakeable, are they about to be discarded or at least deeply changed? This is what many people have been pondering for several years. They believe that the discovery of radium has challenged the scientific beliefs we thought were most robust: on one side, the idea that transmuting metals was impossible; on the other side, the essential principles of mechanics.

Perhaps one is too hasty in considering these novelties as finally established, and breaking our idols of yesterday; perhaps it would be proper, before taking sides, to await experiments more numerous and more convincing. None the less is it necessary, from to-day, to know the new doctrines and the arguments, already very weighty, upon which they rest.

Perhaps we're too quick to see these new ideas as set in stone and to discard our past beliefs; maybe it’s wise to wait for more experiments that are more numerous and convincing before choosing a side. Still, it's important, starting today, to understand the new doctrines and the already significant arguments that support them.

In few words let us first recall in what those principles consist:

In simple terms, let's quickly review what those principles are:

A. The motion of a material point isolated and apart from all exterior force is straight and uniform; this is the principle of inertia: without force no acceleration;

A. The movement of a material point on its own, away from all external forces, is straight and constant; this is the principle of inertia: without force, there is no acceleration;

B. The acceleration of a moving point has the same direction as the resultant of all the forces to which it is subjected; it is equal to the quotient of this resultant by a coefficient called mass of the moving point.

B. The acceleration of a moving object is in the same direction as the combined force acting on it; it is equal to the resulting force divided by a value known as the object's mass.

The mass of a moving point, so defined, is a constant; it does[Pg 487] not depend upon the velocity acquired by this point; it is the same whether the force, being parallel to this velocity, tends only to accelerate or to retard the motion of the point, or whether, on the contrary, being perpendicular to this velocity, it tends to make this motion deviate toward the right, or the left, that is to say to curve the trajectory;

The mass of a moving object, as defined here, is constant; it doesn't change with the speed that the object reaches. It remains the same whether the force acting along this speed is trying to speed it up or slow it down, or whether, on the other hand, it's acting at a right angle to this speed, causing the object to deviate to the right or left, which means to curve the path;

C. All the forces affecting a material point come from the action of other material points; they depend only upon the relative positions and velocities of these different material points.

C. All the forces acting on a material point come from the interaction with other material points; they depend solely on the relative positions and speeds of these different material points.

Combining the two principles B and C, we reach the principle of relative motion, in virtue of which the laws of the motion of a system are the same whether we refer this system to fixed axes, or to moving axes animated by a straight and uniform motion of translation, so that it is impossible to distinguish absolute motion from a relative motion with reference to such moving axes;

Combining the two principles B and C, we arrive at the principle of relative motion, which states that the laws governing the motion of a system remain unchanged whether we consider this system to be at fixed points or to be moving with a straight, steady speed. Therefore, it's impossible to tell the difference between absolute motion and relative motion when referencing such moving points;

D. If a material point A acts upon another material point B, the body B reacts upon A, and these two actions are two equal and directly opposite forces. This is the principle of the equality of action and reaction, or, more briefly, the principle of reaction.

D. If a material point A exerts force on another material point B, then body B also exerts force back on A, and these two forces are equal in strength and directly opposite in direction. This is the principle of action and reaction, or simply, the principle of reaction.

Astronomic observations and the most ordinary physical phenomena seem to have given of these principles a confirmation complete, constant and very precise. This is true, it is now said, but it is because we have never operated with any but very small velocities; Mercury, for example, the fastest of the planets, goes scarcely 100 kilometers a second. Would this planet act the same if it went a thousand times faster? We see there is yet no need to worry; whatever may be the progress of automobilism, it will be long before we must give up applying to our machines the classic principles of dynamics.

Astronomical observations and even the most basic physical phenomena seem to have provided a solid, consistent, and very accurate confirmation of these principles. This is acknowledged today, but it’s because we have only operated at very low speeds; for instance, Mercury, the fastest planet, travels at barely 100 kilometers per second. Would this planet behave the same way if it were moving a thousand times faster? Clearly, there’s no need for concern yet; regardless of advancements in automotive technology, it will be a long time before we have to stop using the traditional principles of dynamics on our machines.

How then have we come to make actual speeds a thousand times greater than that of Mercury, equal, for instance, to a tenth or a third of the velocity of light, or approaching still more closely to that velocity? It is by aid of the cathode rays and the rays from radium.

How have we managed to achieve speeds a thousand times faster than Mercury, comparable to a tenth or a third of the speed of light, or even closer to that speed? It’s thanks to cathode rays and rays from radium.

We know that radium emits three kinds of rays, designated by the three Greek letters α, β, γ; in what follows, unless the contrary be expressly stated, it will always be a question of the β rays, which are analogous to the cathode rays.[Pg 488]

We know that radium releases three types of rays, labeled with the three Greek letters α, β, γ; in the following sections, unless stated otherwise, we will always be referring to the β rays, which are similar to cathode rays.[Pg 488]

After the discovery of the cathode rays two theories appeared. Crookes attributed the phenomena to a veritable molecular bombardment; Hertz, to special undulations of the ether. This was a renewal of the debate which divided physicists a century ago about light; Crookes took up the emission theory, abandoned for light; Hertz held to the undulatory theory. The facts seem to decide in favor of Crookes.

After the discovery of cathode rays, two theories emerged. Crookes believed the phenomena were due to a true molecular bombardment, while Hertz thought they were caused by special waves in the ether. This reignited the debate that had split physicists a hundred years earlier regarding light; Crookes supported the emission theory, which had been abandoned for light, while Hertz adhered to the wave theory. The evidence seems to favor Crookes.

It has been recognized, in the first place, that the cathode rays carry with them a negative electric charge; they are deviated by a magnetic field and by an electric field; and these deviations are precisely such as these same fields would produce upon projectiles animated by a very high velocity and strongly charged with electricity. These two deviations depend upon two quantities: one the velocity, the other the relation of the electric charge of the projectile to its mass; we cannot know the absolute value of this mass, nor that of the charge, but only their relation; in fact, it is clear that if we double at the same time the charge and the mass, without changing the velocity, we shall double the force which tends to deviate the projectile, but, as its mass is also doubled, the acceleration and deviation observable will not be changed. The observation of the two deviations will give us therefore two equations to determine these two unknowns. We find a velocity of from 10,000 to 30,000 kilometers a second; as to the ratio of the charge to the mass, it is very great. We may compare it to the corresponding ratio in regard to the hydrogen ion in electrolysis; we then find that a cathodic projectile carries about a thousand times more electricity than an equal mass of hydrogen would carry in an electrolyte.

It has been recognized, first of all, that cathode rays carry a negative electric charge; they are deflected by a magnetic field and by an electric field; and these deflections are exactly what these same fields would cause on projectiles moving at very high speed and heavily charged with electricity. These two deflections depend on two factors: one is the speed, and the other is the ratio of the electric charge of the projectile to its mass; we cannot determine the absolute values of either the mass or the charge, only their relationship; in fact, it’s clear that if we double both the charge and the mass at the same time, without changing the speed, we will double the force that causes the projectile to deviate, but since its mass is also doubled, the observable acceleration and deviation will not change. The observation of the two deflections will give us two equations to solve for these two unknowns. We find a speed ranging from 10,000 to 30,000 kilometers per second; regarding the ratio of charge to mass, it is very high. We can compare it to the same ratio for the hydrogen ion in electrolysis; we then find that a cathodic projectile carries about a thousand times more electricity than the same mass of hydrogen would carry in an electrolyte.

To confirm these views, we need a direct measurement of this velocity to compare with the velocity so calculated. Old experiments of J. J. Thomson had given results more than a hundred times too small; but they were exposed to certain causes of error. The question was taken up again by Wiechert in an arrangement where the Hertzian oscillations were utilized; results were found agreeing with the theory, at least as to order of magnitude; it would be of great interest to repeat these experiments. However that may be, the theory of undulations appears powerless to account for this complex of facts.[Pg 489]

To confirm these views, we need to directly measure this velocity to compare it with the calculated one. Old experiments by J. J. Thomson produced results that were over a hundred times too small, but they were subject to certain errors. Wiechert revisited the issue using an arrangement that took advantage of Hertzian oscillations; his results matched the theory, at least in terms of order of magnitude. It would be very interesting to repeat these experiments. Regardless, the theory of undulations seems inadequate to explain this complex of facts.[Pg 489]

The same calculations made with reference to the β rays of radium have given velocities still greater: 100,000 or 200,000 kilometers or more yet. These velocities greatly surpass all those we know. It is true that light has long been known to go 300,000 kilometers a second; but it is not a carrying of matter, while, if we adopt the emission theory for the cathode rays, there would be material molecules really impelled at the velocities in question, and it is proper to investigate whether the ordinary laws of mechanics are still applicable to them.

The same calculations referencing the β rays of radium have shown even higher speeds: 100,000 to 200,000 kilometers or more. These speeds far exceed anything we know. It's true that light has been known to travel at 300,000 kilometers per second; however, light doesn't carry matter. If we accept the emission theory for the cathode rays, there would be actual material molecules moving at those velocities, making it important to explore whether the usual laws of mechanics still apply to them.

II

Mass Longitudinal and Mass Transversal

We know that electric currents produce the phenomena of induction, in particular self-induction. When a current increases, there develops an electromotive force of self-induction which tends to oppose the current; on the contrary, when the current decreases, the electromotive force of self-induction tends to maintain the current. The self-induction therefore opposes every variation of the intensity of the current, just as in mechanics the inertia of a body opposes every variation of its velocity.

We know that electric currents create induction effects, especially self-induction. When a current grows, it generates a self-inductive electromotive force that works against the current; conversely, when the current shrinks, the self-inductive electromotive force tries to keep the current going. Therefore, self-induction resists any change in the current's intensity, just like inertia in mechanics resists any change in a body's velocity.

Self-induction is a veritable inertia. Everything happens as if the current could not establish itself without putting in motion the surrounding ether and as if the inertia of this ether tended, in consequence, to keep constant the intensity of this current. It would be requisite to overcome this inertia to establish the current, it would be necessary to overcome it again to make the current cease.

Self-induction is like real inertia. Everything seems to suggest that the current can't set itself up without setting the surrounding ether in motion, and that the inertia of this ether tends to maintain the intensity of the current. To establish the current, we'd need to overcome this inertia, and we'd have to do it again to stop the current.

A cathode ray, which is a rain of projectiles charged with negative electricity, may be likened to a current; doubtless this current differs, at first sight at least, from the currents of ordinary conduction, where the matter does not move and where the electricity circulates through the matter. This is a current of convection, where the electricity, attached to a material vehicle, is carried along by the motion of this vehicle. But Rowland has proved that currents of convection produce the same magnetic effects as currents of conduction; they should produce also the same effects of induction. First, if this were not so, the principle of the conservation of energy would be violated; besides,[Pg 490] Crémieu and Pender have employed a method putting in evidence directly these effects of induction.

A cathode ray, which is a stream of particles carrying a negative electric charge, can be compared to a current; this current clearly differs, at least on the surface, from regular conduction currents, where the material itself doesn’t move and electricity flows through that material. This is a convection current, where electricity, attached to a physical medium, is carried along by the movement of that medium. However, Rowland has shown that convection currents generate the same magnetic effects as conduction currents; they should also produce similar induction effects. First, if this weren’t the case, it would contradict the principle of conservation of energy; furthermore,[Pg 490] Crémieu and Pender have used a method that directly demonstrates these induction effects.

If the velocity of a cathode corpuscle varies, the intensity of the corresponding current will likewise vary; and there will develop effects of self-induction which will tend to oppose this variation. These corpuscles should therefore possess a double inertia: first their own proper inertia, and then the apparent inertia, due to self-induction, which produces the same effects. They will therefore have a total apparent mass, composed of their real mass and of a fictitious mass of electromagnetic origin. Calculation shows that this fictitious mass varies with the velocity, and that the force of inertia of self-induction is not the same when the velocity of the projectile accelerates or slackens, or when it is deviated; therefore so it is with the force of the total apparent inertia.

If the speed of a cathode particle changes, the strength of the corresponding current will also change, and effects of self-induction will arise that will work against this change. These particles should therefore have a double inertia: first, their own actual inertia, and then the apparent inertia from self-induction, which creates the same effects. As a result, they will have a total apparent mass made up of their real mass and an imaginary mass from electromagnetic effects. Calculations show that this imaginary mass changes with speed, and the force of inertia from self-induction is different when the speed of the particle speeds up or slows down, or when it changes direction; thus, the force of the total apparent inertia varies in the same way.

The total apparent mass is therefore not the same when the real force applied to the corpuscle is parallel to its velocity and tends to accelerate the motion as when it is perpendicular to this velocity and tends to make the direction vary. It is necessary therefore to distinguish the total longitudinal mass from the total transversal mass. These two total masses depend, moreover, upon the velocity. This follows from the theoretical work of Abraham.

The total apparent mass is not the same when the actual force applied to the particle is aligned with its velocity and works to speed it up, compared to when it's perpendicular to this velocity and causes its direction to change. Therefore, it's important to differentiate between the total longitudinal mass and the total transversal mass. Additionally, these two total masses depend on the velocity. This is supported by the theoretical work of Abraham.

In the measurements of which we speak in the preceding section, what is it we determine in measuring the two deviations? It is the velocity on the one hand, and on the other hand the ratio of the charge to the total transversal mass. How, under these conditions, can we make out in this total mass the part of the real mass and that of the fictitious electromagnetic mass? If we had only the cathode rays properly so called, it could not be dreamed of; but happily we have the rays of radium which, as we have seen, are notably swifter. These rays are not all identical and do not behave in the same way under the action of an electric field and a magnetic field. It is found that the electric deviation is a function of the magnetic deviation, and we are able, by receiving on a sensitive plate radium rays which have been subjected to the action of the two fields, to photograph the curve which represents the relation between these two deviations. This is what Kaufmann has done, deducing from it the relation[Pg 491] between the velocity and the ratio of the charge to the total apparent mass, a ratio we shall call ε.

In the measurements we discussed in the previous section, what are we determining when we measure the two deviations? It's the velocity on one hand and the ratio of the charge to the total transversal mass on the other. Given these conditions, how can we distinguish between the actual mass and the fictitious electromagnetic mass within this total mass? If we only had the cathode rays per se, it wouldn't be thought possible; but fortunately, we have radium rays, which, as we've seen, are significantly faster. These rays are not all the same and react differently in an electric field and a magnetic field. It turns out that the electric deviation is related to the magnetic deviation, and by capturing radium rays that have been affected by both fields on a sensitive plate, we can photograph the curve that shows the relationship between these two deviations. This is what Kaufmann did, deriving the relation[Pg 491] between the velocity and the ratio of the charge to the total apparent mass, a ratio we will refer to as ε.

One might suppose there are several species of rays, each characterized by a fixed velocity, by a fixed charge and by a fixed mass. But this hypothesis is improbable; why, in fact, would all the corpuscles of the same mass take always the same velocity? It is more natural to suppose that the charge as well as the real mass are the same for all the projectiles, and that these differ only by their velocity. If the ratio ε is a function of the velocity, this is not because the real mass varies with this velocity; but, since the fictitious electromagnetic mass depends upon this velocity, the total apparent mass, alone observable, must depend upon it, though the real mass does not depend upon it and may be constant.

One might think there are several types of rays, each with a fixed speed, charge, and mass. However, this idea doesn't seem likely; why would all the particles with the same mass always move at the same speed? It's more reasonable to assume that both the charge and the actual mass are the same for all the projectiles, and that the only difference among them is their speed. If the ratio ε is influenced by speed, it's not because the actual mass changes with that speed; rather, since the fictitious electromagnetic mass is affected by speed, the total apparent mass, which is the only one we can observe, must depend on it, even though the actual mass does not change and may remain constant.

The calculations of Abraham let us know the law according to which the fictitious mass varies as a function of the velocity; Kaufmann's experiment lets us know the law of variation of the total mass.

The calculations of Abraham inform us about the law governing how the fictitious mass changes with velocity; Kaufmann's experiment reveals the law of variation of the total mass.

The comparison of these two laws will enable us therefore to determine the ratio of the real mass to the total mass.

The comparison of these two laws will help us determine the ratio of the actual mass to the total mass.

Such is the method Kaufmann used to determine this ratio. The result is highly surprising: the real mass is naught.

Such is the method Kaufmann used to determine this ratio. The result is quite surprising: the real mass is nothing.

This has led to conceptions wholly unexpected. What had only been proved for cathode corpuscles was extended to all bodies. What we call mass would be only semblance; all inertia would be of electromagnetic origin. But then mass would no longer be constant, it would augment with the velocity; sensibly constant for velocities up to 1,000 kilometers a second, it then would increase and would become infinite for the velocity of light. The transversal mass would no longer be equal to the longitudinal: they would only be nearly equal if the velocity is not too great. The principle B of mechanics would no longer be true.

This has led to completely unexpected ideas. What was proven only for cathode rays was applied to all objects. What we refer to as mass would just be an illusion; all inertia would come from electromagnetic forces. However, this means mass wouldn’t remain constant; it would increase with speed. It would stay mostly constant at speeds up to 1,000 kilometers per second, but would then rise and approach infinity as it nears the speed of light. The transverse mass wouldn’t be the same as the longitudinal mass: they would only be nearly the same if the speed isn’t too high. The principle B of mechanics would no longer hold true.

III

The Canal Rays

At the point where we now are, this conclusion might seem premature. Can one apply to all matter what has been proved[Pg 492] only for such light corpuscles, which are a mere emanation of matter and perhaps not true matter? But before entering upon this question, a word must be said of another sort of rays. I refer to the canal rays, the Kanalstrahlen of Goldstein.

At this stage, this conclusion might seem rushed. Can we apply what has been proven[Pg 492] to all matter when it’s only been shown for these light particles, which are just a byproduct of matter and may not be real matter? But before we dive into that question, we need to mention another type of rays. I’m talking about the canal rays, the Kanalstrahlen of Goldstein.

The cathode, together with the cathode rays charged with negative electricity, emits canal rays charged with positive electricity. In general, these canal rays not being repelled by the cathode, are confined to the immediate neighborhood of this cathode, where they constitute the `chamois cushion,' not very easy to perceive; but, if the cathode is pierced with holes and if it almost completely blocks up the tube, the canal rays spread back of the cathode, in the direction opposite to that of the cathode rays, and it becomes possible to study them. It is thus that it has been possible to show their positive charge and to show that the magnetic and electric deviations still exist, as for the cathode rays, but are much feebler.

The cathode, along with the negatively charged cathode rays, produces canal rays that carry a positive charge. Generally, these canal rays are not repelled by the cathode, so they stay close to it, forming what is known as the 'chamois cushion,' which isn't easy to see. However, if the cathode has holes and almost completely blocks the tube, the canal rays extend back from the cathode, going in the opposite direction of the cathode rays, making it possible to observe them. This is how it has been demonstrated that they have a positive charge and that magnetic and electric deviations still occur similar to the cathode rays, but they are much weaker.

Radium likewise emits rays analogous to the canal rays, and relatively very absorbable, called α rays.

Radium also produces rays similar to canal rays, which are relatively easy to absorb, known as α rays.

We can, as for the cathode rays, measure the two deviations and thence deduce the velocity and the ratio ε. The results are less constant than for the cathode rays, but the velocity is less, as well as the ratio ε; the positive corpuscles are less charged than the negative; or if, which is more natural, we suppose the charges equal and of opposite sign, the positive corpuscles are much the larger. These corpuscles, charged the ones positively, the others negatively, have been called electrons.

We can measure the two deviations of the cathode rays and then figure out the velocity and the ratio ε. The results are not as consistent as those for the cathode rays, but the velocity is lower, as is the ratio ε; the positive particles carry less charge than the negative ones. If we assume the charges are equal and opposite, which makes more sense, the positive particles are significantly larger. These particles, with one carrying a positive charge and the other a negative charge, are called electrons.

IV

The Theory of Lorentz

But the electrons do not merely show us their existence in these rays where they are endowed with enormous velocities. We shall see them in very different rôles, and it is they that account for the principal phenomena of optics and electricity. The brilliant synthesis about to be noticed is due to Lorentz.

But the electrons don't just show us they're there in these rays where they have enormous speeds. We'll see them in very different roles, and they are responsible for the main phenomena in optics and electricity. The impressive synthesis we'll discuss next is thanks to Lorentz.

Matter is formed solely of electrons carrying enormous charges, and, if it seems to us neutral, this is because the charges of opposite sign of these electrons compensate each other. We[Pg 493] may imagine, for example, a sort of solar system formed of a great positive electron, around which gravitate numerous little planets, the negative electrons, attracted by the electricity of opposite name which charges the central electron. The negative charges of these planets would balance the positive charge of this sun, so that the algebraic sum of all these charges would be naught.

Matter is made up entirely of electrons with huge charges, and if it appears neutral to us, that's because the charges of opposite types in these electrons balance each other out. We[Pg 493] can picture, for instance, a kind of solar system where a large positive electron exists at the center, and many small planets, the negative electrons, orbit around it, drawn in by the electric attraction of the oppositely charged central electron. The negative charges of these planets would counterbalance the positive charge of this sun, resulting in a total charge of zero.

All these electrons swim in the ether. The ether is everywhere identically the same, and perturbations in it are propagated according to the same laws as light or the Hertzian oscillations in vacuo. There is nothing but electrons and ether. When a luminous wave enters a part of the ether where electrons are numerous, these electrons are put in motion under the influence of the perturbation of the ether, and they then react upon the ether. So would be explained refraction, dispersion, double refraction and absorption. Just so, if for any cause an electron be put in motion, it would trouble the ether around it and would give rise to luminous waves, and this would explain the emission of light by incandescent bodies.

All these electrons move through the ether. The ether is everywhere the same, and disturbances in it spread according to the same rules as light or radio waves in a vacuum. There is only electrons and ether. When a light wave enters a part of the ether where there are many electrons, these electrons start moving due to the disturbance in the ether, and they then affect the ether in return. This explains refraction, dispersion, double refraction, and absorption. Similarly, if any electron is set in motion, it would disrupt the ether around it and generate light waves, which explains how hot objects emit light.

In certain bodies, the metals for example, we should have fixed electrons, between which would circulate moving electrons enjoying perfect liberty, save that of going out from the metallic body and breaking the surface which separates it from the exterior void or from the air, or from any other non-metallic body.

In some materials, like metals for instance, we would have stationary electrons, while mobile electrons flow freely among them. The only restriction is that they can't leave the metallic body to break through the surface that separates it from the outside void, the air, or any other non-metal material.

These movable electrons behave then, within the metallic body, as do, according to the kinetic theory of gases, the molecules of a gas within the vase where this gas is confined. But, under the influence of a difference of potential, the negative movable electrons would tend to go all to one side, and the positive movable electrons to the other. This is what would produce electric currents, and this is why these bodies would be conductors. On the other hand, the velocities of our electrons would be the greater the higher the temperature, if we accept the assimilation with the kinetic theory of gases. When one of these movable electrons encounters the surface of the metallic body, whose boundary it can not pass, it is reflected like a billiard ball which has hit the cushion, and its velocity undergoes a sudden change of direction. But when an electron changes direction, as we shall see further[Pg 494] on, it becomes the source of a luminous wave, and this is why hot metals are incandescent.

These movable electrons act, within the metal, much like the molecules of a gas do in a closed container according to the kinetic theory of gases. However, when there is a difference in electric potential, the negative movable electrons will tend to move to one side, while the positive movable electrons will go to the other side. This creates electric currents, and that's why these materials are conductors. Additionally, the faster our electrons move, the higher the temperature, if we relate it to the kinetic theory of gases. When one of these movable electrons hits the surface of the metal, which it cannot pass through, it gets reflected like a billiard ball hitting a cushion, and it suddenly changes direction. But when an electron changes direction, as we’ll explore later[Pg 494], it generates a light wave, which is why heated metals glow.

In other bodies, the dielectrics and the transparent bodies, the movable electrons enjoy much less freedom. They remain as if attached to fixed electrons which attract them. The farther they go away from them the greater becomes this attraction and tends to pull them back. They therefore can make only small excursions; they can no longer circulate, but only oscillate about their mean position. This is why these bodies would not be conductors; moreover they would most often be transparent, and they would be refractive, since the luminous vibrations would be communicated to the movable electrons, susceptible of oscillation, and thence a perturbation would result.

In other materials, like dielectrics and transparent substances, the movable electrons have much less freedom. They seem to be attached to fixed electrons that attract them. The farther they move away, the stronger this attraction becomes, pulling them back. As a result, they can only make small movements; they can't circulate freely but can only oscillate around their average position. That's why these materials don't behave like conductors; in addition, they are often transparent and refractive because the light vibrations are transferred to the movable electrons, which can oscillate, causing a disturbance.

I can not here give the details of the calculations; I confine myself to saying that this theory accounts for all the known facts, and has predicted new ones, such as the Zeeman effect.

I can't go into the details of the calculations here; I will just say that this theory explains all the known facts and has predicted new ones, like the Zeeman effect.

V

Mechanical Consequences

We now may face two hypotheses:

We may now consider two possibilities:

1º The positive electrons have a real mass, much greater than their fictitious electromagnetic mass; the negative electrons alone lack real mass. We might even suppose that apart from electrons of the two signs, there are neutral atoms which have only their real mass. In this case, mechanics is not affected; there is no need of touching its laws; the real mass is constant; simply, motions are deranged by the effects of self-induction, as has always been known; moreover, these perturbations are almost negligible, except for the negative electrons which, not having real mass, are not true matter.

1º Positive electrons have real mass, which is much greater than their imaginary electromagnetic mass; only negative electrons lack real mass. We might even consider that besides the two types of electrons, there are neutral atoms that possess only their real mass. In this scenario, mechanics remains unchanged; there's no need to alter its laws; the real mass is constant; motions are simply disrupted by the effects of self-induction, as has always been understood; furthermore, these disruptions are almost insignificant, except for negative electrons which, lacking real mass, are not considered true matter.

2º But there is another point of view; we may suppose there are no neutral atoms, and the positive electrons lack real mass just as the negative electrons. But then, real mass vanishing, either the word mass will no longer have any meaning, or else it must designate the fictitious electromagnetic mass; in this case, mass will no longer be constant, the transversal mass will no longer be equal to the longitudinal, the principles of mechanics will be overthrown.[Pg 495]

2º But there's another perspective; we could assume there are no neutral atoms, and that positive electrons don't actually have mass just like negative electrons. If that's the case and real mass disappears, either the term mass will lose its meaning, or it will have to refer to the imaginary electromagnetic mass. In that scenario, mass won’t be consistent anymore, the transverse mass won’t be equal to the longitudinal, and the principles of mechanics will be upended.[Pg 495]

First a word of explanation. We have said that, for the same charge, the total mass of a positive electron is much greater than that of a negative. And then it is natural to think that this difference is explained by the positive electron having, besides its fictitious mass, a considerable real mass; which takes us back to the first hypothesis. But we may just as well suppose that the real mass is null for these as for the others, but that the fictitious mass of the positive electron is much the greater since this electron is much the smaller. I say advisedly: much the smaller. And, in fact, in this hypothesis inertia is exclusively electromagnetic in origin; it reduces itself to the inertia of the ether; the electrons are no longer anything by themselves; they are solely holes in the ether and around which the ether moves; the smaller these holes are, the more will there be of ether, the greater, consequently, will be the inertia of the ether.

First, let me clarify. We’ve mentioned that, for the same charge, the total mass of a positive electron is significantly greater than that of a negative one. It's natural to think that this difference comes from the positive electron having, in addition to its fictitious mass, a substantial real mass. This brings us back to the initial hypothesis. However, we can equally assume that the real mass is zero for both types, but that the fictitious mass of the positive electron is much larger since this electron is much smaller. I stress: much smaller. In fact, under this hypothesis, inertia comes solely from electromagnetic sources; it boils down to the inertia of the ether. Electrons no longer exist independently; they are merely gaps in the ether around which the ether flows. The smaller these gaps are, the more ether there will be, and consequently, the greater the inertia of the ether will be.

How shall we decide between these two hypotheses? By operating upon the canal rays as Kaufmann did upon the β rays? This is impossible; the velocity of these rays is much too slight. Should each therefore decide according to his temperament, the conservatives going to one side and the lovers of the new to the other? Perhaps, but, to fully understand the arguments of the innovators, other considerations must come in.

How should we choose between these two hypotheses? Should we experiment with the canal rays like Kaufmann did with the β rays? That's not possible; the speed of these rays is way too low. Should everyone then just go with their gut, with conservatives choosing one side and those who embrace the new opting for the other? Maybe, but to really grasp the arguments of the innovators, we need to take other factors into account.


CHAPTER II

Physics and Light

I

Aberration

You know in what the phenomenon of aberration, discovered by Bradley, consists. The light issuing from a star takes a certain time to go through a telescope; during this time, the telescope, carried along by the motion of the earth, is displaced. If therefore the telescope were pointed in the true direction of the star, the image would be formed at the point occupied by the crossing of the threads of the network when the light has reached the objective; and this crossing would no longer be at this same point when the light reached the plane of the network. We would therefore be led to mis-point the telescope to bring the image upon the crossing of the threads. Thence results that the astronomer will not point the telescope in the direction of the absolute velocity of the light, that is to say toward the true position of the star, but just in the direction of the relative velocity of the light with reference to the earth, that is to say toward what is called the apparent position of the star.

You understand what the phenomenon of aberration, discovered by Bradley, is all about. The light coming from a star takes some time to travel through a telescope; during that time, the telescope, moving with the motion of the earth, shifts position. So, if the telescope were aimed directly at the star, the image would appear at the point where the threads of the crosshair intersect when the light reaches the objective. However, this intersection would no longer be at the same point when the light arrives at the plane of the crosshair. As a result, we would have to adjust the telescope to align the image with the crossing of the threads. This means that the astronomer will not aim the telescope in the direction of the absolute speed of light, or toward the true position of the star, but instead in the direction of the relative speed of light with respect to the earth, which is towards what is known as the apparent position of the star.

The velocity of light is known; we might therefore suppose that we have the means of calculating the absolute velocity of the earth. (I shall soon explain my use here of the word absolute.) Nothing of the sort; we indeed know the apparent position of the star we observe; but we do not know its true position; we know the velocity of the light only in magnitude and not in direction.

The speed of light is known; so we might think that we can calculate the absolute speed of the Earth. (I’ll explain my use of the word absolute soon.) Not at all; we do know the apparent position of the star we’re observing, but we don’t know its actual position; we only know the speed of light in terms of how fast it travels, not its direction.

If therefore the absolute velocity of the earth were straight and uniform, we should never have suspected the phenomenon of aberration; but it is variable; it is composed of two parts: the velocity of the solar system, which is straight and uniform; the velocity of the earth with reference to the sun, which is variable. If the velocity of the solar system, that is to say if the constant part existed alone, the observed direction would be invariable.[Pg 497] This position that one would thus observe is called the mean apparent position of the star.

If the Earth's absolute velocity were constant and straightforward, we would never have noticed the phenomenon of aberration; however, it is variable. It consists of two components: the velocity of the solar system, which is constant and straight, and the Earth's velocity relative to the sun, which changes. If only the constant part of the solar system's velocity existed, the observed direction would remain the same.[Pg 497] The position that we would observe in this case is called the mean apparent position of the star.

Taking account now at the same time of the two parts of the velocity of the earth, we shall have the actual apparent position, which describes a little ellipse around the mean apparent position, and it is this ellipse that we observe.

Taking into account both aspects of the Earth's velocity at the same time, we will have the actual apparent position, which describes a small ellipse around the mean apparent position, and it's this ellipse that we observe.

Neglecting very small quantities, we shall see that the dimensions of this ellipse depend only upon the ratio of the velocity of the earth with reference to the sun to the velocity of light, so that the relative velocity of the earth with regard to the sun has alone come in.

Neglecting very small amounts, we'll see that the size of this ellipse depends only on the ratio of the Earth's speed in relation to the Sun to the speed of light, meaning that only the relative speed of the Earth compared to the Sun has been considered.

But wait! This result is not exact, it is only approximate; let us push the approximation a little farther. The dimensions of the ellipse will depend then upon the absolute velocity of the earth. Let us compare the major axes of the ellipse for the different stars: we shall have, theoretically at least, the means of determining this absolute velocity.

But hold on! This result isn’t precise; it’s just an estimate. Let’s refine the estimate a bit more. The size of the ellipse will depend on the actual speed of the earth. If we compare the major axes of the ellipse for the different stars, we should theoretically be able to figure out this actual speed.

That would be perhaps less shocking than it at first seems; it is a question, in fact, not of the velocity with reference to an absolute void, but of the velocity with regard to the ether, which is taken by definition as being absolutely at rest.

That might be less surprising than it seems at first; it's really a question of speed in relation to an absolute void, but rather the speed with respect to the ether, which is considered by definition to be completely still.

Besides, this method is purely theoretical. In fact, the aberration is very small; the possible variations of the ellipse of aberration are much smaller yet, and, if we consider the aberration as of the first order, they should therefore be regarded as of the second order: about a millionth of a second; they are absolutely inappreciable for our instruments. We shall finally see, further on, why the preceding theory should be rejected, and why we could not determine this absolute velocity even if our instruments were ten thousand times more precise!

Besides, this method is purely theoretical. In reality, the aberration is very small; the potential variations in the aberration ellipse are even smaller, and if we consider the aberration as first order, they should be seen as second order: about a millionth of a second; they are completely negligible for our instruments. We will ultimately see later on why the previous theory should be dismissed, and why we couldn't determine this absolute velocity even if our instruments were ten thousand times more precise!

One might imagine some other means, and in fact, so one has. The velocity of light is not the same in water as in air; could we not compare the two apparent positions of a star seen through a telescope first full of air, then full of water? The results have been negative; the apparent laws of reflection and refraction are not altered by the motion of the earth. This phenomenon is capable of two explanations:

One might think of other methods, and in fact, someone has. The speed of light is different in water than in air; can't we compare the two apparent positions of a star viewed through a telescope first filled with air, then filled with water? The results have been negative; the apparent laws of reflection and refraction don't change with the motion of the earth. This phenomenon can be explained in two ways:

1º It might be supposed that the ether is not at rest, but that [Pg 498] it is carried along by the body in motion. It would then not be astonishing that the phenomena of refraction are not altered by the motion of the earth, since all, prisms, telescopes and ether, are carried along together in the same translation. As to the aberration itself, it would be explained by a sort of refraction happening at the surface of separation of the ether at rest in the interstellar spaces and the ether carried along by the motion of the earth. It is upon this hypothesis (bodily carrying along of the ether) that is founded the theory of Hertz on the electrodynamics of moving bodies.

1º It might be assumed that the ether isn’t at rest, but is being moved along by the body in motion. It wouldn’t be surprising, then, that the phenomena of refraction aren’t affected by the motion of the Earth, since everything—prisms, telescopes, and ether—is moving together in the same direction. As for the aberration itself, it could be explained by a kind of refraction happening at the boundary where the ether at rest in interstellar space meets the ether that is being moved by the Earth's motion. This idea (that the ether is being carried along) is the basis of the theory of Hertz on the electrodynamics of moving bodies.

2º Fresnel, on the contrary, supposes that the ether is at absolute rest in the void, at rest almost absolute in the air, whatever be the velocity of this air, and that it is partially carried along by refractive media. Lorentz has given to this theory a more satisfactory form. For him, the ether is at rest, only the electrons are in motion; in the void, where it is only a question of the ether, in the air, where this is almost the case, the carrying along is null or almost null; in refractive media, where perturbation is produced at the same time by vibrations of the ether and those of electrons put in swing by the agitation of the ether, the undulations are partially carried along.

2º Fresnel, on the other hand, assumes that the ether is completely at rest in a vacuum, and almost at rest in the air, regardless of how fast the air is moving, and that it is partially transported by refractive substances. Lorentz has provided a more satisfactory version of this theory. For him, the ether is stationary, while only the electrons are moving; in a vacuum, where only the ether is involved, and in the air, where this is almost the case, the transport is negligible or nearly non-existent; in refractive materials, where disturbances are caused simultaneously by ether vibrations and electrons set in motion by the agitation of the ether, the waves are partially transported.

To decide between the two hypotheses, we have Fizeau's experiment, comparing by measurements of the fringes of interference, the velocity of light in air at rest or in motion. These experiments have confirmed Fresnel's hypothesis of partial carrying along. They have been repeated with the same result by Michelson. The theory of Hertz must therefore be rejected.

To choose between the two hypotheses, we can look at Fizeau's experiment, which compares the measurements of interference fringes to determine the speed of light in stationary or moving air. These experiments have supported Fresnel's hypothesis of partial dragging. Michelson has repeated them with the same outcome. Thus, Hertz's theory must be dismissed.

II

The Principle of Relativity

But if the ether is not carried along by the motion of the earth, is it possible to show, by means of optical phenomena, the absolute velocity of the earth, or rather its velocity with respect to the unmoving ether? Experiment has answered negatively, and yet the experimental procedures have been varied in all possible ways. Whatever be the means employed there will never be disclosed anything but relative velocities; I mean the [Pg 499] velocities of certain material bodies with reference to other material bodies. In fact, if the source of light and the apparatus of observation are on the earth and participate in its motion, the experimental results have always been the same, whatever be the orientation of the apparatus with reference to the orbital motion of the earth. If astronomic aberration happens, it is because the source, a star, is in motion with reference to the observer.

But if the ether isn’t being moved along with the Earth, is it possible to demonstrate, through optical phenomena, the Earth's absolute speed, or more accurately, its speed in relation to the stationary ether? Experiments have shown this to be negative, even though the methods have been tried in every conceivable way. No matter what methods are used, we will only ever reveal relative speeds; I mean the [Pg 499] speeds of certain physical objects in relation to other physical objects. In reality, if both the light source and the observation equipment are on Earth and move with it, the experimental results have always been the same, regardless of how the equipment is positioned in relation to the Earth’s orbit. If astronomical aberration occurs, it’s because the light source, a star, is moving in relation to the observer.

The hypotheses so far made perfectly account for this general result, if we neglect very small quantities of the order of the square of the aberration. The explanation rests upon the notion of local time, introduced by Lorentz, which I shall try to make clear. Suppose two observers, placed one at A, the other at B, and wishing to set their watches by means of optical signals. They agree that B shall send a signal to A when his watch marks an hour determined upon, and A is to put his watch to that hour the moment he sees the signal. If this alone were done, there would be a systematic error, because as the light takes a certain time t to go from B to A, A's watch would be behind B's the time t. This error is easily corrected. It suffices to cross the signals. A in turn must signal B, and, after this new adjustment, B's watch will be behind A's the time t. Then it will be sufficient to take the arithmetic mean of the two adjustments.

The hypotheses we've discussed so far explain this overall result perfectly, if we ignore very small quantities related to the square of the aberration. The explanation is based on the idea of local time, introduced by Lorentz, which I will try to clarify. Imagine two observers, one at A and the other at B, who want to synchronize their watches using optical signals. They agree that B will send a signal to A when his watch shows a predetermined hour, and A will set his watch to that hour the moment he sees the signal. If they only do this, there will be a systematic error because it takes a certain time t for the light to travel from B to A, causing A's watch to lag behind B's by time t. This error can be easily fixed. They just need to swap the signals. A should signal B, and after this new adjustment, B's watch will be behind A's by time t. Then, it will be enough to take the average of the two adjustments.

But this way of doing supposes that light takes the same time to go from A to B as to return from B to A. That is true if the observers are motionless; it is no longer so if they are carried along in a common translation, since then A, for example, will go to meet the light coming from B, while B will flee before the light coming from A. If therefore the observers are borne along in a common translation and if they do not suspect it, their adjustment will be defective; their watches will not indicate the same time; each will show the local time belonging to the point where it is.

But this way of doing things assumes that light takes the same amount of time to travel from A to B as it does to return from B to A. This is true if the observers are stationary; however, that’s not the case if they are moving together, because then A will be moving toward the light coming from B, while B will be moving away from the light coming from A. So, if the observers are moving together and don’t realize it, their synchronization will be off; their clocks won’t show the same time; each will display the local time for its location.

The two observers will have no way of perceiving this, if the unmoving ether can transmit to them only luminous signals all of the same velocity, and if the other signals they might send are transmitted by media carried along with them in their translation. The phenomenon each observes will be too soon or too[Pg 500] late; it would be seen at the same instant only if the translation did not exist; but as it will be observed with a watch that is wrong, this will not be perceived and the appearances will not be altered.

The two observers won’t be able to notice this if the stationary ether can only send them light signals that all travel at the same speed, and if any other signals they send are transmitted through media that move with them. The phenomenon each one sees will be either too soon or too[Pg 500] late; they would see it at the same moment only if there were no movement. However, since it will be observed with an incorrect watch, this won’t be noticed, and the appearances won’t change.

It results from this that the compensation is easy to explain so long as we neglect the square of the aberration, and for a long time the experiments were not sufficiently precise to warrant taking account of it. But the day came when Michelson imagined a much more delicate procedure: he made rays interfere which had traversed different courses, after being reflected by mirrors; each of the paths approximating a meter and the fringes of interference permitting the recognition of a fraction of a thousandth of a millimeter, the square of the aberration could no longer be neglected, and yet the results were still negative. Therefore the theory required to be completed, and it has been by the Lorentz-Fitzgerald hypothesis.

It follows that the compensation is easy to explain as long as we ignore the square of the aberration, and for a long time the experiments weren't precise enough to justify considering it. However, the day arrived when Michelson came up with a much more sensitive method: he made light rays interfere after they had traveled different paths, being reflected by mirrors; each path was about a meter long, and the interference fringes allowed for the detection of a fraction of a thousandth of a millimeter. The square of the aberration could no longer be ignored, and yet the results were still negative. Therefore, the theory needed to be refined, which was done through the Lorentz-Fitzgerald hypothesis.

These two physicists suppose that all bodies carried along in a translation undergo a contraction in the sense of this translation, while their dimensions perpendicular to this translation remain unchanged. This contraction is the same for all bodies; moreover, it is very slight, about one two-hundred-millionth for a velocity such as that of the earth. Furthermore our measuring instruments could not disclose it, even if they were much more precise; our measuring rods in fact undergo the same contraction as the objects to be measured. If the meter exactly fits when applied to a body, if we point the body and consequently the meter in the sense of the motion of the earth, it will not cease to exactly fit in another orientation, and that although the body and the meter have changed in length as well as orientation, and precisely because the change is the same for one as for the other. But it is quite different if we measure a length, not now with a meter, but by the time taken by light to pass along it, and this is just what Michelson has done.

These two physicists believe that all objects in motion experience a contraction in the direction of that motion, while their dimensions perpendicular to that motion stay the same. This contraction is consistent for all objects; in addition, it's very minor, about one two-hundred-millionth for a speed like that of the Earth. Furthermore, our measurement tools wouldn't detect it, even if they were much more accurate; our measuring rods actually undergo the same contraction as the objects being measured. If the meter fits perfectly when applied to an object, and we align both the object and the meter in the direction of the Earth's motion, it will continue to fit perfectly in a different orientation, even though both the object and the meter have changed in length and orientation, precisely because the change is the same for both. However, it's a different story if we measure a length, not with a meter, but by measuring the time it takes for light to travel that distance, which is exactly what Michelson has done.

A body, spherical when at rest, will take thus the form of a flattened ellipsoid of revolution when in motion; but the observer will always think it spherical, since he himself has undergone an analogous deformation, as also all the objects serving as points of reference. On the contrary, the surfaces of the waves of[Pg 501] light, remaining rigorously spherical, will seem to him elongated ellipsoids.

A body that is spherical when still will become a flattened ellipsoid of revolution when it’s in motion; however, the observer will still perceive it as spherical, since they have also experienced a similar deformation, just like all the objects they use as reference points. In contrast, the surfaces of the waves of[Pg 501] light, which stay perfectly spherical, will appear to the observer as elongated ellipsoids.

What happens then? Suppose an observer and a source of light carried along together in the translation: the wave surfaces emanating from the source will be spheres having as centers the successive positions of the source; the distance from this center to the actual position of the source will be proportional to the time elapsed after the emission, that is to say to the radius of the sphere. All these spheres are therefore homothetic one to the other, with relation to the actual position S of the source. But, for our observer, because of the contraction, all these spheres will seem elongated ellipsoids, and all these ellipsoids will moreover be homothetic, with reference to the point S; the excentricity of all these ellipsoids is the same and depends solely upon the velocity of the earth. We shall so select the law of contraction that the point S may be at the focus of the meridian section of the ellipsoid.

What happens next? Imagine an observer and a source of light moving together: the wave surfaces coming from the source will form spheres centered at the source's successive positions; the distance from this center to the source's actual position will correspond to the time that has passed since the emission, which is to say, the radius of the sphere. All these spheres are therefore similar to each other concerning the source's actual position S. However, for our observer, due to the contraction, all these spheres will appear as elongated ellipsoids, and all these ellipsoids will also be similar, relative to the point S; the eccentricity of all these ellipsoids is the same and depends only on the speed of the earth. We will choose the law of contraction so that the point S is at the focus of the meridian section of the ellipsoid.

This time the compensation is rigorous, and this it is which explains Michelson's experiment.

This time the compensation is strict, and that is what explains Michelson's experiment.

I have said above that, according to the ordinary theories, observations of the astronomic aberration would give us the absolute velocity of the earth, if our instruments were a thousand times more precise. I must modify this statement. Yes, the observed angles would be modified by the effect of this absolute velocity, but the graduated circles we use to measure the angles would be deformed by the translation: they would become ellipses; thence would result an error in regard to the angle measured, and this second error would exactly compensate the first.

I mentioned earlier that, based on standard theories, observations of astronomical aberration would provide us with the earth's absolute velocity if our instruments were a thousand times more precise. I need to change that statement. Sure, the observed angles would be affected by this absolute velocity, but the graduated circles we use to measure those angles would be distorted by the movement: they would turn into ellipses; as a result, there would be an error in the measured angle, and this second error would perfectly offset the first.

This Lorentz-Fitzgerald hypothesis seems at first very extraordinary; all we can say for the moment, in its favor, is that it is only the immediate translation of Michelson's experimental result, if we define lengths by the time taken by light to run along them.

This Lorentz-Fitzgerald hypothesis may seem really surprising at first; all we can say for now to support it is that it simply translates Michelson's experimental results if we define lengths by the time it takes light to travel along them.

However that may be, it is impossible to escape the impression that the principle of relativity is a general law of nature, that one will never be able by any imaginable means to show any but relative velocities, and I mean by that not only the[Pg 502] velocities of bodies with reference to the ether, but the velocities of bodies with regard to one another. Too many different experiments have given concordant results for us not to feel tempted to attribute to this principle of relativity a value comparable to that, for example, of the principle of equivalence. In any case, it is proper to see to what consequences this way of looking at things would lead us and then to submit these consequences to the control of experiment.

However that may be, it’s impossible to avoid the impression that the principle of relativity is a fundamental law of nature, and that it will never be possible, through any conceivable means, to demonstrate anything other than relative velocities. I mean by that not only the [Pg 502] velocities of objects concerning the ether, but also the velocities of objects in relation to one another. Too many different experiments have yielded consistent results for us not to be tempted to assign this principle of relativity a significance comparable to that of the principle of equivalence, for instance. In any case, it’s important to consider the consequences this perspective would lead us to and then to test these consequences through experimentation.

III

The Principle of Reaction

Let us see what the principle of the equality of action and reaction becomes in the theory of Lorentz. Consider an electron A which for any cause begins to move; it produces a perturbation in the ether; at the end of a certain time, this perturbation reaches another electron B, which will be disturbed from its position of equilibrium. In these conditions there can not be equality between action and reaction, at least if we do not consider the ether, but only the electrons, which alone are observable, since our matter is made of electrons.

Let’s look at what the principle of action and reaction means in Lorentz's theory. Imagine an electron A that starts moving for any reason; it creates a disturbance in the ether. After some time, this disturbance reaches another electron B, which gets knocked out of its stable position. Under these conditions, we cannot have equality between action and reaction, at least if we only consider the electrons, which are the only things we can observe, since our matter is made up of electrons.

In fact it is the electron A which has disturbed the electron B; even in case the electron B should react upon A, this reaction could be equal to the action, but in no case simultaneous, since the electron B can begin to move only after a certain time, necessary for the propagation. Submitting the problem to a more exact calculation, we reach the following result: Suppose a Hertz discharger placed at the focus of a parabolic mirror to which it is mechanically attached; this discharger emits electromagnetic waves, and the mirror reflects all these waves in the same direction; the discharger therefore will radiate energy in a determinate direction. Well, the calculation shows that the discharger recoils like a cannon which has shot out a projectile. In the case of the cannon, the recoil is the natural result of the equality of action and reaction. The cannon recoils because the projectile upon which it has acted reacts upon it. But here it is no longer the same. What has been sent out is no longer a material projectile: it is energy, and energy has no mass: it has[Pg 503] no counterpart. And, in place of a discharger, we could have considered just simply a lamp with a reflector concentrating its rays in a single direction.

Actually, it's the electron A that has affected the electron B; even if electron B were to react towards A, that reaction could match the action but wouldn't happen at the same time, because electron B can only start moving after a certain delay, due to how information travels. When we calculate this more precisely, we get the following result: Imagine a Hertz discharger placed at the focus of a parabolic mirror that it's physically attached to; this discharger emits electromagnetic waves, and the mirror reflects all these waves in the same direction, allowing the discharger to radiate energy in a specific direction. The calculation shows that the discharger recoils like a cannon firing a projectile. In the cannon's case, the recoil is a natural consequence of the equal and opposite reaction. The cannon recoils because the projectile it has acted upon pushes back. But here it's different. What has been released is no longer a physical projectile: it's energy, and energy has no mass: it has[Pg 503] no equivalent. Instead of a discharger, we could have simply thought of a lamp with a reflector focusing its rays in one direction.

It is true that, if the energy sent out from the discharger or from the lamp meets a material object, this object receives a mechanical push as if it had been hit by a real projectile, and this push will be equal to the recoil of the discharger and of the lamp, if no energy has been lost on the way and if the object absorbs the whole of the energy. Therefore one is tempted to say that there still is compensation between the action and the reaction. But this compensation, even should it be complete, is always belated. It never happens if the light, after leaving its source, wanders through interstellar spaces without ever meeting a material body; it is incomplete, if the body it strikes is not perfectly absorbent.

It’s true that if the energy emitted from the discharger or the lamp hits a physical object, that object gets a mechanical push as if it were struck by an actual projectile, and this push will match the recoil of the discharger and the lamp, assuming no energy was lost along the way and the object absorbs all the energy. So, it’s tempting to say that there is compensation between the action and the reaction. However, this compensation, even if it’s total, always comes too late. It never occurs if the light, after leaving its source, travels through interstellar space without ever encountering a material object; it’s incomplete if the object it impacts isn’t perfectly absorbent.

Are these mechanical actions too small to be measured, or are they accessible to experiment? These actions are nothing other than those due to the Maxwell-Bartholi pressures; Maxwell had predicted these pressures from calculations relative to electrostatics and magnetism; Bartholi reached the same result by thermodynamic considerations.

Are these mechanical actions too small to measure, or can they be tested through experiments? These actions are simply those caused by the Maxwell-Bartholi pressures; Maxwell predicted these pressures based on calculations related to electrostatics and magnetism, while Bartholi arrived at the same conclusion through thermodynamic considerations.

This is how the tails of comets are explained. Little particles detach themselves from the nucleus of the comet; they are struck by the light of the sun, which pushes them back as would a rain of projectiles coming from the sun. The mass of these particles is so little that this repulsion sweeps it away against the Newtonian attraction; so in moving away from the sun they form the tails.

This is how the tails of comets are explained. Small particles break away from the comet's nucleus; they get hit by sunlight, which pushes them away like a shower of projectiles coming from the sun. The mass of these particles is so small that this repulsion outmatches the gravitational pull; so as they move away from the sun, they create the tails.

The direct experimental verification was not easy to obtain. The first endeavor led to the construction of the radiometer. But this instrument turns backward, in the sense opposite to the theoretic sense, and the explanation of its rotation, since discovered, is wholly different. At last success came, by making the vacuum more complete, on the one hand, and on the other by not blackening one of the faces of the paddles and directing a pencil of luminous rays upon one of the faces. The radiometric effects and the other disturbing causes are eliminated by a series of pains-taking precautions, and one obtains a deviation which is very[Pg 504] minute, but which is, it would seem, in conformity with the theory.

The direct experimental confirmation wasn't easy to achieve. The first attempt led to the creation of the radiometer. However, this instrument turns backward, moving in the opposite direction of what theory predicted, and the explanation for its rotation, which has since been understood, is completely different. Eventually, success came by making the vacuum more complete and by not painting one of the paddle faces black while directing a beam of light at one of the surfaces. Radiometric effects and other interfering factors were minimized through a series of meticulous precautions, resulting in a deviation that is very [Pg 504] small, but seems to align with the theory.

The same effects of the Maxwell-Bartholi pressure are forecast likewise by the theory of Hertz of which we have before spoken, and by that of Lorentz. But there is a difference. Suppose that the energy, under the form of light, for example, proceeds from a luminous source to any body through a transparent medium. The Maxwell-Bartholi pressure will act, not alone upon the source at the departure, and on the body lit up at the arrival, but upon the matter of the transparent medium which it traverses. At the moment when the luminous wave reaches a new region of this medium, this pressure will push forward the matter there distributed and will put it back when the wave leaves this region. So that the recoil of the source has for counterpart the forward movement of the transparent matter which is in contact with this source; a little later, the recoil of this same matter has for counterpart the forward movement of the transparent matter which lies a little further on, and so on.

The same effects of the Maxwell-Bartholi pressure are also predicted by Hertz's theory, which we discussed earlier, and by Lorentz's theory. However, there’s a difference. Imagine that energy, like light, travels from a light source to any object through a transparent medium. The Maxwell-Bartholi pressure will not only act on the source at the beginning and on the illuminated object at the end, but also on the particles of the transparent medium it moves through. When the light wave reaches a new area of this medium, this pressure will push the particles in that area forward and then pull them back when the wave leaves. Thus, the recoil of the source corresponds to the forward movement of the transparent matter in contact with it; shortly after, the recoil of that same matter corresponds to the forward movement of the transparent matter a bit further along, and this process continues.

Only, is the compensation perfect? Is the action of the Maxwell-Bartholi pressure upon the matter of the transparent medium equal to its reaction upon the source, and that whatever be this matter? Or is this action by so much the less as the medium is less refractive and more rarefied, becoming null in the void?

Only, is the compensation perfect? Is the effect of the Maxwell-Bartholi pressure on the transparent medium equal to its reaction on the source, no matter what that source is? Or is this effect reduced the less refractive and more rarefied the medium is, becoming nonexistent in a vacuum?

If we admit the theory of Hertz, who regards matter as mechanically bound to the ether, so that the ether may be entirely carried along by matter, it would be necessary to answer yes to the first question and no to the second.

If we accept Hertz's theory, which sees matter as mechanically linked to the ether, meaning the ether can be completely dragged along by matter, we would need to say yes to the first question and no to the second.

There would then be perfect compensation, as required by the principle of the equality of action and reaction, even in the least refractive media, even in the air, even in the interplanetary void, where it would suffice to suppose a residue of matter, however subtile. If on the contrary we admit the theory of Lorentz, the compensation, always imperfect, is insensible in the air and becomes null in the void.

There would then be perfect compensation, as required by the principle of the equality of action and reaction, even in the least refractive media, even in the air, even in the interplanetary void, where it would suffice to suppose a residue of matter, however subtle. If, on the other hand, we accept Lorentz's theory, the compensation, which is always imperfect, becomes undetectable in the air and disappears completely in the void.

But we have seen above that Fizeau's experiment does not permit of our retaining the theory of Hertz; it is necessary[Pg 505] therefore to adopt the theory of Lorentz, and consequently to renounce the principle of reaction.

But we have seen above that Fizeau's experiment doesn't allow us to hold on to Hertz's theory; therefore, it’s necessary[Pg 505] to adopt Lorentz's theory, and as a result, to give up the principle of reaction.

IV

Consequences of the Principle of Relativity

We have seen above the reasons which impel us to regard the principle of relativity as a general law of nature. Let us see to what consequences this principle would lead, should it be regarded as finally demonstrated.

We have seen the reasons that drive us to consider the principle of relativity as a fundamental law of nature. Now, let's explore the implications this principle would have if it were considered definitively proven.

First, it obliges us to generalize the hypothesis of Lorentz and Fitzgerald on the contraction of all bodies in the sense of the translation. In particular, we must extend this hypothesis to the electrons themselves. Abraham considered these electrons as spherical and indeformable; it will be necessary for us to admit that these electrons, spherical when in repose, undergo the Lorentz contraction when in motion and take then the form of flattened ellipsoids.

First, it requires us to expand the hypothesis of Lorentz and Fitzgerald regarding the contraction of all bodies when they are in motion. Specifically, we need to apply this hypothesis to the electrons themselves. Abraham viewed these electrons as spherical and unchangeable; we must accept that these electrons, which are spherical when at rest, experience Lorentz contraction when in motion and become flattened ellipsoids.

This deformation of the electrons will influence their mechanical properties. In fact I have said that the displacement of these charged electrons is a veritable current of convection and that their apparent inertia is due to the self-induction of this current: exclusively as concerns the negative electrons; exclusively or not, we do not yet know, for the positive electrons. Well, the deformation of the electrons, a deformation which depends upon their velocity, will modify the distribution of the electricity upon their surface, consequently the intensity of the convection current they produce, consequently the laws according to which the self-induction of this current will vary as a function of the velocity.

This distortion of the electrons will impact their mechanical properties. In fact, I've mentioned that the movement of these charged electrons is essentially a convection current and that their apparent inertia is due to the self-induction of this current, specifically for the negative electrons; whether this applies to the positive electrons or not is still unclear. Now, the deformation of the electrons, which depends on their speed, will change the distribution of electricity on their surface, thereby affecting the strength of the convection current they generate, and consequently, the rules governing how the self-induction of this current changes with speed.

At this price, the compensation will be perfect and will conform to the requirements of the principle of relativity, but only upon two conditions:

At this price, the compensation will be ideal and will meet the requirements of the principle of relativity, but only under two conditions:

1º That the positive electrons have no real mass, but only a fictitious electromagnetic mass; or at least that their real mass, if it exists, is not constant and varies with the velocity according to the same laws as their fictitious mass;

1º That positive electrons have no actual mass, but only a pretend electromagnetic mass; or at least that their actual mass, if it exists, isn't constant and changes with speed according to the same rules as their pretend mass;

2º That all forces are of electromagnetic origin, or at least[Pg 506] that they vary with the velocity according to the same laws as the forces of electromagnetic origin.

2º That all forces come from electromagnetic origins, or at least[Pg 506] that they change with velocity according to the same principles as the forces from electromagnetic origins.

It still is Lorentz who has made this remarkable synthesis; stop a moment and see what follows therefrom. First, there is no more matter, since the positive electrons no longer have real mass, or at least no constant real mass. The present principles of our mechanics, founded upon the constancy of mass, must therefore be modified. Again, an electromagnetic explanation must be sought of all the known forces, in particular of gravitation, or at least the law of gravitation must be so modified that this force is altered by velocity in the same way as the electromagnetic forces. We shall return to this point.

It’s still Lorentz who has created this impressive synthesis; take a moment to consider what follows from it. First, there is no more matter, since positive electrons don’t have real mass anymore, or at least not a constant real mass. The current principles of our mechanics, based on the constancy of mass, need to be changed. Additionally, we must find an electromagnetic explanation for all the known forces, particularly gravitation, or at least the law of gravitation must be adjusted so that this force changes with velocity in the same way as electromagnetic forces. We’ll come back to this point.

All that appears, at first sight, a little artificial. In particular, this deformation of electrons seems quite hypothetical. But the thing may be presented otherwise, so as to avoid putting this hypothesis of deformation at the foundation of the reasoning. Consider the electrons as material points and ask how their mass should vary as function of the velocity not to contravene the principle of relativity. Or, still better, ask what should be their acceleration under the influence of an electric or magnetic field, that this principle be not violated and that we come back to the ordinary laws when we suppose the velocity very slight. We shall find that the variations of this mass, or of these accelerations, must be as if the electron underwent the Lorentz deformation.

At first glance, everything seems a bit artificial. In particular, this idea of electron deformation feels quite hypothetical. However, we can present it differently to avoid basing our reasoning on this hypothesis of deformation. Think of the electrons as tiny particles and determine how their mass should change based on their speed so that it doesn't violate the principle of relativity. Or better yet, consider what their acceleration should be in an electric or magnetic field, ensuring that this principle holds true and that we revert to the normal laws when we assume the speed is very low. We will find that the changes in mass, or in these accelerations, must be as if the electron experienced Lorentz deformation.

V

Kaufmann's Experiment

We have before us, then, two theories: one where the electrons are indeformable, this is that of Abraham; the other where they undergo the Lorentz deformation. In both cases, their mass increases with the velocity, becoming infinite when this velocity becomes equal to that of light; but the law of the variation is not the same. The method employed by Kaufmann to bring to light the law of variation of the mass seems therefore to give us an experimental means of deciding between the two theories.

We have two theories to consider: one where electrons are unchangeable, which is Abraham's theory; the other where they experience Lorentz deformation. In both scenarios, their mass increases with speed, reaching infinity when the speed equals that of light; however, the way this change occurs is different. The method used by Kaufmann to reveal the mass variation law appears to provide an experimental way to choose between the two theories.

Unhappily, his first experiments were not sufficiently precise for that; so he decided to repeat them with more precautions, and[Pg 507] measuring with great care the intensity of the fields. Under their new form they are in favor of the theory of Abraham. Then the principle of relativity would not have the rigorous value we were tempted to attribute to it; there would no longer be reason for believing the positive electrons denuded of real mass like the negative electrons. However, before definitely adopting this conclusion, a little reflection is necessary. The question is of such importance that it is to be wished Kaufmann's experiment were repeated by another experimenter.[17] Unhappily, this experiment is very delicate and could be carried out successfully only by a physicist of the same ability as Kaufmann. All precautions have been properly taken and we hardly see what objection could be made.

Unfortunately, his initial experiments weren't precise enough for that, so he decided to repeat them with more care, measuring the intensity of the fields very carefully. In their new form, they support Abraham's theory. This means the principle of relativity wouldn't hold the strict value we thought it did; there would be no reason to believe that positive electrons lack real mass just like negative electrons do. However, before fully embracing this conclusion, some thought is needed. The issue is so significant that it would be ideal for another experimenter to repeat Kaufmann's experiment.[17] Unfortunately, this experiment is very sensitive and could only be successfully conducted by a physicist of equal skill to Kaufmann's. All necessary precautions have been taken, and it's hard to see what objections could be raised.

There is one point however to which I wish to draw attention: that is to the measurement of the electrostatic field, a measurement upon which all depends. This field was produced between the two armatures of a condenser; and, between these armatures, there was to be made an extremely perfect vacuum, in order to obtain a complete isolation. Then the difference of potential of the two armatures was measured, and the field obtained by dividing this difference by the distance apart of the armatures. That supposes the field uniform; is this certain? Might there not be an abrupt fall of potential in the neighborhood of one of the armatures, of the negative armature, for example? There may be a difference of potential at the meeting of the metal and the vacuum, and it may be that this difference is not the same on the positive side and on the negative side; what would lead me to think so is the electric valve effects between mercury and vacuum. However slight the probability that it is so, it seems that it should be considered.

There is one point I want to highlight: the measurement of the electrostatic field, which everything depends on. This field was created between the two plates of a capacitor, and within this space, an extremely perfect vacuum needed to be established for complete isolation. Then, the potential difference between the two plates was measured, and the field was determined by dividing this difference by the distance between the plates. This assumes the field is uniform; but is that certain? Could there be a sudden drop in potential near one of the plates, like the negative plate, for instance? There might be a potential difference at the interface of the metal and the vacuum, and it’s possible that this difference isn’t the same on the positive side as it is on the negative side. What makes me think this is the effects seen with electric valves between mercury and vacuum. No matter how slight the possibility, it seems worth considering.

VI

The Principle of Inertia

In the new dynamics, the principle of inertia is still true, that is to say that an isolated electron will have a straight and uniform motion. At least this is generally assumed; however,[Pg 508] Lindemann has made objections to this view; I do not wish to take part in this discussion, which I can not here expound because of its too difficult character. In any case, slight modifications to the theory would suffice to shelter it from Lindemann's objections.

In the new dynamics, the principle of inertia still applies, meaning that an isolated electron will move in a straight line at a constant speed. This is generally accepted; however,[Pg 508] Lindemann has raised concerns about this perspective. I don’t want to get involved in that discussion, which I can't elaborate on here because it's too complex. In any case, minor adjustments to the theory would be enough to protect it from Lindemann's criticisms.

We know that a body submerged in a fluid experiences, when in motion, considerable resistance, but this is because our fluids are viscous; in an ideal fluid, perfectly free from viscosity, the body would stir up behind it a liquid hill, a sort of wake; upon departure, a great effort would be necessary to put it in motion, since it would be necessary to move not only the body itself, but the liquid of its wake. But, the motion once acquired, it would perpetuate itself without resistance, since the body, in advancing, would simply carry with it the perturbation of the liquid, without the total vis viva of the liquid augmenting. Everything would happen therefore as if its inertia was augmented. An electron advancing in the ether would behave in the same way: around it, the ether would be stirred up, but this perturbation would accompany the body in its motion; so that, for an observer carried along with the electron, the electric and magnetic fields accompanying this electron would appear invariable, and would change only if the velocity of the electron varied. An effort would therefore be necessary to put the electron in motion, since it would be necessary to create the energy of these fields; on the contrary, once the movement acquired, no effort would be necessary to maintain it, since the created energy would only have to go along behind the electron as a wake. This energy, therefore, could only augment the inertia of the electron, as the agitation of the liquid augments that of the body submerged in a perfect fluid. And anyhow, the negative electrons at least have no other inertia except that.

We know that a body submerged in a fluid experiences a lot of resistance when it's moving, but that's because our fluids are viscous. In an ideal fluid, which has no viscosity, the body would create a bump of liquid behind it, like a wake. When it starts moving, it would take a lot of effort to get it going, because you'd have to move not just the body, but also the fluid in its wake. However, once it starts moving, it would keep going without resistance, as the body would just carry along the disturbance in the fluid without increasing the total energy of the liquid. Everything would act as if its inertia were increased. An electron moving in the ether would behave in a similar way: it would stir up the ether around it, but this disturbance would move with the body. Thus, for someone moving with the electron, the electric and magnetic fields associated with it would seem constant, changing only if the electron's speed changed. Therefore, it would take effort to set the electron in motion, as you would need to create the energy for these fields; but once it's moving, no effort would be needed to keep it going, since the energy created would just follow it like a wake. This energy would effectively increase the electron's inertia, similar to how the disturbance in the fluid increases that of a body submerged in a perfect fluid. And in any case, the negative electrons don't have any other form of inertia besides that.

In the hypothesis of Lorentz, the vis viva, which is only the energy of the ether, is not proportional to v2. Doubtless if v is very slight, the vis viva is sensibly proportional to v2, the quantity of motion sensibly proportional to v, the two masses sensibly constant and equal to each other. But when the velocity tends toward the velocity of light, the vis viva, the quantity of motion and the two masses increase beyond all limit.[Pg 509]

In Lorentz's hypothesis, the kinetic energy, which is just the energy of the ether, isn't proportional to v2. Sure, if v is very small, the kinetic energy is roughly proportional to v2, and the amount of motion is roughly proportional to v, with the two masses being roughly constant and equal to each other. But when the velocity approaches the speed of light, the kinetic energy, the amount of motion, and the two masses increase without limit.[Pg 509]

In the hypothesis of Abraham, the expressions are a little more complicated; but what we have just said remains true in essentials.

In Abraham's hypothesis, the terms are a bit more complex; but what we've just stated still holds true in the basics.

So the mass, the quantity of motion, the vis viva become infinite when the velocity is equal to that of light.

So the mass, the amount of motion, the kinetic energy becomes infinite when the speed is equal to that of light.

Thence results that no body can attain in any way a velocity beyond that of light. And in fact, in proportion as its velocity increases, its mass increases, so that its inertia opposes to any new increase of velocity a greater and greater obstacle.

Thence results that no body can attain in any way a velocity beyond that of light. And in fact, as its speed increases, its mass increases, which means its inertia creates a bigger and bigger obstacle to any further increase in speed.

A question then suggests itself: let us admit the principle of relativity; an observer in motion would not have any means of perceiving his own motion. If therefore no body in its absolute motion can exceed the velocity of light, but may approach it as nearly as you choose, it should be the same concerning its relative motion with reference to our observer. And then we might be tempted to reason as follows: The observer may attain a velocity of 200,000 kilometers; the body in its relative motion with reference to the observer may attain the same velocity; its absolute velocity will then be 400,000 kilometers, which is impossible, since this is beyond the velocity of light. This is only a seeming, which vanishes when account is taken of how Lorentz evaluates local time.

A question then arises: if we accept the principle of relativity, an observer in motion wouldn’t be able to perceive their own motion. So, since no object in absolute motion can exceed the speed of light, but can get as close to it as desired, the same should apply to its relative motion with respect to our observer. We might be tempted to think this way: the observer can reach a speed of 200,000 kilometers; the object in its relative motion concerning the observer could also reach the same speed; this would make its absolute speed 400,000 kilometers, which is impossible, as it exceeds the speed of light. This is just an illusion, which disappears when considering how Lorentz calculates local time.

VII

The Wave of Acceleration

When an electron is in motion, it produces a perturbation in the ether surrounding it; if its motion is straight and uniform, this perturbation reduces to the wake of which we have spoken in the preceding section. But it is no longer the same, if the motion be curvilinear or varied. The perturbation may then be regarded as the superposition of two others, to which Langevin has given the names wave of velocity and wave of acceleration. The wave of velocity is only the wave which happens in uniform motion.

When an electron moves, it creates a disturbance in the surrounding ether. If it's moving in a straight and steady way, this disturbance is just the wake discussed in the previous section. However, if the motion is curved or changing, it’s different. The disturbance can be viewed as a combination of two other types, which Langevin called the wave of velocity and wave of acceleration. The wave of velocity is simply the wave that occurs during uniform motion.

As to the wave of acceleration, this is a perturbation altogether analogous to light waves, which starts from the electron at the instant when it undergoes an acceleration, and which is then[Pg 510] propagated by successive spherical waves with the velocity of light. Whence follows: in a straight and uniform motion, the energy is wholly conserved; but, when there is an acceleration, there is loss of energy, which is dissipated under the form of luminous waves and goes out to infinity across the ether.

As for the wave of acceleration, this is a disturbance that is similar to light waves, which begins when the electron is accelerated, and is then [Pg 510] spread out by successive spherical waves at the speed of light. This leads to the conclusion: in straight and uniform motion, energy is fully conserved; however, when there is acceleration, energy is lost, which is released as light waves and radiates out to infinity through the ether.

However, the effects of this wave of acceleration, in particular the corresponding loss of energy, are in most cases negligible, that is to say not only in ordinary mechanics and in the motions of the heavenly bodies, but even in the radium rays, where the velocity is very great without the acceleration being so. We may then confine ourselves to applying the laws of mechanics, putting the force equal to the product of acceleration by mass, this mass, however, varying with the velocity according to the laws explained above. We then say the motion is quasi-stationary.

However, the effects of this wave of acceleration, especially the related loss of energy, are usually negligible—not just in everyday mechanics and the movements of celestial bodies, but even in the case of radium rays, where the speed is very high but the acceleration isn't. We can thus stick to applying the laws of mechanics, setting the force equal to the product of acceleration and mass. This mass, however, changes with velocity according to the rules explained earlier. We then say the motion is quasi-stationary.

It would not be the same in all cases where the acceleration is great, of which the chief are the following:

It wouldn't be the same in every case where the acceleration is high, and the main ones are the following:

1º In incandescent gases certain electrons take an oscillatory motion of very high frequency; the displacements are very small, the velocities are finite, and the accelerations very great; energy is then communicated to the ether, and this is why these gases radiate light of the same period as the oscillations of the electron;

1º In glowing gases, some electrons move back and forth at extremely high frequencies; the movements are tiny, the speeds are finite, and the accelerations are very large; energy is then transferred to the ether, which is why these gases emit light that matches the frequency of the electrons' oscillations;

2º Inversely, when a gas receives light, these same electrons are put in swing with strong accelerations and they absorb light;

2º On the other hand, when a gas is exposed to light, these same electrons are set in motion with strong accelerations and they absorb light;

3º In the Hertz discharger, the electrons which circulate in the metallic mass undergo, at the instant of the discharge, an abrupt acceleration and take then an oscillatory motion of high frequency. Thence results that a part of the energy radiates under the form of Hertzian waves;

3º In the Hertz discharger, the electrons moving through the metal experience a sudden acceleration at the moment of the discharge and then engage in high-frequency oscillatory motion. As a result, some of the energy is released in the form of Hertzian waves;

4º In an incandescent metal, the electrons enclosed in this metal are impelled with great velocity; upon reaching the surface of the metal, which they can not get through, they are reflected and thus undergo a considerable acceleration. This is why the metal emits light. The details of the laws of the emission of light by dark bodies are perfectly explained by this hypothesis;

4º In a glowing metal, the electrons trapped in this metal are pushed with great speed; when they hit the surface of the metal, which they can't penetrate, they bounce back and experience a significant increase in speed. This is why the metal gives off light. The specifics of the laws governing light emission from dark objects are fully explained by this idea;

5º Finally when the cathode rays strike the anticathode, the negative electrons, constituting these rays, which are impelled with very great velocity, are abruptly arrested. Because of the[Pg 511] acceleration they thus undergo, they produce undulations in the ether. This, according to certain physicists, is the origin of the Röntgen rays, which would only be light rays of very short wave-length.

5º Finally, when the cathode rays hit the anticathode, the negative electrons that make up these rays, which are moving at very high speeds, are suddenly stopped. Due to the[Pg 511] acceleration they experience, they create waves in the ether. According to some physicists, this is the source of the Röntgen rays, which are essentially just light rays of very short wavelengths.


CHAPTER III

Modern Mechanics and Astronomy

I

Gravitation

Mass may be defined in two ways:

Mass can be defined in two ways:

1º By the quotient of the force by the acceleration; this is the true definition of the mass, which measures the inertia of the body.

1º By dividing the force by the acceleration; this is the true definition of mass, which measures the body's inertia.

2º By the attraction the body exercises upon an exterior body, in virtue of Newton's law. We should therefore distinguish the mass coefficient of inertia and the mass coefficient of attraction. According to Newton's law, there is rigorous proportionality between these two coefficients. But that is demonstrated only for velocities to which the general principles of dynamics are applicable. Now, we have seen that the mass coefficient of inertia increases with the velocity; should we conclude that the mass coefficient of attraction increases likewise with the velocity and remains proportional to the coefficient of inertia, or, on the contrary, that this coefficient of attraction remains constant? This is a question we have no means of deciding.

2º By the attraction that one body exerts on another external body, according to Newton's law. We should therefore distinguish between the mass coefficient of inertia and the mass coefficient of attraction. According to Newton's law, there is a strict proportionality between these two coefficients. However, this is only proven for velocities where the general principles of dynamics apply. Now, we have observed that the mass coefficient of inertia increases with velocity; should we conclude that the mass coefficient of attraction also increases with velocity and stays proportional to the inertial coefficient, or, on the other hand, that the coefficient of attraction remains constant? This is a question we have no way of answering.

On the other hand, if the coefficient of attraction depends upon the velocity, since the velocities of two bodies which mutually attract are not in general the same, how will this coefficient depend upon these two velocities?

On the other hand, if the coefficient of attraction depends on the velocity, and since the velocities of two bodies that attract each other are usually not the same, how will this coefficient depend on these two velocities?

Upon this subject we can only make hypotheses, but we are naturally led to investigate which of these hypotheses would be compatible with the principle of relativity. There are a great number of them; the only one of which I shall here speak is that of Lorentz, which I shall briefly expound.

Upon this topic, we can only propose hypotheses, but we are naturally prompted to explore which of these hypotheses aligns with the principle of relativity. There are many of them; the only one I will discuss here is Lorentz's, which I will explain briefly.

Consider first electrons at rest. Two electrons of the same sign repel each other and two electrons of contrary sign attract each other; in the ordinary theory, their mutual actions are proportional to their electric charges; if therefore we have four[Pg 513] electrons, two positive A and , and two negative B and , the charges of these four being the same in absolute value, the repulsion of A for will be, at the same distance, equal to the repulsion of B for and equal also to the attraction of A for , or of for B. If therefore A and B are very near each other, as also and , and we examine the action of the system A + B upon the system + , we shall have two repulsions and two attractions which will exactly compensate each other and the resulting action will be null.

Consider first electrons at rest. Two electrons with the same charge repel each other, while two electrons with opposite charges attract. In the usual theory, their interactions are proportional to their electric charges. So, if we have four[Pg 513] electrons—two positive A and , and two negative B and —with the charges of these four being the same in absolute value, the repulsion between A and will be equal to the repulsion between B and at the same distance, and also equal to the attraction between A and , or between and B. Therefore, if A and B are very close to each other, as are and , and we look at the action of the system A + B on the system + , we will find two repulsions and two attractions that perfectly balance each other, resulting in no net action.

Now, material molecules should just be regarded as species of solar systems where circulate the electrons, some positive, some negative, and in such a way that the algebraic sum of all the charges is null. A material molecule is therefore wholly analogous to the system A + B of which we have spoken, so that the total electric action of two molecules one upon the other should be null.

Now, material molecules can be seen as types of solar systems where electrons orbit around, some with positive charge and some with negative charge, and in such a way that the algebraic sum of all the charges is zero. A material molecule is thus completely similar to the system A + B that we discussed earlier, meaning that the total electric interaction of two molecules on each other should be zero.

But experiment shows us that these molecules attract each other in consequence of Newtonian gravitation; and then we may make two hypotheses: we may suppose gravitation has no relation to the electrostatic attractions, that it is due to a cause entirely different, and is simply something additional; or else we may suppose the attractions are not proportional to the charges and that the attraction exercised by a charge +1 upon a charge −1 is greater than the mutual repulsion of two +1 charges, or two −1 charges.

But experiments show us that these molecules attract each other because of Newtonian gravity. From this, we can make two assumptions: we can either suppose that gravity has nothing to do with electrostatic attraction and is caused by something completely different, simply adding to the overall effect; or we can assume that the attractions aren’t proportional to the charges, meaning that the attraction between a charge of +1 and a charge of -1 is stronger than the mutual repulsion between two +1 charges or two -1 charges.

In other words, the electric field produced by the positive electrons and that which the negative electrons produce might be superposed and yet remain distinct. The positive electrons would be more sensitive to the field produced by the negative electrons than to the field produced by the positive electrons; the contrary would be the case for the negative electrons. It is clear that this hypothesis somewhat complicates electrostatics, but that it brings back into it gravitation. This was, in sum, Franklin's hypothesis.

In other words, the electric field created by positive electrons and the one created by negative electrons could overlap while still remaining distinct. Positive electrons would react more to the field generated by negative electrons than to the field from positive electrons; the opposite would be true for negative electrons. It's clear that this idea makes electrostatics a bit more complex, but it also reintroduces gravitation into the discussion. This was, in short, Franklin's hypothesis.

What happens now if the electrons are in motion? The positive electrons will cause a perturbation in the ether and produce there an electric and magnetic field. The same will be the case for the negative electrons. The electrons, positive as[Pg 514] well as negative, undergo then a mechanical impulsion by the action of these different fields. In the ordinary theory, the electromagnetic field, due to the motion of the positive electrons, exercises, upon two electrons of contrary sign and of the same absolute charge, equal actions with contrary sign. We may then without inconvenience not distinguish the field due to the motion of the positive electrons and the field due to the motion of the negative electrons and consider only the algebraic sum of these two fields, that is to say the resulting field.

What happens now when the electrons are moving? The positive electrons will create a disturbance in the ether, generating an electric and magnetic field. The same goes for the negative electrons. Both positive and negative electrons experience a mechanical push from the effects of these different fields. In the standard theory, the electromagnetic field caused by the movement of positive electrons acts on two electrons of opposite charges but equal absolute charge with equal forces in opposite directions. So, we can conveniently treat the field created by the motion of positive electrons and the field created by the motion of negative electrons as one, focusing only on the algebraic sum of these two fields, which is the resulting field.

In the new theory, on the contrary, the action upon the positive electrons of the electromagnetic field due to the positive electrons follows the ordinary laws; it is the same with the action upon the negative electrons of the field due to the negative electrons. Let us now consider the action of the field due to the positive electrons upon the negative electrons (or inversely); it will still follow the same laws, but with a different coefficient. Each electron is more sensitive to the field created by the electrons of contrary name than to the field created by the electrons of the same name.

In the new theory, on the other hand, the influence of the electromagnetic field from positive electrons on other positive electrons follows the standard laws; the same applies to the influence of the field from negative electrons on other negative electrons. Now, let’s think about how the field created by positive electrons affects negative electrons (or vice versa); it still follows the same laws, but with a different coefficient. Each electron is more responsive to the field generated by oppositely charged electrons than to the field generated by similarly charged electrons.

Such is the hypothesis of Lorentz, which reduces to Franklin's hypothesis for slight velocities; it will therefore explain, for these small velocities, Newton's law. Moreover, as gravitation goes back to forces of electrodynamic origin, the general theory of Lorentz will apply, and consequently the principle of relativity will not be violated.

Such is Lorentz's hypothesis, which simplifies to Franklin's hypothesis for low speeds; therefore, it will explain Newton's law for these small speeds. Additionally, since gravitation is rooted in forces from electrodynamics, Lorentz's general theory will apply, and as a result, the principle of relativity will remain intact.

We see that Newton's law is no longer applicable to great velocities and that it must be modified, for bodies in motion, precisely in the same way as the laws of electrostatics for electricity in motion.

We can see that Newton's law doesn't work at high speeds anymore and needs to be adjusted, just like the laws of electrostatics have to change for moving electricity.

We know that electromagnetic perturbations spread with the velocity of light. We may therefore be tempted to reject the preceding theory upon remembering that gravitation spreads, according to the calculations of Laplace, at least ten million times more quickly than light, and that consequently it can not be of electromagnetic origin. The result of Laplace is well known, but one is generally ignorant of its signification. Laplace supposed that, if the propagation of gravitation is not instantaneous, its velocity of spread combines with that of the body[Pg 515] attracted, as happens for light in the phenomenon of astronomic aberration, so that the effective force is not directed along the straight joining the two bodies, but makes with this straight a small angle. This is a very special hypothesis, not well justified, and, in any case, entirely different from that of Lorentz. Laplace's result proves nothing against the theory of Lorentz.

We know that electromagnetic disturbances travel at the speed of light. So, we might be tempted to dismiss the previous theory, especially since Laplace calculated that gravitation spreads at least ten million times faster than light, meaning it can't be electromagnetic in nature. Laplace's findings are well known, but many people don’t really understand their implications. He assumed that if gravitation doesn’t propagate instantly, its speed of transmission combines with that of the body[Pg 515] being attracted, similar to light in the phenomenon of astronomical aberration. This means the effective force isn't aligned directly between the two bodies, but instead forms a slight angle with that line. This is a very specific hypothesis that isn't strongly substantiated and is, in any case, completely different from Lorentz's theory. Laplace's result doesn’t disprove Lorentz's theory at all.

II

Comparison with Astronomic Observations

Can the preceding theories be reconciled with astronomic observations?

Can the previous theories be aligned with astronomical observations?

First of all, if we adopt them, the energy of the planetary motions will be constantly dissipated by the effect of the wave of acceleration. From this would result that the mean motions of the stars would constantly accelerate, as if these stars were moving in a resistant medium. But this effect is exceedingly slight, far too much so to be discerned by the most precise observations. The acceleration of the heavenly bodies is relatively slight, so that the effects of the wave of acceleration are negligible and the motion may be regarded as quasi stationary. It is true that the effects of the wave of acceleration constantly accumulate, but this accumulation itself is so slow that thousands of years of observation would be necessary for it to become sensible. Let us therefore make the calculation considering the motion as quasi-stationary, and that under the three following hypotheses:

First of all, if we adopt them, the energy from planetary movements will continuously be lost due to the effect of the wave of acceleration. This would mean that the average movements of the stars would steadily accelerate, as if these stars were traveling through a resistant medium. However, this effect is extremely minor, far too small to be detected even by the most precise observations. The acceleration of heavenly bodies is relatively slight, making the effects of the wave of acceleration negligible, so we can consider the motion as quasi-stationary. It's true that the effects of the wave of acceleration keep building up, but this accumulation happens so slowly that it would take thousands of years of observations for it to be noticeable. Let's proceed with the calculation treating the motion as quasi-stationary, based on the following three hypotheses:

A. Admit the hypothesis of Abraham (electrons indeformable) and retain Newton's law in its usual form;

A. Accept the hypothesis of Abraham (electrons are incompressible) and keep Newton's law in its standard form;

B. Admit the hypothesis of Lorentz about the deformation of electrons and retain the usual Newton's law;

B. Accept Lorentz's hypothesis about the deformation of electrons and keep the standard Newton's law;

C. Admit the hypothesis of Lorentz about electrons and modify Newton's law as we have done in the preceding paragraph, so as to render it compatible with the principle of relativity.

C. Accept Lorentz's hypothesis regarding electrons and adjust Newton's law as we did in the previous paragraph to make it consistent with the principle of relativity.

It is in the motion of Mercury that the effect will be most sensible, since this planet has the greatest velocity. Tisserand formerly made an analogous calculation, admitting Weber's law; I recall that Weber had sought to explain at the same time the[Pg 516] electrostatic and electrodynamic phenomena in supposing that electrons (whose name was not yet invented) exercise, one upon another, attractions and repulsions directed along the straight joining them, and depending not only upon their distances, but upon the first and second derivatives of these distances, consequently upon their velocities and their accelerations. This law of Weber, different enough from those which to-day tend to prevail, none the less presents a certain analogy with them.

It’s in Mercury’s movement that the effect will be most noticeable, since this planet moves the fastest. Tisserand previously did a similar calculation, assuming Weber's law; I remember that Weber was trying to explain both electrostatic and electrodynamic phenomena by suggesting that electrons (a term that hadn’t been coined yet) exert attractions and repulsions on each other along the line connecting them. These forces depend not only on their distances but also on the first and second derivatives of those distances, meaning their velocities and accelerations. This law of Weber, while quite different from the ones that are popular today, still shows some similarities to them.

Tisserand found that, if the Newtonian attraction conformed to Weber's law there resulted, for Mercury's perihelion, secular variation of 14´´, of the same sense as that which has been observed and could not be explained, but smaller, since this is 38´´.

Tisserand discovered that if the Newtonian attraction followed Weber's law, it would lead to a long-term change of 14" for Mercury's perihelion, in the same direction as what has been observed and remains unexplained, but smaller, as the observed change is 38".

Let us recur to the hypotheses A, B and C, and study first the motion of a planet attracted by a fixed center. The hypotheses B and C are no longer distinguished, since, if the attracting point is fixed, the field it produces is a purely electrostatic field, where the attraction varies inversely as the square of the distance, in conformity with Coulomb's electrostatic law, identical with that of Newton.

Let’s go back to hypotheses A, B, and C and first examine the movement of a planet pulled by a fixed center. Hypotheses B and C are no longer separate since, if the attracting point is fixed, the field it creates is purely electrostatic, where the attraction decreases with the square of the distance, in accordance with Coulomb's electrostatic law, which is the same as Newton's.

The vis viva equation holds good, taking for vis viva the new definition; in the same way, the equation of areas is replaced by another equivalent to it; the moment of the quantity of motion is a constant, but the quantity of motion must be defined as in the new dynamics.

The vis viva equation is valid with the new definition of vis viva; similarly, the equation of areas is replaced by another equivalent. The momentum is constant, but we must define momentum according to the new dynamics.

The only sensible effect will be a secular motion of the perihelion. With the theory of Lorentz, we shall find, for this motion, half of what Weber's law would give; with the theory of Abraham, two fifths.

The only sensible outcome will be a steady movement of the perihelion. According to Lorentz's theory, we will find that this movement accounts for half of what Weber's law would predict; with Abraham's theory, it accounts for two-fifths.

If now we suppose two moving bodies gravitating around their common center of gravity, the effects are very little different, though the calculations may be a little more complicated. The motion of Mercury's perihelion would therefore be 7´´ in the theory of Lorentz and 5´´.6 in that of Abraham.

If we now assume two moving bodies gravitating around their shared center of gravity, the effects are quite similar, although the calculations might be slightly more complex. The motion of Mercury's perihelion would therefore be 7´´ in Lorentz's theory and 5´´.6 in Abraham's theory.

The effect moreover is proportional to n3a2, where n is the star's mean motion and a the radius of its orbit. For the planets, in virtue of Kepler's law, the effect varies then inversely as √a5; it is therefore insensible, save for Mercury.[Pg 517]

The effect is also proportional to n3a2, where n is the average motion of the star and a is the radius of its orbit. For the planets, according to Kepler's law, the effect decreases inversely as √a5; it is therefore negligible, except for Mercury.[Pg 517]

It is likewise insensible for the moon though n is great, because a is extremely small; in sum, it is five times less for Venus, and six hundred times less for the moon than for Mercury. We may add that as to Venus and the earth, the motion of the perihelion (for the same angular velocity of this motion) would be much more difficult to discern by astronomic observations, because the excentricity of their orbits is much less than for Mercury.

It also doesn’t make sense for the moon, even though n is large, because a is really tiny; basically, it’s five times smaller for Venus and six hundred times smaller for the moon compared to Mercury. Additionally, when it comes to Venus and Earth, the change in perihelion (given the same angular speed of this motion) would be much harder to detect with astronomical observations, since the eccentricity of their orbits is much lower than that of Mercury.

To sum up, the only sensible effect upon astronomic observations would be a motion of Mercury's perihelion, in the same sense as that which has been observed without being explained, but notably slighter.

To sum up, the only reasonable impact on astronomical observations would be a shift in Mercury's perihelion, similar to what has been seen but not explained, though significantly smaller.

That can not be regarded as an argument in favor of the new dynamics, since it will always be necessary to seek another explanation for the greater part of Mercury's anomaly; but still less can it be regarded as an argument against it.

That can't be seen as a point in favor of the new dynamics, since we will always need to find another explanation for most of Mercury's anomaly; however, it also can't be seen as an argument against it.

III

The Theory of Lesage

It is interesting to compare these considerations with a theory long since proposed to explain universal gravitation.

It’s interesting to compare these thoughts with a theory that was proposed a long time ago to explain universal gravitation.

Suppose that, in the interplanetary spaces, circulate in every direction, with high velocities, very tenuous corpuscles. A body isolated in space will not be affected, apparently, by the impacts of these corpuscles, since these impacts are equally distributed in all directions. But if two bodies A and B are present, the body B will play the rôle of screen and will intercept part of the corpuscles which, without it, would have struck A. Then, the impacts received by A in the direction opposite that from B will no longer have a counterpart, or will now be only partially compensated, and this will push A toward B.

Imagine that, in the vastness of space between planets, tiny particles are moving in all directions at high speeds. An isolated object in space seems unaffected by the hits from these particles since these strikes are spread out evenly all around it. However, when two objects A and B are present, object B acts as a shield and blocks some of the particles that would have otherwise hit A. As a result, the impacts that A receives from the direction opposite B will no longer have an equal counterpart or will be only partially balanced, causing A to be pushed toward B.

Such is the theory of Lesage; and we shall discuss it, taking first the view-point of ordinary mechanics.

Such is Lesage's theory, and we'll discuss it, starting from the perspective of regular mechanics.

First, how should the impacts postulated by this theory take place; is it according to the laws of perfectly elastic bodies, or according to those of bodies devoid of elasticity, or according to an intermediate law? The corpuscles of Lesage can not act as perfectly elastic bodies; otherwise the effect would be null,[Pg 518] since the corpuscles intercepted by the body B would be replaced by others which would have rebounded from B, and calculation proves that the compensation would be perfect. It is necessary then that the impact make the corpuscles lose energy, and this energy should appear under the form of heat. But how much heat would thus be produced? Note that attraction passes through bodies; it is necessary therefore to represent to ourselves the earth, for example, not as a solid screen, but as formed of a very great number of very small spherical molecules, which play individually the rôle of little screens, but between which the corpuscles of Lesage may freely circulate. So, not only the earth is not a solid screen, but it is not even a cullender, since the voids occupy much more space than the plenums. To realize this, recall that Laplace has demonstrated that attraction, in traversing the earth, is weakened at most by one ten-millionth part, and his proof is perfectly satisfactory: in fact, if attraction were absorbed by the body it traverses, it would no longer be proportional to the masses; it would be relatively weaker for great bodies than for small, since it would have a greater thickness to traverse. The attraction of the sun for the earth would therefore be relatively weaker than that of the sun for the moon, and thence would result, in the motion of the moon, a very sensible inequality. We should therefore conclude, if we adopt the theory of Lesage, that the total surface of the spherical molecules which compose the earth is at most the ten-millionth part of the total surface of the earth.

First, how should the impacts suggested by this theory occur; is it according to the laws of perfectly elastic bodies, or according to those of bodies without elasticity, or an intermediate law? Lesage's corpuscles cannot act as perfectly elastic bodies; otherwise, the effect would be null,[Pg 518] since the corpuscles blocked by body B would be replaced by others that would have bounced off B, and calculations show that the compensation would be perfect. Therefore, the impact must cause the corpuscles to lose energy, and this energy should manifest as heat. But how much heat would be produced? It's important to note that attraction passes through bodies; we must envision the Earth, for instance, not as a solid barrier, but as being made up of a very large number of tiny spherical molecules, which individually act like little screens, yet allow Lesage's corpuscles to flow freely between them. So, not only is the Earth not a solid barrier, but it isn’t even a sieve, since the gaps take up much more space than the solids. To understand this, remember that Laplace demonstrated that attraction, when passing through the Earth, is weakened by at most one ten-millionth part, and his proof is completely satisfactory: in fact, if attraction were absorbed by the body it passes through, it would no longer be proportional to the masses; it would be relatively weaker for larger bodies than for smaller ones, as it would have a greater thickness to go through. Thus, the Sun's attraction for the Earth would be relatively weaker than its attraction for the Moon, leading to a noticeable inequality in the Moon's motion. We should therefore conclude, if we accept Lesage's theory, that the total surface area of the spherical molecules that make up the Earth is at most one ten-millionth of the total surface area of the Earth.

Darwin has proved that the theory of Lesage only leads exactly to Newton's law when we postulate particles entirely devoid of elasticity. The attraction exerted by the earth on a mass 1 at a distance 1 will then be proportional, at the same time, to the total surface S of the spherical molecules composing it, to the velocity v of the corpuscles, to the square root of the density ρ of the medium formed by the corpuscles. The heat produced will be proportional to S, to the density ρ, and to the cube of the velocity v.

Darwin has shown that Lesage's theory only aligns with Newton's law when we assume particles have no elasticity. The force that the earth exerts on a mass of 1 at a distance of 1 will then be proportional at the same time to the total surface S of the spherical molecules that make it up, to the velocity v of the particles, and to the square root of the density ρ of the medium made up of the particles. The heat generated will be proportional to S, to the density ρ, and to the cube of the velocity v.

But it is necessary to take account of the resistance experienced by a body moving in such a medium; it can not move, in fact, without going against certain impacts, in fleeing, on the contrary,[Pg 519] before those coming in the opposite direction, so that the compensation realized in the state of rest can no longer subsist. The calculated resistance is proportional to S, to ρ and to v; now, we know that the heavenly bodies move as if they experienced no resistance, and the precision of observations permits us to fix a limit to the resistance of the medium.

But we need to consider the resistance faced by a body moving through a medium; it can't move without encountering certain impacts, and conversely, when it tries to escape, it faces those coming from the opposite direction, so the balance achieved when at rest can no longer hold. The measured resistance is proportional to S, to ρ, and to v; however, we know that celestial bodies move as if they feel no resistance, and the accuracy of observations allows us to determine a limit to the resistance of the medium.[Pg 519]

This resistance varying as Sρv, while the attraction varies as S√(ρv), we see that the ratio of the resistance to the square of the attraction is inversely as the product Sv.

This resistance changes as Sρv, while the attraction changes as S√(ρv), we find that the ratio of the resistance to the square of the attraction is inversely proportional to the product Sv.

We have therefore a lower limit of the product Sv. We have already an upper limit of S (by the absorption of attraction by the body it traverses); we have therefore a lower limit of the velocity v, which must be at least 24·1017 times that of light.

We now have a lower limit for the product Sv. We already have an upper limit for S (due to the attraction absorbed by the body it passes through); therefore, we have a lower limit for the velocity v, which must be at least 24·1017 times the speed of light.

From this we are able to deduce ρ and the quantity of heat produced; this quantity would suffice to raise the temperature 1026 degrees a second; the earth would receive in a given time 1020 times more heat than the sun emits in the same time; I am not speaking of the heat the sun sends to the earth, but of that it radiates in all directions.

From this, we can figure out ρ and the amount of heat produced; this amount would be enough to increase the temperature by 1026 degrees per second; the Earth would receive, over a certain period, 1020 times more heat than the sun emits in the same time frame; I’m not talking about the heat the sun sends to the Earth, but rather the heat it radiates in all directions.

It is evident the earth could not long stand such a régime.

It’s clear that the earth couldn’t endure such a system for much longer.

We should not be led to results less fantastic if, contrary to Darwin's views, we endowed the corpuscles of Lesage with an elasticity imperfect without being null. In truth, the vis viva of these corpuscles would not be entirely converted into heat, but the attraction produced would likewise be less, so that it would be only the part of this vis viva converted into heat, which would contribute to produce the attraction and that would come to the same thing; a judicious employment of the theorem of the viriel would enable us to account for this.

We shouldn't expect results to be less amazing if, against Darwin's ideas, we gave Lesage's particles an imperfect but non-zero elasticity. In reality, the kinetic energy of these particles wouldn't completely turn into heat, but the attraction generated would also be less. This means that only the portion of this kinetic energy turned into heat would contribute to the attraction, and it would essentially amount to the same result; a smart application of the virial theorem would allow us to explain this.

The theory of Lesage may be transformed; suppress the corpuscles and imagine the ether overrun in all senses by luminous waves coming from all points of space. When a material object receives a luminous wave, this wave exercises upon it a mechanical action due to the Maxwell-Bartholi pressure, just as if it had received the impact of a material projectile. The waves in question could therefore play the rôle of the corpuscles of Lesage. This is what is supposed, for example, by M. Tommasina.

The theory of Lesage can be reimagined; eliminate the corpuscles and picture the ether filled in all directions with light waves coming from every part of space. When a physical object absorbs a light wave, this wave applies a mechanical force on it due to the Maxwell-Bartholi pressure, just as if it had been hit by a physical projectile. These waves could thus serve the function of Lesage's corpuscles. This is what is suggested, for instance, by M. Tommasina.

The difficulties are not removed for all that; the velocity of[Pg 520] propagation can be only that of light, and we are thus led, for the resistance of the medium, to an inadmissible figure. Besides, if the light is all reflected, the effect is null, just as in the hypothesis of the perfectly elastic corpuscles.

The challenges still exist; the speed of[Pg 520] propagation can only match the speed of light, which leads us to an unacceptable figure due to the resistance of the medium. Additionally, if all the light is reflected, the effect is zero, similar to the idea of perfectly elastic particles.

That there should be attraction, it is necessary that the light be partially absorbed; but then there is production of heat. The calculations do not differ essentially from those made in the ordinary theory of Lesage, and the result retains the same fantastic character.

That there should be attraction, it is necessary that the light be partially absorbed; but then there is production of heat. The calculations do not differ essentially from those made in the ordinary theory of Lesage, and the result retains the same fantastic character.

On the other hand, attraction is not absorbed by the body it traverses, or hardly at all; it is not so with the light we know. Light which would produce the Newtonian attraction would have to be considerably different from ordinary light and be, for example, of very short wave length. This does not count that, if our eyes were sensible of this light, the whole heavens should appear to us much more brilliant than the sun, so that the sun would seem to us to stand out in black, otherwise the sun would repel us instead of attracting us. For all these reasons, light which would permit of the explanation of attraction would be much more like Röntgen rays than like ordinary light.

On the other hand, attraction isn’t absorbed by the body it moves through, or barely at all; it’s different from the light we know. Light that would create Newtonian attraction would need to be quite different from regular light, for example, having a very short wavelength. Additionally, if our eyes could detect this light, the entire sky would appear much brighter than the sun, making the sun seem black in contrast; otherwise, the sun would repel us instead of attracting us. For all these reasons, light that would explain attraction would resemble Röntgen rays much more than ordinary light.

And besides, the X-rays would not suffice; however penetrating they may seem to us, they could not pass through the whole earth; it would be necessary therefore to imagine X´-rays much more penetrating than the ordinary X-rays. Moreover a part of the energy of these X´-rays would have to be destroyed, otherwise there would be no attraction. If you do not wish it transformed into heat, which would lead to an enormous heat production, you must suppose it radiated in every direction under the form of secondary rays, which might be called X´´ and which would have to be much more penetrating still than the X´-rays, otherwise they would in their turn derange the phenomena of attraction.

And besides, the X-rays wouldn’t be enough; no matter how penetrating they seem to us, they couldn’t go through the whole earth. So we’d need to imagine X'-rays that are much more penetrating than regular X-rays. Also, some of the energy from these X'-rays would have to be lost; otherwise, there wouldn’t be any attraction. If you don’t want it turned into heat, which would generate an enormous amount of heat, you have to assume it radiates out in every direction as secondary rays, which we could call X'' and which would need to be even more penetrating than the X'-rays, or else they would mess up the attraction phenomena as well.

Such are the complicated hypotheses to which we are led when we try to give life to the theory of Lesage.

Such are the complex ideas we come up with when we attempt to bring the theory of Lesage to life.

But all we have said presupposes the ordinary laws of mechanics.

But everything we've mentioned assumes the usual laws of mechanics.

Will things go better if we admit the new dynamics? And first, can we conserve the principles of relativity? Let us give at[Pg 521] first to the theory of Lesage its primitive form, and suppose space ploughed by material corpuscles; if these corpuscles were perfectly elastic, the laws of their impact would conform to this principle of relativity, but we know that then their effect would be null. We must therefore suppose these corpuscles are not elastic, and then it is difficult to imagine a law of impact compatible with the principle of relativity. Besides, we should still find a production of considerable heat, and yet a very sensible resistance of the medium.

Will things improve if we recognize the new dynamics? And first, can we maintain the principles of relativity? Let's start with Lesage's theory in its basic form and assume space is filled with material particles; if these particles were perfectly elastic, their collision behaviors would follow the principle of relativity, but we know that would result in no effect. Therefore, we must assume these particles are not elastic, which makes it hard to envision a collision law that aligns with the principle of relativity. Furthermore, we would still observe significant heat generation and a noticeable resistance from the medium.

If we suppress these corpuscles and revert to the hypothesis of the Maxwell-Bartholi pressure, the difficulties will not be less. This is what Lorentz himself has attempted in his Memoir to the Amsterdam Academy of Sciences of April 25, 1900.

If we ignore these particles and go back to the idea of the Maxwell-Bartholi pressure, the challenges won't be any fewer. This is what Lorentz himself tried to do in his Memoir to the Amsterdam Academy of Sciences on April 25, 1900.

Consider a system of electrons immersed in an ether permeated in every sense by luminous waves; one of these electrons, struck by one of these waves, begins to vibrate; its vibration will be synchronous with that of light; but it may have a difference of phase, if the electron absorbs a part of the incident energy. In fact, if it absorbs energy, this is because the vibration of the ether impels the electron; the electron must therefore be slower than the ether. An electron in motion is analogous to a convection current; therefore every magnetic field, in particular that due to the luminous perturbation itself, must exert a mechanical action upon this electron. This action is very slight; moreover, it changes sign in the current of the period; nevertheless, the mean action is not null if there is a difference of phase between the vibrations of the electron and those of the ether. The mean action is proportional to this difference, consequently to the energy absorbed by the electron. I can not here enter into the detail of the calculations; suffice it to say only that the final result is an attraction of any two electrons, varying inversely as the square of the distance and proportional to the energy absorbed by the two electrons.

Imagine a system of electrons surrounded by an ether filled with light waves; when one of these electrons gets hit by a light wave, it starts to vibrate. Its vibration will sync with that of the light, but it might have a phase difference if the electron takes in some of the incoming energy. In fact, if it absorbs energy, it’s because the ether’s vibrations are pushing the electron; therefore, the electron must be moving slower than the ether. A moving electron is like a convection current; as a result, every magnetic field, especially the one generated by the light disturbances themselves, will put a mechanical force on this electron. This force is quite small; additionally, it reverses direction over time; however, the average effect is not zero if there’s a phase difference between the vibrations of the electron and those of the ether. The average force is proportional to this phase difference and, consequently, to the energy absorbed by the electron. I won’t go into the details of the calculations here; just know that the end result is an attraction between any two electrons that decreases with the square of the distance and is proportional to the energy absorbed by the two electrons.

Therefore there can not be attraction without absorption of light and, consequently, without production of heat, and this it is which determined Lorentz to abandon this theory, which, at bottom, does not differ from that of Lesage-Maxwell-Bartholi. He would have been much more dismayed still if he had pushed[Pg 522] the calculation to the end. He would have found that the temperature of the earth would have to increase 1012 degrees a second.

Therefore, there can't be attraction without the absorption of light and, consequently, without the production of heat. This led Lorentz to abandon this theory, which, at its core, is not different from that of Lesage-Maxwell-Bartholi. He would have been even more shocked if he had completed the calculation. He would have found that the temperature of the earth would need to increase by 1012 degrees per second.

IV

Conclusions

I have striven to give in few words an idea as complete as possible of these new doctrines; I have sought to explain how they took birth; otherwise the reader would have had ground to be frightened by their boldness. The new theories are not yet demonstrated; far from it; only they rest upon an aggregate of probabilities sufficiently weighty for us not to have the right to treat them with disregard.

I’ve tried to provide a concise overview of these new ideas; I wanted to explain how they came about, or else the reader might be understandably wary of their boldness. The new theories aren’t proven yet—not at all; they are based on a collection of probabilities strong enough that we shouldn’t dismiss them.

New experiments will doubtless teach us what we should finally think of them. The knotty point of the question lies in Kaufmann's experiment and those that may be undertaken to verify it.

New experiments will definitely show us what we should ultimately think about them. The tricky part of the question lies in Kaufmann's experiment and the ones that might be conducted to confirm it.

In conclusion, permit me a word of warning. Suppose that, after some years, these theories undergo new tests and triumph; then our secondary education will incur a great danger; certain professors will doubtless wish to make a place for the new theories.

In conclusion, let me give you a word of warning. If, after some years, these theories are put to the test and succeed, our secondary education will face a serious threat; certain professors will definitely want to accommodate the new theories.

Novelties are so attractive, and it is so hard not to seem highly advanced! At least there will be the wish to open vistas to the pupils and, before teaching them the ordinary mechanics, to let them know it has had its day and was at best good enough for that old dolt Laplace. And then they will not form the habit of the ordinary mechanics.

Novelties are really appealing, and it's tough not to come off as super advanced! At the very least, there’s the desire to broaden the students' horizons and, before teaching them the basic mechanics, to make it clear that it’s outdated and was only ever good enough for that old fool Laplace. This way, they won’t get stuck in the habit of the basic mechanics.

Is it well to let them know this is only approximative? Yes; but later, when it has penetrated to their very marrow, when they shall have taken the bent of thinking only through it, when there shall no longer be risk of their unlearning it, then one may, without inconvenience, show them its limits.

Is it okay to let them know this is just an approximation? Yes; but later, once it has sunk deep into their bones, once they start thinking only through it, and once there's no risk of them unlearning it, then it’s fine to show them its limits.

It is with the ordinary mechanics that they must live; this alone will they ever have to apply. Whatever be the progress of automobilism, our vehicles will never attain speeds where it is not true. The other is only a luxury, and we should think of the luxury only when there is no longer any risk of harming the necessary.

It is with the basic mechanics that they have to live; this is all they will ever need to use. No matter how advanced cars become, our vehicles will never reach speeds that aren't true. The other is just a luxury, and we should only consider luxury when there's no longer a risk of damaging what's essential.


BOOK IV

ASTRONOMIC SCIENCE


CHAPTER I

The Milky Way and the Gas Theory

The considerations to be here developed have scarcely as yet drawn the attention of astronomers; there is hardly anything to cite except an ingenious idea of Lord Kelvin's, which has opened a new field of research, but still waits to be followed out. Nor have I original results to impart, and all I can do is to give an idea of the problems presented, but which no one hitherto has undertaken to solve. Every one knows how a large number of modern physicists represent the constitution of gases; gases are formed of an innumerable multitude of molecules which, at high speeds, cross and crisscross in every direction. These molecules probably act at a distance one upon another, but this action decreases very rapidly with distance, so that their trajectories remain sensibly straight; they cease to be so only when two molecules happen to pass very near to each other; in this case, their mutual attraction or repulsion makes them deviate to right or left. This is what is sometimes called an impact; but the word impact is not to be understood in its usual sense; it is not necessary that the two molecules come into contact, it suffices that they approach sufficiently near each other for their mutual attractions to become sensible. The laws of the deviation they undergo are the same as for a veritable impact.

The ideas I'll discuss haven't really caught the attention of astronomers yet; there’s hardly anything to reference except for an innovative thought from Lord Kelvin, which has created a new area for research but still needs to be explored further. I don’t have original findings to share, and all I can do is outline the problems presented, which no one has attempted to tackle so far. Everyone knows how many modern physicists depict the structure of gases: gases consist of countless molecules that move at high speeds, crossing and crisscrossing in every direction. These molecules likely exert forces on one another from a distance, but this force quickly decreases with distance, so their paths remain mostly straight; they only deviate when two molecules happen to pass very close to each other. In this case, their mutual attraction or repulsion causes them to shift to the right or left. This interaction is sometimes referred to as an impact; however, the term impact shouldn’t be taken in its usual sense; it’s not necessary for the two molecules to touch; it’s enough for them to come close enough for their mutual attractions to become noticeable. The rules governing the deviation they experience are the same as those for a true impact.

It seems at first that the disorderly impacts of this innumerable dust can engender only an inextricable chaos before which analysis must recoil. But the law of great numbers, that supreme law of chance, comes to our aid; in presence of a semi-disorder, we must despair, but in extreme disorder, this statistical law[Pg 524] reestablishes a sort of mean order where the mind can recover. It is the study of this mean order which constitutes the kinetic theory of gases; it shows us that the velocities of the molecules are equally distributed among all the directions, that the rapidity of these velocities varies from one molecule to another, but that even this variation is subject to a law called Maxwell's law. This law tells us how many of the molecules move with such and such a velocity. As soon as the gas departs from this law, the mutual impacts of the molecules, in modifying the rapidity and direction of their velocities, tend to bring it promptly back. Physicists have striven, not without success, to explain in this way the experimental properties of gases; for example Mariotte's law.

At first glance, the chaotic effects of this countless dust seem to create an inescapable disorder that analysis can't confront. However, the law of large numbers, that ultimate law of chance, comes to the rescue; in situations of partial disorder, we might lose hope, but in complete chaos, this statistical law[Pg 524] brings about a kind of average order that allows our minds to regroup. The exploration of this average order forms the basis of the kinetic theory of gases. It reveals that the speeds of the molecules are evenly distributed across all directions, that the speed of each molecule varies, but even this variation follows a rule known as Maxwell's law. This law explains how many molecules move at specific speeds. Whenever the gas deviates from this law, the interactions between the molecules, by altering their speed and direction, work to quickly restore it. Physicists have worked hard, with some success, to use this explanation to clarify the experimental properties of gases, including Mariotte's law.

Consider now the milky way; there also we see an innumerable dust; only the grains of this dust are not atoms, they are stars; these grains move also with high velocities; they act at a distance one upon another, but this action is so slight at great distance that their trajectories are straight; and yet, from time to time, two of them may approach near enough to be deviated from their path, like a comet which has passed too near Jupiter. In a word, to the eyes of a giant for whom our suns would be as for us our atoms, the milky way would seem only a bubble of gas.

Consider the Milky Way now; there too we see countless particles; but these particles aren’t atoms, they’re stars; these stars also move at high speeds; they influence each other from a distance, but this influence is so minimal at great distances that their paths remain straight; yet, occasionally, two of them might get close enough to change their path, like a comet that comes too near Jupiter. In short, to the eyes of a giant for whom our suns would be like our atoms, the Milky Way would appear just as a bubble of gas.

Such was Lord Kelvin's leading idea. What may be drawn from this comparison? In how far is it exact? This is what we are to investigate together; but before reaching a definite conclusion, and without wishing to prejudge it, we foresee that the kinetic theory of gases will be for the astronomer a model he should not follow blindly, but from which he may advantageously draw inspiration. Up to the present, celestial mechanics has attacked only the solar system or certain systems of double stars. Before the assemblage presented by the milky way, or the agglomeration of stars, or the resolvable nebulae it recoils, because it sees therein only chaos. But the milky way is not more complicated than a gas; the statistical methods founded upon the calculus of probabilities applicable to a gas are also applicable to it. Before all, it is important to grasp the resemblance of the two cases, and their difference.

Such was Lord Kelvin's main idea. What can we learn from this comparison? How accurate is it? This is what we will explore together; but before reaching a clear conclusion, and without wanting to influence it, we anticipate that the kinetic theory of gases will serve as a model for astronomers—a model they shouldn't follow blindly, but can use for inspiration. So far, celestial mechanics has only focused on the solar system or certain double star systems. When faced with the vastness of the Milky Way, the grouping of stars, or the resolvable nebulae, it hesitates because it only sees chaos there. However, the Milky Way is no more complex than a gas; the statistical methods based on the calculus of probabilities that apply to a gas can also be applied to it. First and foremost, it's essential to understand the similarities and differences between the two cases.

Lord Kelvin has striven to determine in this manner the[Pg 525] dimensions of the milky way; for that we are reduced to counting the stars visible in our telescopes; but we are not sure that behind the stars we see, there are not others we do not see; so that what we should measure in this way would not be the size of the milky way, it would be the range of our instruments.

Lord Kelvin has worked to find out the size of the milky way by doing this[Pg 525]; since we can only count the stars we can see through our telescopes, we can't be sure there aren't more stars hidden behind those we observe. Therefore, what we would actually be measuring is not the size of the milky way, but rather the limitations of our instruments.

The new theory comes to offer us other resources. In fact, we know the motions of the stars nearest us, and we can form an idea of the rapidity and direction of their velocities. If the ideas above set forth are exact, these velocities should follow Maxwell's law, and their mean value will tell us, so to speak, that which corresponds to the temperature of our fictitious gas. But this temperature depends itself upon the dimensions of our gas bubble. In fact, how will a gaseous mass let loose in the void act, if its elements attract one another according to Newton's law? It will take a spherical form; moreover, because of gravitation, the density will be greater at the center, the pressure also will increase from the surface to the center because of the weight of the outer parts drawn toward the center; finally, the temperature will increase toward the center: the temperature and the pressure being connected by the law called adiabatic, as happens in the successive layers of our atmosphere. At the surface itself, the pressure will be null, and it will be the same with the absolute temperature, that is to say with the velocity of the molecules.

The new theory provides us with additional resources. We know the movements of the stars closest to us and can gauge how fast they’re moving and in what direction. If the ideas mentioned earlier are correct, these velocities should align with Maxwell's law, and their average value will indicate what corresponds to the temperature of our hypothetical gas. However, this temperature depends on the size of our gas bubble. How will a mass of gas released into the void behave if its particles attract each other according to Newton's law? It will take on a spherical shape; additionally, due to gravity, the density will be higher at the center, and the pressure will increase from the surface to the center because of the weight of the outer layers being pulled inward. Ultimately, the temperature will rise toward the center, as temperature and pressure are related by the adiabatic law, similar to what we see in the different layers of our atmosphere. At the surface, the pressure will be zero, and the absolute temperature—or the speed of the molecules—will also be zero.

A question comes here: I have spoken of the adiabatic law, but this law is not the same for all gases, since it depends upon the ratio of their two specific heats; for the air and like gases, this ratio is 1.42; but is it to air that it is proper to liken the milky way? Evidently not, it should be regarded as a mono-atomic gas, like mercury vapor, like argon, like helium, that is to say that the ratio of the specific heats should be taken equal to 1.66. And, in fact, one of our molecules would be for example the solar system; but the planets are very small personages, the sun alone counts, so that our molecule is indeed mono-atomic. And even if we take a double star, it is probable that the action of a strange star which might approach it would become sufficiently sensible to deviate the motion of general translation of the system much before being able to trouble the relative orbits[Pg 526] of the two components; the double star, in a word, would act like an indivisible atom.

A question arises here: I have mentioned the adiabatic law, but this law isn't the same for all gases, as it depends on the ratio of their specific heats. For air and similar gases, this ratio is 1.42; but is it appropriate to compare the Milky Way to air? Clearly not; it should be considered a mono-atomic gas, like mercury vapor, argon, or helium, meaning the ratio of specific heats should be taken as 1.66. In fact, one of our molecules could represent the solar system; however, the planets are very minor players, and only the sun truly matters, making our molecule mono-atomic. Even if we consider a binary star system, it's likely that the influence of a nearby star would have a noticeable effect on the overall motion of the system long before disrupting the relative orbits of its two components; in short, the binary star would behave like an indivisible atom.[Pg 526]

However that may be, the pressure, and consequently the temperature, at the center of the gaseous sphere would be by so much the greater as the sphere was larger since the pressure increases by the weight of all the superposed layers. We may suppose that we are nearly at the center of the milky way, and by observing the mean proper velocity of the stars, we shall know that which corresponds to the central temperature of our gaseous sphere and we shall determine its radius.

However that may be, the pressure, and therefore the temperature, at the center of the gaseous sphere would be greater as the sphere got larger since the pressure increases with the weight of all the layers above it. We can assume that we are close to the center of the Milky Way, and by observing the average speed of the stars, we will be able to determine the temperature at the center of our gaseous sphere and find its radius.

We may get an idea of the result by the following considerations: make a simpler hypothesis: the milky way is spherical, and in it the masses are distributed in a homogeneous manner; thence results that the stars in it describe ellipses having the same center. If we suppose the velocity becomes nothing at the surface, we may calculate this velocity at the center by the equation of vis viva. Thus we find that this velocity is proportional to the radius of the sphere and to the square root of its density. If the mass of this sphere was that of the sun and its radius that of the terrestrial orbit, this velocity would be (it is easy to see) that of the earth in its orbit. But in the case we have supposed, the mass of the sun should be distributed in a sphere of radius 1,000,000 times greater, this radius being the distance of the nearest stars; the density is therefore 1018 times less; now, the velocities are of the same order, therefore it is necessary that the radius be 109 times greater, be 1,000 times the distance of the nearest stars, which would give about a thousand millions of stars in the milky way.

We can get an idea of the outcome by considering the following points: let’s simplify our hypothesis: assume the Milky Way is spherical, and that mass is evenly spread throughout it; as a result, the stars within it would orbit in ellipses that share the same center. If we assume the velocity drops to zero at the surface, we can then calculate this velocity at the center using the equation of kinetic energy. This leads us to conclude that this velocity is proportional to the radius of the sphere and the square root of its density. If this sphere's mass were equal to the sun's and its radius matched that of Earth’s orbit, the resulting velocity would be (as can be easily seen) that of Earth in its orbit. However, in the scenario we’ve proposed, the sun’s mass would need to be distributed across a sphere with a radius 1,000,000 times larger, which would be the distance to the nearest stars; therefore, the density is 1018 times lower. Since the velocities are comparable, it follows that the radius needs to be 109 times greater, making it 1,000 times the distance to the nearest stars, which would imply there are about a billion stars in the Milky Way.

But you will say these hypothesis differ greatly from the reality; first, the milky way is not spherical and we shall soon return to this point, and again the kinetic theory of gases is not compatible with the hypothesis of a homogeneous sphere. But in making the exact calculation according to this theory, we should find a different result, doubtless, but of the same order of magnitude; now in such a problem the data are so uncertain that the order of magnitude is the sole end to be aimed at.

But you might say these hypotheses differ greatly from reality; first, the Milky Way isn't spherical, and we'll return to this point soon. Also, the kinetic theory of gases doesn't align with the idea of a homogeneous sphere. However, if we were to do the precise calculations based on this theory, we would undoubtedly get a different result, but it would likely be of a similar order of magnitude. In this type of problem, the data is so uncertain that aiming for just the order of magnitude is the main goal.

And here a first remark presents itself; Lord Kelvin's result, which I have obtained again by an approximative calculation,[Pg 527] agrees sensibly with the evaluations the observers have made with their telescopes; so that we must conclude we are very near to piercing through the milky way. But that enables us to answer another question. There are the stars we see because they shine; but may there not be dark stars circulating in the interstellar spaces whose existence might long remain unknown? But then, what Lord Kelvin's method would give us would be the total number of stars, including the dark stars; as his figure is comparable to that the telescope gives, this means there is no dark matter, or at least not so much as of shining matter.

And here’s a first point to note: Lord Kelvin's result, which I've verified again through an approximate calculation,[Pg 527] aligns closely with the estimates made by observers using their telescopes; so we must conclude we're very close to seeing through the Milky Way. This lets us tackle another question. There are the stars we see because they shine; but could there be dark stars moving around in interstellar space whose existence might remain unknown for a long time? However, what Lord Kelvin's method would give us is the total number of stars, including the dark ones; since his figure is similar to what the telescope provides, this suggests there isn't any dark matter, or at least not as much as there is shining matter.

Before going further, we must look at the problem from another angle. Is the milky way thus constituted truly the image of a gas properly so called? You know Crookes has introduced the notion of a fourth state of matter, where gases having become too rarefied are no longer true gases and become what he calls radiant matter. Considering the slight density of the milky way, is it the image of gaseous matter or of radiant matter? The consideration of what is called the free path will furnish us the answer.

Before we go any further, we need to examine the problem from a different perspective. Is the Milky Way really a proper gas? You know Crookes introduced the idea of a fourth state of matter, where gases that become too thin are no longer true gases and turn into what he calls radiant matter. Given the low density of the Milky Way, is it an example of gaseous matter or radiant matter? Looking at what’s referred to as the free path will give us the answer.

The trajectory of a gaseous molecule may be regarded as formed of straight segments united by very small arcs corresponding to the successive impacts. The length of each of these segments is what is called the free path; of course this length is not the same for all the segments and for all the molecules; but we may take a mean; this is what is called the mean path. This is the greater the less the density of the gas. The matter will be radiant if the mean path is greater than the dimensions of the receptacle wherein the gas is enclosed, so that a molecule has a chance to go across the whole receptacle without undergoing an impact; if the contrary be the case, it is gaseous. From this it follows that the same fluid may be radiant in a little receptacle and gaseous in a big one; this perhaps is why, in a Crookes tube, it is necessary to make the vacuum by so much the more complete as the tube is larger.

The path of a gas molecule can be thought of as made up of straight segments connected by tiny curves that happen during collisions. The length of each of these segments is known as the free path. Naturally, this length varies among different segments and molecules, but we can calculate an average; this is called the mean path. The mean path increases as the gas density decreases. The substance will be considered radiant if the mean path is longer than the dimensions of the container holding the gas, allowing a molecule the chance to cross the entire container without colliding. If not, it's considered gaseous. This means the same substance can be radiant in a small container and gaseous in a larger one; perhaps this is why, in a Crookes tube, it's necessary to create a more complete vacuum as the tube size increases.

How is it then for the milky way? This is a mass of gas of which the density is very slight, but whose dimensions are very great; has a star chances of traversing it without undergoing an impact, that is to say without passing sufficiently near another[Pg 528] star to be sensibly deviated from its route! What do we mean by sufficiently near? That is perforce a little arbitrary; take it as the distance from the sun to Neptune, which would represent a deviation of a dozen degrees; suppose therefore each of our stars surrounded by a protective sphere of this radius; could a straight pass between these spheres? At the mean distance of the stars of the milky way, the radius of these spheres will be seen under an angle of about a tenth of a second; and we have a thousand millions of stars. Put upon the celestial sphere a thousand million little circles of a tenth of a second radius. Are the chances that these circles will cover a great number of times the celestial sphere? Far from it; they will cover only its sixteen thousandth part. So the milky way is not the image of gaseous matter, but of Crookes' radiant matter. Nevertheless, as our foregoing conclusions are happily not at all precise, we need not sensibly modify them.

How is it for the Milky Way? It's a massive cloud of gas with very low density but enormous size; stars have a good chance of moving through it without colliding, meaning they can travel without getting close enough to another star to significantly alter their path! What do we mean by sufficiently near? That’s somewhat arbitrary; let’s consider the distance from the Sun to Neptune, which would indicate a deviation of about twelve degrees. So, let's imagine each of our stars is surrounded by a protective sphere with that radius. Could a straight line pass between these spheres? At the average distance of the stars in the Milky Way, the radius of these spheres would appear as an angle of about one-tenth of a second; and we have one trillion stars. If we placed one trillion tiny circles, each with a radius of one-tenth of a second, on the celestial sphere, would these circles cover much of the sky? Not at all; they would only cover one-sixteenth-thousandth of it. Therefore, the Milky Way is not just a mass of gas, but more like Crookes' radiant matter. However, since our previous conclusions aren't very precise, we don’t really need to change them.

But there is another difficulty: the milky way is not spherical, and we have reasoned hitherto as if it were, since this is the form of equilibrium a gas isolated in space would take. To make amends, agglomerations of stars exist whose form is globular and to which would better apply what we have hitherto said. Herschel has already endeavored to explain their remarkable appearances. He supposed the stars of the aggregates uniformly distributed, so that an assemblage is a homogeneous sphere; each star would then describe an ellipse and all these orbits would be passed over in the same time, so that at the end of a period the aggregate would take again its primitive configuration and this configuration would be stable. Unluckily, the aggregates do not appear to be homogeneous; we see a condensation at the center, we should observe it even were the sphere homogeneous, since it is thicker at the center; but it would not be so accentuated. We may therefore rather compare an aggregate to a gas in adiabatic equilibrium, which takes the spherical form because this is the figure of equilibrium of a gaseous mass.

But there’s another issue: the Milky Way isn’t spherical, and up until now, we’ve assumed it was, since that’s the shape a gas would take when isolated in space. To make up for this, there are clusters of stars that are globular, and what we’ve discussed so far applies better to them. Herschel has tried to explain their interesting characteristics. He theorized that the stars in these clusters are uniformly distributed, making the collection a homogeneous sphere; in this case, each star would trace an elliptical path, and all these orbits would complete in the same amount of time, so after a certain period, the cluster would return to its original shape, which would be stable. Unfortunately, the clusters don’t seem to be homogeneous; we notice a concentration at the center, which we'd expect even in a homogeneous sphere since it's denser at the center, but it wouldn't be as pronounced. Therefore, we can better compare a cluster to a gas in adiabatic equilibrium, which takes on a spherical form because that’s the shape a gaseous mass settles into when it’s balanced.

But, you will say, these aggregates are much smaller than the milky way, of which they even in probability make part, and even though they be more dense, they will rather present something analogous to radiant matter; now, gases attain their adiabatic[Pg 529] equilibrium only through innumerable impacts of the molecules. That might perhaps be adjusted. Suppose the stars of the aggregate have just enough energy for their velocity to become null when they reach the surface; then they may traverse the aggregate without impact, but arrived at the surface they will go back and will traverse it anew; after a great number of crossings, they will at last be deviated by an impact; under these conditions, we should still have a matter which might be regarded as gaseous; if perchance there had been in the aggregate stars whose velocity was greater, they have long gone away out of it, they have left it never to return. For all these reasons, it would be interesting to examine the known aggregates, to seek to account for the law of the densities, and to see if it is the adiabatic law of gases.

But, you might say, these clusters are much smaller than the Milky Way, which they likely belong to, and even if they are denser, they more closely resemble radiant matter. Gases only reach their adiabatic[Pg 529] equilibrium through countless collisions between molecules. That might be adjusted. Suppose the stars in the cluster have just enough energy for their speed to drop to zero when they reach the surface; then they could pass through the cluster without colliding, but once they hit the surface, they would bounce back and move through it again. After a lot of crossings, they would eventually be deflected by a collision; under these circumstances, we would still have a substance that could be considered gaseous. If there happened to be stars in the cluster with greater speeds, they would have long since left and not returned. For all these reasons, it would be worthwhile to study the known clusters, try to explain the density laws, and see if they follow the adiabatic law of gases.

But to return to the milky way; it is not spherical and would rather be represented as a flattened disc. It is clear then that a mass starting without velocity from the surface will reach the center with different velocities, according as it starts from the surface in the neighborhood of the middle of the disc or just on the border of the disc; the velocity would be notably greater in the latter case. Now, up to the present, we have supposed that the proper velocities of the stars, those we observe, must be comparable to those which like masses would attain; this involves a certain difficulty. We have given above a value for the dimensions of the milky way, and we have deduced it from the observed proper velocities which are of the same order of magnitude as that of the earth in its orbit; but which is the dimension we have thus measured? Is it the thickness? Is it the radius of the disc? It is doubtless something intermediate; but what can we say then of the thickness itself, or of the radius of the disc? Data are lacking to make the calculation; I shall confine myself to giving a glimpse of the possibility of basing an evaluation at least approximate upon a deeper discussion of the proper motions.

But to get back to the Milky Way, it's not a sphere; it’s better described as a flattened disc. It’s clear that a mass starting from rest on the surface will reach the center with different speeds, depending on whether it starts near the middle of the disc or at the edge; the speed will be significantly greater in the latter case. Up until now, we've assumed that the actual velocities of the stars, those we observe, should be comparable to what similar masses would reach; this presents some challenges. We've provided a value for the dimensions of the Milky Way, deduced from the observed proper velocities, which are similar to the Earth's orbital speed; but what dimension are we measuring? Is it the thickness? Is it the radius of the disc? It’s probably something in between; but what can we say about the thickness itself or the radius of the disc? We lack data to make the calculation. I’ll just hint at the possibility of forming at least a rough estimate based on a deeper analysis of the proper motions.

And then we find ourselves facing two hypotheses: either the stars of the milky way are impelled by velocities for the most part parallel to the galactic plane, but otherwise distributed uniformly in all directions parallel to this plane. If this be so, observation of the proper motions should show a preponderance of components parallel to the milky way; this is to be determined,[Pg 530] because I do not know whether a systematic discussion has ever been made from this view-point. On the other hand, such an equilibrium could only be provisory, since because of impacts the molecules, I mean the stars, would in the long run acquire notable velocities in the sense perpendicular to the milky way and would end by swerving from its plane, so that the system would tend toward the spherical form, the only figure of equilibrium of an isolated gaseous mass.

And then we find ourselves considering two possibilities: either the stars in the Milky Way are mostly moving parallel to the galactic plane but are otherwise evenly spread out in all directions along that plane. If this is the case, we should observe that their movements show more activity parallel to the Milky Way; this needs to be investigated,[Pg 530] because I’m not sure if a thorough discussion has ever been conducted from this perspective. On the other hand, this kind of balance could only be temporary, since due to collisions, the molecules—or rather, the stars—would eventually gain significant velocities in directions perpendicular to the Milky Way and would end up drifting away from its plane, leading the system to move toward a spherical shape, which is the only stable form for an isolated mass of gas.

Or else the whole system is impelled by a common rotation, and for that reason is flattened like the earth, like Jupiter, like all bodies that twirl. Only, as the flattening is considerable, the rotation must be rapid; rapid doubtless, but it must be understood in what sense this word is used. The density of the milky way is 1023 times less than that of the sun; a velocity of rotation √1025 times less than that of the sun, for it would, therefore, be the equivalent so far as concerns flattening; a velocity 1012 times slower than that of the earth, say a thirtieth of a second of arc in a century, would be a very rapid rotation, almost too rapid for stable equilibrium to be possible.

Or else the entire system is driven by a common rotation, which is why it's flattened like the Earth, like Jupiter, and like all rotating bodies. However, since the flattening is significant, the rotation must be fast; fast, no doubt, but we need to understand what that means. The density of the Milky Way is 1023 times less than that of the Sun; a rotation speed √1025 times slower than that of the Sun, which would be the equivalent in terms of flattening; a speed 1012 times slower than that of the Earth, about a thirtieth of a second of arc in a century, would be considered a very fast rotation, almost too fast for stable equilibrium to be feasible.

In this hypothesis, the observable proper motions would appear to us uniformly distributed, and there would no longer be a preponderance of components parallel to the galactic plane.

In this hypothesis, the observable proper motions would seem uniformly distributed to us, and there would no longer be a majority of components aligned with the galactic plane.

They will tell us nothing about the rotation itself, since we belong to the turning system. If the spiral nebulæ are other milky ways, foreign to ours, they are not borne along in this rotation, and we might study their proper motions. It is true they are very far away; if a nebula has the dimensions of the milky way and if its apparent radius is for example 20´´, its distance is 10,000 times the radius of the milky way.

They won't tell us anything about the rotation itself, since we're part of the rotating system. If the spiral nebulae are other galaxies, separate from ours, they're not affected by this rotation, and we could examine their own movements. It's true they are really far away; if a nebula is the size of the Milky Way and its apparent radius is, for instance, 20'', its distance is 10,000 times the radius of the Milky Way.

But that makes no difference, since it is not about the translation of our system that we ask information from them, but about its rotation. The fixed stars, by their apparent motion, reveal to us the diurnal rotation of the earth, though their distance is immense. Unluckily, the possible rotation of the milky way, however rapid it may be relatively, is very slow viewed absolutely, and besides the pointings on nebulæ can not be very precise; therefore thousands of years of observations would be necessary to learn anything.[Pg 531]

But that doesn't matter, because we're not asking them about the translation of our system, but about its rotation. The fixed stars, by their apparent motion, show us the daily rotation of the Earth, even though they're incredibly far away. Unfortunately, the possible rotation of the Milky Way, no matter how fast it might seem relatively, is very slow when looked at absolutely, and on top of that, the measurements of nebulae can't be very accurate; so we would need thousands of years of observations to learn anything.[Pg 531]

However that may be, in this second hypothesis, the figure of the milky way would be a figure of final equilibrium.

However that may be, in this second hypothesis, the shape of the Milky Way would represent a state of final balance.

I shall not further discuss the relative value of these two hypotheses since there is a third which is perhaps more probable. We know that among the irresolvable nebulæ, several kinds may be distinguished: the irregular nebulæ like that of Orion, the planetary and annular nebulæ, the spiral nebulæ. The spectra of the first two families have been determined, they are discontinuous; these nebulæ are therefore not formed of stars; besides, their distribution on the heavens seems to depend upon the milky way; whether they have a tendency to go away from it, or on the contrary to approach it, they make therefore a part of the system. On the other hand, the spiral nebulæ are generally considered as independent of the milky way; it is supposed that they, like it, are formed of a multitude of stars, that they are, in a word, other milky ways very far away from ours. The recent investigations of Stratonoff tend to make us regard the milky way itself as a spiral nebula, and this is the third hypothesis of which I wish to speak.

I won't discuss the relative value of these two hypotheses any further since there's a third one that might be more likely. We know that among the unresolved nebulae, we can distinguish several types: irregular nebulae like the one in Orion, planetary and annular nebulae, and spiral nebulae. The spectra of the first two types have been analyzed and they are discontinuous; this means these nebulae are not made up of stars. Furthermore, their distribution in the sky seems to be related to the Milky Way; whether they tend to move away from it or move closer, they are part of the system. On the other hand, the spiral nebulae are generally seen as independent of the Milky Way; it's believed that they, like our galaxy, consist of many stars and are, in fact, other Milky Ways far away from ours. Recent studies by Stratonoff suggest that the Milky Way itself could be considered a spiral nebula, and this is the third hypothesis I want to discuss.

How can we explain the very singular appearances presented by the spiral nebulæ, which are too regular and too constant to be due to chance? First of all, to take a look at one of these representations is enough to see that the mass is in rotation; we may even see what the sense of the rotation is; all the spiral radii are curved in the same sense; it is evident that the moving wing lags behind the pivot and that fixes the sense of the rotation. But this is not all; it is evident that these nebulæ can not be likened to a gas at rest, nor even to a gas in relative equilibrium under the sway of a uniform rotation; they are to be compared to a gas in permanent motion in which internal currents prevail.

How can we explain the unique forms of spiral nebulae, which are too regular and consistent to be random? First of all, just looking at one of these images is enough to see that the mass is rotating; we can even tell the direction of the rotation. All the spiral arms curve in the same direction, and it's clear that the moving wing falls behind the point of rotation, determining the direction of the spin. But that's not all; it's clear these nebulae can't be compared to a stationary gas, nor even to a gas in a steady state under uniform rotation; they are more like a gas in constant motion with dominant internal currents.

Suppose, for example, that the rotation of the central nucleus is rapid (you know what I mean by this word), too rapid for stable equilibrium; then at the equator the centrifugal force will drive it away over the attraction, and the stars will tend to break away at the equator and will form divergent currents; but in going away, as their moment of rotation remains constant, while the radius vector augments, their angular velocity will diminish, and this is why the moving wing seems to lag back.[Pg 532]

Suppose, for example, that the rotation of the central nucleus is fast (you know what I mean by that), too fast for stable balance; then at the equator, the centrifugal force will overpower the attraction, causing the stars to drift away at the equator and create divergent currents. However, as they move away, since their rotation speed remains constant while the radius increases, their angular velocity will decrease, which is why the moving wing appears to lag behind.[Pg 532]

From this point of view, there would not be a real permanent motion, the central nucleus would constantly lose matter which would go out of it never to return, and would drain away progressively. But we may modify the hypothesis. In proportion as it goes away, the star loses its velocity and ends by stopping; at this moment attraction regains possession of it and leads it back toward the nucleus; so there will be centripetal currents. We must suppose the centripetal currents are the first rank and the centrifugal currents the second rank, if we adopt the comparison with a troop in battle executing a change of front; and, in fact, it is necessary that the composite centrifugal force be compensated by the attraction exercised by the central layers of the swarm upon the extreme layers.

From this perspective, there wouldn't be a true permanent motion; the central core would continuously lose matter, which would escape and never return, gradually depleting it. However, we can adjust this idea. As it moves away, the star loses velocity and eventually comes to a stop; at that point, attraction takes over and pulls it back toward the core, creating centripetal currents. We should consider the centripetal currents as primary and the centrifugal currents as secondary, if we compare it to a troop in battle changing formations. Indeed, the combined centrifugal force must be balanced by the attraction from the central layers of the cluster acting on the outer layers.

Besides, at the end of a certain time a permanent régime establishes itself; the swarm being curved, the attraction exercised upon the pivot by the moving wing tends to slow up the pivot and that of the pivot upon the moving wing tends to accelerate the advance of this wing which no longer augments its lag, so that finally all the radii end by turning with a uniform velocity. We may still suppose that the rotation of the nucleus is quicker than that of the radii.

Besides, after a certain amount of time, a stable system sets in; as the swarm curves, the force exerted on the center by the moving wing slows down the center, while the center's pull on the moving wing speeds up the movement of this wing, which stops increasing its delay, so ultimately all the arms move at the same constant speed. We can also assume that the center rotates faster than the arms.

A question remains; why do these centripetal and centrifugal swarms tend to concentrate themselves in radii instead of disseminating themselves a little everywhere? Why do these rays distribute themselves regularly? If the swarms concentrate themselves, it is because of the attraction exercised by the already existing swarms upon the stars which go out from the nucleus in their neighborhood. After an inequality is produced, it tends to accentuate itself in this way.

A question still lingers: why do these centripetal and centrifugal swarms tend to gather in specific areas instead of spreading out evenly? Why do these rays spread out evenly? If the swarms are clustering, it’s because the attraction from the existing swarms pulls the stars that move away from the central point towards them. Once there’s an imbalance, it tends to amplify in this manner.

Why do the rays distribute themselves regularly? That is less obvious. Suppose there is no rotation, that all the stars are in two planes at right angles, in such a way that their distribution is symmetric with regard to these two planes.

Why do the rays spread out evenly? That's less clear. Let's assume there’s no rotation, and all the stars are in two planes at right angles, arranged in a way that their distribution is symmetrical with respect to these two planes.

By symmetry, there would be no reason for their going out of these planes, nor for the symmetry changing. This configuration would give us therefore equilibrium, but this would be an unstable equilibrium.

By symmetry, there would be no reason for them to leave these planes, nor for the symmetry to change. This setup would give us equilibrium, but this would be an unstable equilibrium.

If on the contrary, there is rotation, we shall find an analogous[Pg 533] configuration of equilibrium with four curved rays, equal to each other and intersecting at 90°, and if the rotation is sufficiently rapid, this equilibrium is stable.

If, on the other hand, there is rotation, we will discover a similar[Pg 533] configuration of equilibrium with four curved rays, all equal and intersecting at 90°. If the rotation is fast enough, this equilibrium is stable.

I am not in position to make this more precise: enough if you see that these spiral forms may perhaps some day be explained by only the law of gravitation and statistical consideration recalling those of the theory of gases.

I can't specify this any clearer: it's enough for you to understand that these spiral shapes might someday be explained solely by the law of gravitation and statistical analysis similar to those in gas theory.

What has been said of internal currents shows it is of interest to discuss systematically the aggregate of proper motions; this may be done in a hundred years, when the second edition is issued of the chart of the heavens and compared with the first, that we now are making.

What has been said about internal currents indicates that it’s worth discussing the overall proper motions systematically; this can be done in a hundred years when the second edition of the star chart is released and compared with the first one that we are currently creating.

But, in conclusion, I wish to call your attention to a question, that of the age of the milky way or the nebulæ. If what we think we see is confirmed, we can get an idea of it. That sort of statistical equilibrium of which gases give us the model is established only in consequence of a great number of impacts. If these impacts are rare, it can come about only after a very long time; if really the milky way (or at least the agglomerations which are contained in it), if the nebulæ have attained this equilibrium, this means they are very old, and we shall have an inferior limit of their age. Likewise we should have of it a superior limit; this equilibrium is not final and can not last always. Our spiral nebulæ would be comparable to gases impelled by permanent motions; but gases in motion are viscous and their velocities end by wearing out. What here corresponds to the viscosity (and which depends upon the chances of impact of the molecules) is excessively slight, so that the present régime may persist during an extremely long time, yet not forever, so that our milky ways can not live eternally nor become infinitely old.

But, to wrap things up, I want to bring your attention to a question about the age of the Milky Way and the nebulae. If what we think we see is confirmed, we can gain some insights into it. The kind of statistical equilibrium that gases illustrate is established only after a lot of collisions. If these collisions are rare, it takes a very long time to achieve that equilibrium; if the Milky Way (or at least the clusters within it), or the nebulae, have reached this state, it implies they are very old, and we would have a minimum estimate of their age. Similarly, we should also have a maximum estimate; this equilibrium isn’t permanent and can’t last forever. Our spiral nebulae would be similar to gases driven by ongoing motions; however, gases in motion are viscous, and their speeds eventually diminish. What corresponds to viscosity here (which depends on the likelihood of molecular collisions) is extremely small, allowing the current regime to persist for an incredibly long time, but not indefinitely, meaning our Milky Ways cannot exist forever nor become infinitely old.

And this is not all. Consider our atmosphere: at the surface must reign a temperature infinitely small and the velocity of the molecules there is near zero. But this is a question only of the mean velocity; as a consequence of impacts, one of these molecules may acquire (rarely, it is true) an enormous velocity, and then it will rush out of the atmosphere, and once out, it will never return; therefore our atmosphere drains off thus with extreme slowness. The milky way also from time to time loses a[Pg 534] star by the same mechanism, and that likewise limits its duration.

And that's not all. Think about our atmosphere: at the surface, the temperature should be incredibly low, and the speed of the molecules is nearly zero. But that's just the average speed; due to collisions, a molecule can occasionally gain an enormous speed and then shoot out of the atmosphere, never to return. So, our atmosphere slowly leaks away over time. The Milky Way also occasionally loses a[Pg 534] star through the same process, which also limits its lifespan.

Well, it is certain that if we compute in this manner the age of the milky way, we shall get enormous figures. But here a difficulty presents itself. Certain physicists, relying upon other considerations, reckon that suns can have only an ephemeral existence, about fifty million years; our minimum would be much greater than that. Must we believe that the evolution of the milky way began when the matter was still dark? But how have the stars composing it reached all at the same time adult age, an age so briefly to endure? Or must they reach there all successively, and are those we see only a feeble minority compared with those extinguished or which shall one day light up? But how reconcile that with what we have said above on the absence of a noteworthy proportion of dark matter? Should we abandon one of the two hypotheses, and which? I confine myself to pointing out the difficulty without pretending to solve it; I shall end therefore with a big interrogation point.

Well, it's clear that if we calculate the age of the Milky Way this way, we'll end up with huge numbers. But a problem arises here. Some physicists, based on other theories, argue that stars can only exist for about fifty million years; our minimum estimate would be much higher than that. Should we believe that the evolution of the Milky Way started when the matter was still dark? But how did the stars that make it up all reach maturity at the same time, in such a short lifespan? Or do they reach maturity one after another, and are the stars we see only a small fraction compared to those that have burned out or will one day shine? But how do we reconcile this with what we've previously said about the lack of significant dark matter? Should we abandon one of these two theories, and if so, which one? I’ll just point out the difficulty without trying to solve it; so I’ll conclude with a big question mark.

However, it is interesting to set problems, even when their solution seems very far away.

However, it's interesting to pose challenges, even when the solution feels really far off.


CHAPTER II

French Geodesy

Every one understands our interest in knowing the form and dimensions of our earth; but some persons will perhaps be surprised at the exactitude sought after. Is this a useless luxury? What good are the efforts so expended by the geodesist?

Everyone understands our interest in knowing the shape and size of our planet; however, some people might be surprised by the level of precision we're after. Is this just an unnecessary indulgence? What benefit do the geodesists gain from all their hard work?

Should this question be put to a congressman, I suppose he would say: "I am led to believe that geodesy is one of the most useful of the sciences; because it is one of those costing us most dear." I shall try to give you an answer a little more precise.

Should this question be asked to a congressman, I guess he would say: "I believe geodesy is one of the most valuable sciences because it's one of the most expensive." I'll try to give you a more precise answer.

The great works of art, those of peace as well as those of war, are not to be undertaken without long studies which save much groping, miscalculation and useless expense. These studies can only be based upon a good map. But a map will be only a valueless phantasy if constructed without basing it upon a solid framework. As well make stand a human body minus the skeleton.

The great works of art, whether about peace or war, shouldn't be done without extensive research that avoids a lot of trial and error, miscalculations, and wasted resources. This research can only be effective with a good roadmap. However, a roadmap is meaningless if it's not built on a solid foundation. It's like trying to support a human body without a skeleton.

Now, this framework is given us by geodesic measurements; so, without geodesy, no good map; without a good map, no great public works.

Now, this framework is provided to us by geodesic measurements; so, without geodesy, there’s no good map; without a good map, there are no great public works.

These reasons will doubtless suffice to justify much expense; but these are arguments for practical men. It is not upon these that it is proper to insist here; there are others higher and, everything considered, more important.

These reasons will surely be enough to justify a lot of spending; however, these are points meant for practical individuals. It’s not these that should be emphasized here; there are other considerations that are more elevated and, all things considered, more significant.

So we shall put the question otherwise; can geodesy aid us the better to know nature? Does it make us understand its unity and harmony? In reality an isolated fact is of slight value, and the conquests of science are precious only if they prepare for new conquests.

So let’s rephrase the question: can geodesy help us understand nature better? Does it help us grasp its unity and harmony? In truth, an isolated fact doesn’t carry much weight, and the achievements of science are valuable only if they lead to new discoveries.

If therefore a little hump were discovered on the terrestrial ellipsoid, this discovery would be by itself of no great interest. On the other hand, it would become precious if, in seeking the cause of this hump, we hoped to penetrate new secrets.

If a slight bump was found on the Earth's shape, that discovery wouldn't be very exciting. However, it would become valuable if, by trying to figure out the reason for this bump, we aimed to uncover new secrets.

Well, when, in the eighteenth century, Maupertuis and La Condamine braved such opposite climates, it was not solely to[Pg 536] learn the shape of our planet, it was a question of the whole world-system.

Well, when, in the eighteenth century, Maupertuis and La Condamine faced such different climates, it wasn't just to[Pg 536] figure out the shape of our planet; it was about understanding the entire world-system.

If the earth was flattened, Newton triumphed and with him the doctrine of gravitation and the whole modern celestial mechanics.

If the earth were flat, Newton would still be celebrated, along with his theory of gravity and the entire foundation of modern celestial mechanics.

And to-day, a century and a half after the victory of the Newtonians, think you geodesy has nothing more to teach us?

And today, a hundred and fifty years after the victory of the Newtonians, do you really think geodesy has nothing more to teach us?

We know not what is within our globe. The shafts of mines and borings have let us know a layer of 1 or 2 kilometers thickness, that is to say, the millionth part of the total mass; but what is beneath?

We don't know what’s inside our planet. The shafts of mines and drillings have revealed a layer that's about 1 or 2 kilometers thick, which means just a tiny fraction of the total mass; but what lies beneath?

Of all the extraordinary journeys dreamed by Jules Verne, perhaps that to the center of the earth took us to regions least explored.

Of all the amazing adventures imagined by Jules Verne, maybe the one to the center of the Earth took us to the least explored areas.

But these deep-lying rocks we can not reach, exercise from afar their attraction which operates upon the pendulum and deforms the terrestrial spheroid. Geodesy can therefore weigh them from afar, so to speak, and tell us of their distribution. Thus will it make us really see those mysterious regions which Jules Verne only showed us in imagination.

But these deep-seated rocks are out of our reach; they exert their pull from a distance, affecting the pendulum and altering the shape of the Earth. Geodesy can, in a sense, measure them from afar and inform us about their distribution. This will allow us to truly visualize those mysterious areas that Jules Verne only illustrated in our imagination.

This is not an empty illusion. M. Faye, comparing all the measurements, has reached a result well calculated to surprise us. Under the oceans, in the depths, are rocks of very great density; under the continents, on the contrary, are empty spaces.

This is not a false illusion. M. Faye, after analyzing all the measurements, has come to a conclusion that's likely to surprise us. Beneath the oceans, in the depths, there are rocks with very high density; while under the continents, on the other hand, there are empty spaces.

New observations will modify perhaps the details of these conclusions.

New observations may change some details of these conclusions.

In any case, our venerated dean has shown us where to search and what the geodesist may teach the geologist, desirous of knowing the interior constitution of the earth, and even the thinker wishing to speculate upon the past and the origin of this planet.

In any case, our esteemed dean has shown us where to look and what the geodesist can teach the geologist, who wants to understand the inner structure of the earth, as well as the thinker who wants to ponder the past and the origin of this planet.

And now, why have I entitled this chapter French Geodesy? It is because, in each country, this science has taken, more than all others, perhaps, a national character. It is easy to see why.

And now, why have I named this chapter French Geodesy? It’s because, in every country, this science has developed a distinct national identity more than any other. It’s easy to understand why.

There must be rivalry. The scientific rivalries are always courteous, or at least almost always; in any case, they are necessary, because they are always fruitful. Well, in those enterprises which require such long efforts and so many collaborators, the individual is effaced, in spite of himself, of course; no one has the right to say: this is my work. Therefore it is not between men, but between nations that rivalries go on.[Pg 537]

There has to be competition. Scientific rivalries are generally respectful, or at least mostly; in any case, they are essential because they lead to positive outcomes. In projects that demand lengthy efforts and many collaborators, the individual tends to become less prominent, whether they like it or not; no one can claim, "this is my work." So, the rivalries exist not between individuals, but between nations.[Pg 537]

So we are led to seek what has been the part of France. Her part I believe we are right to be proud of.

So we are led to consider what role France has played. I believe we are justified in feeling proud of her contributions.

At the beginning of the eighteenth century, long discussions arose between the Newtonians who believed the earth flattened, as the theory of gravitation requires, and Cassini, who, deceived by inexact measurements, believed our globe elongated. Only direct observation could settle the question. It was our Academy of Sciences that undertook this task, gigantic for the epoch.

At the start of the eighteenth century, there were lengthy debates between the Newtonians, who thought the earth was flattened as the gravity theory suggested, and Cassini, who, misled by inaccurate measurements, believed our planet was elongated. Only direct observation could resolve the issue. It was our Academy of Sciences that took on this monumental task for that time.

While Maupertuis and Clairaut measured a degree of meridian under the polar circle, Bouguer and La Condamine went toward the Andes Mountains, in regions then under Spain which to-day are the Republic of Ecuador.

While Maupertuis and Clairaut measured a degree of meridian below the polar circle, Bouguer and La Condamine traveled toward the Andes Mountains, in areas then part of Spain that are now the Republic of Ecuador.

Our envoys were exposed to great hardships. Traveling was not as easy as at present.

Our envoys faced a lot of challenges. Traveling wasn't as easy as it is today.

Truly, the country where Maupertuis operated was not a desert and he even enjoyed, it is said, among the Laplanders those sweet satisfactions of the heart that real arctic voyagers never know. It was almost the region where, in our days, comfortable steamers carry, each summer, hosts of tourists and young English people. But in those days Cook's agency did not exist and Maupertuis really believed he had made a polar expedition.

Truly, the country where Maupertuis worked was not a desert, and he reportedly enjoyed, among the Laplanders, those sweet feelings of the heart that real Arctic explorers never experience. It was almost the area where, today, comfortable steamers take groups of tourists and young English people every summer. But back then, Cook's agency didn't exist, and Maupertuis genuinely thought he had completed a polar expedition.

Perhaps he was not altogether wrong. The Russians and the Swedes carry out to-day analogous measurements at Spitzbergen, in a country where there is real ice-cap. But they have quite other resources, and the difference of time makes up for that of latitude.

Perhaps he wasn't completely wrong. The Russians and the Swedes are currently conducting similar measurements at Spitzbergen, in a place where there is a real ice cap. But they have very different resources, and the time difference compensates for the difference in latitude.

The name of Maupertuis has reached us much scratched by the claws of Doctor Akakia; the scientist had the misfortune to displease Voltaire, who was then the king of mind. He was first praised beyond measure; but the flatteries of kings are as much to be dreaded as their displeasure, because the days after are terrible. Voltaire himself knew something of this.

The name Maupertuis has come down to us quite scratched up by the claws of Doctor Akakia; the scientist unfortunately upset Voltaire, who was the top intellectual of his time. He was initially praised to an extreme degree; however, the compliments of kings can be as feared as their wrath, because the aftermath can be dreadful. Voltaire himself experienced this firsthand.

Voltaire called Maupertuis, my amiable master in thinking, marquis of the polar circle, dear flattener out of the world and Cassini, and even, flattery supreme, Sir Isaac Maupertuis; he wrote him: "Only the king of Prussia do I put on a level with you; he only lacks being a geometer." But soon the scene changes, he no longer speaks of deifying him, as in days of yore[Pg 538] the Argonauts, or of calling down from Olympus the council of the gods to contemplate his works, but of chaining him up in a madhouse. He speaks no longer of his sublime mind, but of his despotic pride, plated with very little science and much absurdity.

Voltaire referred to Maupertuis as my friendly mentor in thought, the marquis of the polar circle, the beloved flattening force of the world, and even, in a grand compliment, Sir Isaac Maupertuis; he wrote to him: "I only consider the King of Prussia on your level; he just needs to be a geometer." But soon the tone shifts; he no longer talks about elevating him like he did back in the days of the Argonauts, or calling down the gods from Olympus to admire his work. Instead, he talks about imprisoning him in a mental institution. He no longer describes his brilliant mind; instead, he mentions his arrogant pride, which is covered with very little knowledge and a lot of nonsense.[Pg 538]

I care not to relate these comico-heroic combats; but permit me some reflections on two of Voltaire's verses. In his 'Discourse on Moderation' (no question of moderation in praise and criticism), the poet has written:

I don't want to talk about these funny and heroic battles; instead, let me share some thoughts on two of Voltaire's lines. In his 'Discourse on Moderation' (there's no need for moderation when it comes to praise and criticism), the poet wrote:

You have confirmed in regions drear
What Newton discerned without going abroad.

You have confirmed in gloomy areas
What Newton figured out without traveling.

These two verses (which replace the hyperbolic praises of the first period) are very unjust, and doubtless Voltaire was too enlightened not to know it.

These two verses (which take the place of the exaggerated praises from the first period) are very unfair, and surely Voltaire was too insightful not to realize it.

Then, only those discoveries were esteemed which could be made without leaving one's house.

Then, only those discoveries were valued that could be made without leaving home.

To-day, it would rather be theory that one would make light of.

Today, it would likely be theory that one would dismiss.

This is to misunderstand the aim of science.

This misses the point of science.

Is nature governed by caprice, or does harmony rule there? That is the question. It is when it discloses to us this harmony that science is beautiful and so worthy to be cultivated. But whence can come to us this revelation, if not from the accord of a theory with experiment? To seek whether this accord exists or if it fails, this therefore is our aim. Consequently these two terms, which we must compare, are as indispensable the one as the other. To neglect one for the other would be nonsense. Isolated, theory would be empty, experiment would be blind; each would be useless and without interest.

Is nature driven by randomness, or is there a sense of harmony at play? That's the question. It's when this harmony reveals itself to us that science becomes beautiful and truly worth pursuing. But where can we find this revelation, if not through the alignment of a theory with experimentation? Our goal is to investigate whether this alignment exists or not. Therefore, these two elements we need to compare are equally essential. Ignoring one for the sake of the other would be foolish. Alone, theory would be meaningless, and experimentation would be aimless; both would be ineffective and unengaging.

Maupertuis therefore deserves his share of glory. Truly, it will not equal that of Newton, who had received the spark divine; nor even that of his collaborator Clairaut. Yet it is not to be despised, because his work was necessary, and if France, outstripped by England in the seventeenth century, has so well taken her revenge in the century following, it is not alone to the genius of Clairauts, d'Alemberts, Laplaces that she owes it; it is also to the long patience of the Maupertuis and the La Condamines.[Pg 539]

Maupertuis definitely deserves his share of credit. True, it won’t match Newton’s, who had the divine spark; nor even that of his collaborator Clairaut. But it shouldn’t be overlooked, because his work was essential, and if France, which fell behind England in the seventeenth century, has made such a strong comeback in the next century, it’s not just thanks to the genius of Clairaut, d'Alembert, and Laplace; it’s also due to the long efforts of Maupertuis and La Condamine.[Pg 539]

We reach what may be called the second heroic period of geodesy. France is torn within. All Europe is armed against her; it would seem that these gigantic combats might absorb all her forces. Far from it; she still has them for the service of science. The men of that time recoiled before no enterprise, they were men of faith.

We enter what could be considered the second heroic era of geodesy. France is struggling internally. All of Europe is prepared for battle against her; it seems that these massive conflicts could drain all her resources. Not at all; she still has them available for the advancement of science. The people of that time did not shy away from any challenge; they were individuals of conviction.

Delambre and Méchain were commissioned to measure an arc going from Dunkerque to Barcelona. This time there was no going to Lapland or to Peru; the hostile squadrons had closed to us the ways thither. But, though the expeditions are less distant, the epoch is so troubled that the obstacles, the perils even, are just as great.

Delambre and Méchain were tasked with measuring an arc from Dunkirk to Barcelona. This time, there was no journey to Lapland or Peru; hostile forces had blocked those paths. However, even though the expeditions are closer, the time is so tumultuous that the challenges, and even the dangers, are just as significant.

In France, Delambre had to fight against the ill-will of suspicious municipalities. One knows that the steeples, which are visible from so far, and can be aimed at with precision, often serve as signal points to geodesists. But in the region Delambre traversed there were no longer any steeples. A certain proconsul had passed there, and boasted of knocking down all the steeples rising proudly above the humble abode of the sans-culottes. Pyramids then were built of planks and covered with white cloth to make them more visible. That was quite another thing: with white cloth! What was this rash person who, upon our heights so recently set free, dared to raise the hateful standard of the counter-revolution? It was necessary to border the white cloth with blue and red bands.

In France, Delambre had to contend with the skepticism of wary local governments. It's well known that steeples, visible from great distances and easily targeted, often serve as reference points for geodesists. However, in the area Delambre crossed, there were no steeples left. A certain official had come through and bragged about tearing down all the steeples that once stood proudly over the homes of the common people. Instead, pyramids were constructed from wooden planks and covered with white cloth to make them more visible. That was a different story: with white cloth! Who was this bold individual that, upon the heights recently liberated, dared to raise the despised banner of the counter-revolution? It was necessary to trim the white cloth with blue and red bands.

Méchain operated in Spain; the difficulties were other; but they were not less. The Spanish peasants were hostile. There steeples were not lacking: but to install oneself in them with mysterious and perhaps diabolic instruments, was it not sacrilege? The revolutionists were allies of Spain, but allies smelling a little of the stake.

Méchain worked in Spain; the challenges were different, but they were no less significant. The Spanish peasants were unfriendly. There were plenty of steeples, but setting up mysterious and possibly diabolical instruments in them—wasn’t that sacrilege? The revolutionaries were allies of Spain, but allies who carried a hint of danger.

"Without cease," writes Méchain, "they threaten to butcher us." Fortunately, thanks to the exhortations of the priests, to the pastoral letters of the bishops, these ferocious Spaniards contented themselves with threatening.

"Without stopping," writes Méchain, "they threaten to kill us." Luckily, thanks to the encouragement from the priests and the letters from the bishops, these fierce Spaniards settled for just making threats.

Some years after Méchain made a second expedition into Spain: he proposed to prolong the meridian from Barcelona to the Balearics. This was the first time it had been attempted to make[Pg 540] the triangulations overpass a large arm of the sea by observing signals installed upon some high mountain of a far-away isle. The enterprise was well conceived and well prepared; it failed however.

A few years after Méchain's second trip to Spain, he suggested extending the meridian from Barcelona to the Balearic Islands. This was the first time anyone had tried to make the triangulations cross a large body of water by observing signals set up on a high mountain on a distant island. The project was well thought out and carefully planned; however, it ultimately failed.

The French scientist encountered all sorts of difficulties of which he complains bitterly in his correspondence. "Hell," he writes, perhaps with some exaggeration—"hell and all the scourges it vomits upon the earth, tempests, war, the plague and black intrigues are therefore unchained against me!"

The French scientist faced all kinds of challenges that he angrily mentions in his letters. "Hell," he writes, maybe a bit dramatically—"hell and all the disasters it unleashes on the earth, storms, war, the plague, and devious schemes are all aimed at me!"

The fact is that he encountered among his collaborators more of proud obstinacy than of good will and that a thousand accidents retarded his work. The plague was nothing, the fear of the plague was much more redoubtable; all these isles were on their guard against the neighboring isles and feared lest they should receive the scourge from them. Méchain obtained permission to disembark only after long weeks upon the condition of covering all his papers with vinegar; this was the antisepsis of that time.

The reality is that he found more stubborn pride among his collaborators than genuine goodwill, and countless setbacks slowed down his progress. The plague itself was not the worst part; the fear of the plague was far more daunting. All these islands were on high alert against their neighbors, worried they might catch the disease from them. Méchain was only allowed to land after several long weeks, on the condition that he covered all his papers with vinegar; that was the antiseptic method of that era.

Disgusted and sick, he had just asked to be recalled, when he died.

Disgusted and feeling unwell, he had just asked to be brought back when he died.

Arago and Biot it was who had the honor of taking up the unfinished work and carrying it on to completion.

Arago and Biot were the ones who had the privilege of continuing the unfinished work and seeing it through to completion.

Thanks to the support of the Spanish government, to the protection of several bishops and, above all, to that of a famous brigand chief, the operations went rapidly forward. They were successfully completed, and Biot had returned to France when the storm burst.

Thanks to the support of the Spanish government, the protection of several bishops, and especially that of a well-known bandit leader, the operations progressed quickly. They were successfully finished, and Biot had returned to France when the storm hit.

It was the moment when all Spain took up arms to defend her independence against France. Why did this stranger climb the mountains to make signals? It was evidently to call the French army. Arago was able to escape the populace only by becoming a prisoner. In his prison, his only distraction was reading in the Spanish papers the account of his own execution. The papers of that time sometimes gave out news prematurely. He had at least the consolation of learning that he died with courage and like a Christian.

It was the moment when all of Spain took up arms to defend its independence against France. Why did this stranger climb the mountains to make signals? It was clearly to call the French army. Arago could only escape the crowd by becoming a prisoner. In his cell, his only distraction was reading about his own execution in the Spanish newspapers. Those papers sometimes reported news too soon. He at least found some comfort in learning that he died bravely and like a Christian.

Even the prison was no longer safe; he had to escape and reach Algiers. There, he embarked for Marseilles on an Algerian[Pg 541] vessel. This ship was captured by a Spanish corsair, and behold Arago carried back to Spain and dragged from dungeon to dungeon, in the midst of vermin and in the most shocking wretchedness.

Even the prison wasn’t safe anymore; he had to escape and get to Algiers. There, he boarded an Algerian[Pg 541] ship headed for Marseilles. This ship was seized by a Spanish pirate, and suddenly, Arago was taken back to Spain and moved from dungeon to dungeon, surrounded by rats and in the most horrible conditions.

If it had only been a question of his subjects and his guests, the dey would have said nothing. But there were on board two lions, a present from the African sovereign to Napoleon. The dey threatened war.

If it had just been about his subjects and his guests, the dey wouldn't have said anything. But there were two lions on board, a gift from the African king to Napoleon. The dey threatened war.

The vessel and the prisoners were released. The port should have been properly reached, since they had on board an astronomer; but the astronomer was seasick, and the Algerian seamen, who wished to make Marseilles, came out at Bougie. Thence Arago went to Algiers, traversing Kabylia on foot in the midst of a thousand perils. He was long detained in Africa and threatened with the convict prison. Finally he was able to get back to France; his observations, which he had preserved and safeguarded under his shirt, and, what is still more remarkable, his instruments had traversed unhurt these terrible adventures. Up to this point, not only did France hold the foremost place, but she occupied the stage almost alone.

The ship and the prisoners were set free. They should have reached the port properly since an astronomer was on board; however, the astronomer got seasick, and the Algerian crew, who wanted to go to Marseilles, disembarked at Bougie. From there, Arago traveled to Algiers, walking through Kabylia while facing countless dangers. He was stuck in Africa for a long time and was threatened with imprisonment. Eventually, he managed to return to France; his observations, which he had kept safe under his shirt, and even more impressively, his instruments made it through those harrowing experiences unscathed. Up to this point, not only was France in the lead, but she was almost the only country on the stage.

In the years which follow, she has not been inactive and our staff-office map is a model. However, the new methods of observation and calculation have come to us above all from Germany and England. It is only in the last forty years that France has regained her rank. She owes it to a scientific officer, General Perrier, who has successfully executed an enterprise truly audacious, the junction of Spain and Africa. Stations were installed on four peaks upon the two sides of the Mediterranean. For long months they awaited a calm and limpid atmosphere. At last was seen the little thread of light which had traversed 300 kilometers over the sea. The undertaking had succeeded.

In the years that followed, she stayed active, and our staff-office map became a model. However, the new methods of observation and calculation mainly came from Germany and England. It’s only in the last forty years that France has regained its status. This is thanks to a scientific officer, General Perrier, who accomplished a truly bold project: connecting Spain and Africa. Stations were set up on four peaks on both sides of the Mediterranean. They waited for many months for calm and clear weather. Finally, a small beam of light was seen traveling 300 kilometers across the sea. The mission was a success.

To-day have been conceived projects still more bold. From a mountain near Nice will be sent signals to Corsica, not now for geodesic determinations, but to measure the velocity of light. The distance is only 200 kilometers; but the ray of light is to make the journey there and return, after reflection by a mirror installed in Corsica. And it should not wander on the way, for it must return exactly to the point of departure.[Pg 542]

Today, even bolder projects have been conceived. Signals will be sent from a mountain near Nice to Corsica, not for geodesic measurements, but to measure the speed of light. The distance is just 200 kilometers; however, the light beam will travel to Corsica and back after being reflected by a mirror set up there. It must not stray along the way, as it needs to return precisely to the starting point.[Pg 542]

Ever since, the activity of French geodesy has never slackened. We have no more such astonishing adventures to tell; but the scientific work accomplished is immense. The territory of France beyond the sea, like that of the mother country, is covered by triangles measured with precision.

Ever since, the work of French geodesy has never slowed down. We don’t have any more incredible adventures to share, but the scientific achievements are vast. The territory of France overseas, just like that of the mother country, is mapped out with precisely measured triangles.

We have become more and more exacting and what our fathers admired does not satisfy us to-day. But in proportion as we seek more exactitude, the difficulties greatly increase; we are surrounded by snares and must be on our guard against a thousand unsuspected causes of error. It is needful, therefore, to create instruments more and more faultless.

We have become increasingly demanding, and what our fathers admired no longer satisfies us today. However, as we strive for greater accuracy, the challenges grow significantly; we are surrounded by traps and must remain vigilant against a thousand hidden sources of error. Therefore, it is essential to develop instruments that are increasingly flawless.

Here again France has not let herself be distanced. Our appliances for the measurement of bases and angles leave nothing to desire, and, I may also mention the pendulum of Colonel Defforges, which enables us to determine gravity with a precision hitherto unknown.

Here again, France has not fallen behind. Our tools for measuring bases and angles are top-notch, and I should also mention Colonel Defforges's pendulum, which allows us to measure gravity with a precision never seen before.

The future of French geodesy is at present in the hands of the Geographic Service of the army, successively directed by General Bassot and General Berthaut. We can not sufficiently congratulate ourselves upon it. For success in geodesy, scientific aptitudes are not enough; it is necessary to be capable of standing long fatigues in all sorts of climates; the chief must be able to win obedience from his collaborators and to make obedient his native auxiliaries. These are military qualities. Besides, one knows that, in our army, science has always marched shoulder to shoulder with courage.

The future of French geodesy is currently in the hands of the Geographic Service of the army, led by General Bassot and General Berthaut. We can’t congratulate ourselves enough on this. For success in geodesy, scientific skills alone aren’t enough; one must be able to endure long hours in various climates. The leader must be able to gain the cooperation of their team and ensure that their local assistants follow suit. These are military traits. Furthermore, it's well-known that, in our army, science has always gone hand in hand with bravery.

I add that a military organization assures the indispensable unity of action. It would be more difficult to reconcile the rival pretensions of scientists jealous of their independence, solicitous of what they call their fame, and who yet must work in concert, though separated by great distances. Among the geodesists of former times there were often discussions, of which some aroused long echoes. The Academy long resounded with the quarrel of Bouguer and La Condamine. I do not mean to say that soldiers are exempt from passion, but discipline imposes silence upon a too sensitive self-esteem.

I should point out that a military organization ensures the essential unity of action. It would be more challenging to reconcile the conflicting ambitions of scientists who are protective of their independence and concerned about what they consider their reputation, yet must collaborate even when separated by great distances. In the past, geodesists often had debates, some of which created lasting controversies. The Academy was long filled with the dispute between Bouguer and La Condamine. I'm not saying that soldiers are free from passion, but discipline keeps a sensitive ego in check.

Several foreign governments have called upon our officers to[Pg 543] organize their geodesic service: this is proof that the scientific influence of France abroad has not declined.

Several foreign governments have asked our officials to[Pg 543] set up their geodesic service: this shows that France's scientific influence internationally is still strong.

Our hydrographic engineers contribute also to the common achievement a glorious contingent. The survey of our coasts, of our colonies, the study of the tides, offer them a vast domain of research. Finally I may mention the general leveling of France which is carried out by the ingenious and precise methods of M. Lallemand.

Our hydrographic engineers also contribute to a remarkable collective achievement. The survey of our coastlines and colonies, as well as the study of the tides, provide them with a wide range of research opportunities. Lastly, I should mention the comprehensive leveling of France, which is being carried out using the clever and accurate methods of M. Lallemand.

With such men we are sure of the future. Moreover, work for them will not be lacking; our colonial empire opens for them immense expanses illy explored. That is not all: the International Geodetic Association has recognized the necessity of a new measurement of the arc of Quito, determined in days of yore by La Condamine. It is France that has been charged with this operation; she had every right to it, since our ancestors had made, so to speak, the scientific conquest of the Cordilleras. Besides, these rights have not been contested and our government has undertaken to exercise them.

With these men, we’re confident about the future. Plus, there won’t be any shortage of work for them; our colonial empire offers them vast, mostly unexplored areas. That’s not all: the International Geodetic Association has acknowledged the need for a new measurement of the arc of Quito, which was established long ago by La Condamine. France has been assigned to carry out this task; we have every right to it, as our ancestors essentially made the scientific conquest of the Cordilleras. Furthermore, these rights haven’t been challenged, and our government has committed to uphold them.

Captains Maurain and Lacombe completed a first reconnaissance, and the rapidity with which they accomplished their mission, crossing the roughest regions and climbing the most precipitous summits, is worthy of all praise. It won the admiration of General Alfaro, President of the Republic of Ecuador, who called them 'los hombres de hierro,' the men of iron.

Captains Maurain and Lacombe finished their first reconnaissance, and the speed with which they completed their mission, crossing the toughest areas and climbing the steepest peaks, deserves all the praise. It earned the admiration of General Alfaro, President of the Republic of Ecuador, who referred to them as 'los hombres de hierro,' the men of iron.

The final commission then set out under the command of Lieutenant-Colonel (then Major) Bourgeois. The results obtained have justified the hopes entertained. But our officers have encountered unforeseen difficulties due to the climate. More than once, one of them has been forced to remain several months at an altitude of 4,000 meters, in the clouds and the snow, without seeing anything of the signals he had to aim at and which refused to unmask themselves. But thanks to their perseverance and courage, there resulted from this only a delay and an increase of expense, without the exactitude of the measurements suffering therefrom.

The final commission then set out under the command of Lieutenant-Colonel (then Major) Bourgeois. The results achieved have met the expectations. However, our officers faced unexpected challenges due to the climate. More than once, one of them had to stay for several months at an altitude of 4,000 meters, in the clouds and snow, without being able to see any of the signals he needed to aim at, which remained hidden. But thanks to their persistence and bravery, this only led to delays and increased costs, without compromising the accuracy of the measurements.


GENERAL CONCLUSIONS

What I have sought to explain in the preceding pages is how the scientist should guide himself in choosing among the innumerable facts offered to his curiosity, since indeed the natural limitations of his mind compel him to make a choice, even though a choice be always a sacrifice. I have expounded it first by general considerations, recalling on the one hand the nature of the problem to be solved and on the other hand seeking to better comprehend that of the human mind, which is the principal instrument of the solution. I then have explained it by examples; I have not multiplied them indefinitely; I also have had to make a choice, and I have chosen naturally the questions I had studied most. Others would doubtless have made a different choice; but what difference, because I believe they would have reached the same conclusions.

What I’ve tried to explain in the previous pages is how a scientist should decide among the countless facts available to their curiosity, since the natural limitations of their mind require them to make a choice, even if that choice always involves some sacrifice. I started by discussing general ideas, highlighting both the nature of the problem to be solved and trying to better understand the human mind, which is the main tool for finding the solution. Then, I illustrated my points with examples; I didn’t include an endless number of them; I had to make a selection and I naturally chose the questions I had studied the most. Others would likely have made different choices, but it wouldn’t matter much, as I believe they would have come to the same conclusions.

There is a hierarchy of facts; some have no reach; they teach us nothing but themselves. The scientist who has ascertained them has learned nothing but a fact, and has not become more capable of foreseeing new facts. Such facts, it seems, come once, but are not destined to reappear.

There’s a hierarchy of facts; some have no impact; they only teach us about themselves. The scientist who has discovered them has gained nothing more than a fact and hasn’t become better at predicting new facts. These kinds of facts seem to come once but aren’t meant to come back again.

There are, on the other hand, facts of great yield; each of them teaches us a new law. And since a choice must be made, it is to these that the scientist should devote himself.

There are, on the other hand, facts with great value; each one teaches us a new principle. And since a choice must be made, it's to these that the scientist should dedicate himself.

Doubtless this classification is relative and depends upon the weakness of our mind. The facts of slight outcome are the complex facts, upon which various circumstances may exercise a sensible influence, circumstances too numerous and too diverse for us to discern them all. But I should rather say that these are the facts we think complex, since the intricacy of these circumstances surpasses the range of our mind. Doubtless a mind vaster and finer than ours would think differently of them. But what matter; we can not use that superior mind, but only our own.

This classification is definitely relative and depends on the limitations of our minds. The facts with minor outcomes are actually complex, influenced by a variety of circumstances that are too numerous and too different for us to recognize all of them. However, I would argue that these are the facts we perceive as complex, since the intricacy of these circumstances goes beyond our understanding. A mind that is larger and more refined than ours would likely see them differently. But what does it matter? We can only use our own minds, not that superior one.

The facts of great outcome are those we think simple; may be they really are so, because they are influenced only by a small[Pg 545] number of well-defined circumstances, may be they take on an appearance of simplicity because the various circumstances upon which they depend obey the laws of chance and so come to mutually compensate. And this is what happens most often. And so we have been obliged to examine somewhat more closely what chance is.

The facts of significant results are often seen as simple; maybe they truly are because they are affected by a small[Pg 545] number of clear circumstances. Perhaps they seem simple because the different factors they rely on adhere to the laws of chance, causing them to balance each other out. This is what usually occurs. Therefore, we've had to take a closer look at what chance really is.

Facts where the laws of chance apply become easy of access to the scientist who would be discouraged before the extraordinary complication of the problems where these laws are not applicable. We have seen that these considerations apply not only to the physical sciences, but to the mathematical sciences. The method of demonstration is not the same for the physicist and the mathematician. But the methods of invention are very much alike. In both cases they consist in passing up from the fact to the law, and in finding the facts capable of leading to a law.

Facts governed by the laws of chance are easily accessible to scientists who might feel overwhelmed by the complexity of problems where these laws don’t apply. We've noted that this idea relates to both the physical sciences and the mathematical sciences. The approach to proving theories differs between physicists and mathematicians. However, their methods of invention are quite similar. In both cases, they move from the facts to establish a law and in discovering facts that can lead to a law.

To bring out this point, I have shown the mind of the mathematician at work, and under three forms: the mind of the mathematical inventor and creator; that of the unconscious geometer who among our far distant ancestors, or in the misty years of our infancy, has constructed for us our instinctive notion of space; that of the adolescent to whom the teachers of secondary education unveil the first principles of the science, seeking to give understanding of the fundamental definitions. Everywhere we have seen the rôle of intuition and of the spirit of generalization without which these three stages of mathematicians, if I may so express myself, would be reduced to an equal impotence.

To illustrate this point, I've shown how a mathematician's mind operates in three ways: the mind of the mathematical inventor and creator; that of the unconscious geometer from our distant ancestors, or during the early stages of human development, who has shaped our instinctive understanding of space; and that of the teenager who learns the basic principles of the subject from secondary education teachers, aiming to grasp the essential definitions. Throughout, we've observed the importance of intuition and the spirit of generalization, without which these three stages of mathematicians, as I might put it, would be equally powerless.

And in the demonstration itself, the logic is not all; the true mathematical reasoning is a veritable induction, different in many regards from the induction of physics, but proceeding like it from the particular to the general. All the efforts that have been made to reverse this order and to carry back mathematical induction to the rules of logic have eventuated only in failures, illy concealed by the employment of a language inaccessible to the uninitiated. The examples I have taken from the physical sciences have shown us very different cases of facts of great outcome. An experiment of Kaufmann on radium rays revolutionizes at the same time mechanics, optics and astronomy. Why? Because in proportion as these sciences have developed,[Pg 546] we have the better recognized the bonds uniting them, and then we have perceived a species of general design of the chart of universal science. There are facts common to several sciences, which seem the common source of streams diverging in all directions and which are comparable to that knoll of Saint Gothard whence spring waters which fertilize four different valleys.

And in the demonstration itself, logic isn’t everything; true mathematical reasoning is a genuine induction, which is different in many ways from the induction used in physics, but it still moves from the specific to the general. All attempts to flip this order and trace mathematical induction back to the rules of logic have ended in failures, poorly masked by using language that’s hard for newcomers to understand. The examples I’ve taken from the physical sciences have shown us very different cases of significant facts. An experiment by Kaufmann on radium rays simultaneously transforms mechanics, optics, and astronomy. Why? Because as these sciences have advanced,[Pg 546] we have better recognized the connections between them, and we’ve begun to see a sort of overarching design of universal science. There are facts that are shared among several sciences, which appear to be the common source of streams that diverge in all directions, comparable to that knoll of Saint Gothard from where waters spring and nourish four different valleys.

And then we can make choice of facts with more discernment than our predecessors who regarded these valleys as distinct and separated by impassable barriers.

And then we can choose facts with more insight than those who came before us, who saw these valleys as separate and cut off by impossible obstacles.

It is always simple facts which must be chosen, but among these simple facts we must prefer those which are situated upon these sorts of knolls of Saint Gothard of which I have just spoken.

It is always straightforward facts that need to be selected, but among these straightforward facts, we should prefer those that are found on these kinds of knolls of Saint Gothard that I just mentioned.

And when sciences have no direct bond, they still mutually throw light upon one another by analogy. When we studied the laws obeyed by gases we knew we had attacked a fact of great outcome; and yet this outcome was still estimated beneath its value, since gases are, from a certain point of view, the image of the milky way, and those facts which seemed of interest only for the physicist, ere long opened new vistas to astronomy quite unexpected.

And even when sciences don't have a direct connection, they still illuminate each other through analogy. When we examined the laws that govern gases, we realized we were dealing with something significant; however, this significance was still undervalued. From a certain perspective, gases reflect the structure of the Milky Way, and facts that initially seemed relevant only to physicists eventually revealed unexpected new insights for astronomy.

And finally when the geodesist sees it is necessary to move his telescope some seconds to see a signal he has set up with great pains, this is a very small fact; but this is a fact of great outcome, not only because this reveals to him the existence of a small protuberance upon the terrestrial globe, that little hump would be by itself of no great interest, but because this protuberance gives him information about the distribution of matter in the interior of the globe, and through that about the past of our planet, about its future, about the laws of its development.

And finally, when the geodesist realizes he needs to shift his telescope a few seconds to see a signal he has painstakingly set up, it may seem like a minor detail; however, it’s a fact with significant implications. Not only does it reveal the presence of a small bump on the Earth’s surface, which might not seem important on its own, but this bump also provides insights into the distribution of matter inside the planet, and through that, it informs us about the Earth’s past, its future, and the laws governing its evolution.


INDEX

aberration of light, 315, 496

Abraham, 311, 490-1, 505-7, 509, 515-6

absolute motion, 107
orientation, 83
space, 85, 93, 246, 257, 353

acceleration, 94, 98, 486, 509

accidental constant, 112
errors, 171, 402

accommodation of the eyes, 67-8

action at a distance, 137

addition, 34

aim of mathematics, 280

alchemists, 11

Alfaro, 543

algebra, 379

analogy, 220

analysis, 218-9, 279

analysis situs, 53, 239, 381

analyst, 210, 221

ancestral experience, 91

Andrade, 93, 104, 228

Andrews, 153

angle sum of triangle, 58

Anglo-Saxons, 3

antinomies, 449, 457, 477

Arago, 540-1

Aristotle, 205, 292, 460

arithmetic, 34, 379, 441, 463

associativity, 35

assumptions, 451, 453

astronomy, 81, 289, 315, 512

Atwood, 446

axiom, 60, 63, 65, 215


Bacon, 128

Bartholi, 503

Bassot, 542

beauty, 349, 368

Becquerel, 312

Beltrami, 56, 58

Bergson, 321

Berkeley, 4

Berthaut, 542

Bertrand, 156, 190, 211, 395

Betti, 239

Biot, 540

bodies, solid, 72

Boltzmann, 304

Bolyai, 56, 201, 203

Borel, 482

Bouguer, 537, 542

Bourgeois, 543

Boutroux, 390, 464

Bradley, 496

Briot, 298

Brownian movement, 152, 410

Bucherer, 507

Burali-Forti, 457-9, 477, 481-2


Caen, 387-8

Calinon, 228

canal rays, 491-2

canals, semicircular, 276

Cantor, 11, 448-9, 457, 459, 477

Cantorism, 381, 382, 480, 484

capillarity, 298

Carlyle, 128

Carnot's principle, 143, 151, 300, 303-5, 399

Cassini, 537

cathode rays, 487-92

cells, 217

center of gravity, 103

central forces, 297

Chaldeans, 290

chance, 395, 408

change of position, 70
state, 70

chemistry of the stars, 295

circle-squarers, 11

Clairaut, 537-8

Clausius, 119, 123, 143

color sensation, 252

Columbus, 228

commutativity, 35-6
[Pg 548]
compensation, 72

complete induction, 40

Comte, 294

Condorcet, 411

contingence, 340

continuity, 173

continuum, 43
amorphous, 238
mathematical, 46
physical, 46, 240
tridimensional, 240

convention, 50, 93, 106, 125, 173, 208, 317, 440, 451

convergence, 67-8

coordinates, 244

Copernicus, 109, 291, 354

Coulomb, 143, 516

Couturat, 450, 453, 456, 460, 462-3, 467, 472-6

creation, mathematical, 383

creed, 1

Crémieu, 168-9, 490

crisis, 303

Crookes, 195, 488, 527-8

crude fact, 326, 330

Curie, 312-3, 318

current, 186

curvature, 58-9

curve, 213, 346

curves without tangents, 51

cut, 52, 256

cyclones, 353


d'Alembert, 538

Darwin, 518-9

De Cyon, 276, 427

Dedekind, 44-5

Defforges, 542

definitions, 430, 453

deformation, 73, 415

Delage, 277

Delambre, 539

Delbeuf, 414

Descartes, 127

determinism, 123, 340

dictionary, 59

didymium, 333

dilatation, 76

dimensions, 53, 68, 78, 241, 256, 426

direction, 69

Dirichlet, 213

dispersion, 141

displacement, 73, 77, 247, 256

distance, 59, 292

distributivity, 36

Du Bois-Reymond, 50


earth, rotation of, 326, 353

eclipse, 326

electricity, 174

electrified bodies, 117

electrodynamic attraction, 308
induction, 188
mass, 311

electrodynamics, 184, 282

electromagnetic theory of light, 301

electrons, 316, 492-4, 505-8, 510, 512-4

elephant, 217, 436

ellipse, 215

Emerson, 203

empiricism, 86, 271

Epimenides, 478-9

equation of Laplace, 283

Erdély, 203

errors, accidental, 171, 402
law of, 119
systematic, 171, 402
theory of, 402, 406

ether, 145, 351

ethics, 205

Euclid, 62, 86, 202-3, 213

Euclidean geometry, 65, 235-6, 337

Euclid's postulate, 83, 91, 124, 353, 443, 453, 468, 470-1

experience, 90-1

experiment, 127, 317, 336, 446


fact, crude, 326, 330
in the rough, 327
scientific, 326

facts, 362, 371

Fahrenheit, 238

Faraday, 150, 192

Faye, 536

Fechner, 46, 52

Fehr, 383

finite, 57
[Pg 549]
Fitzgerald, 415-6, 500-1, 505

Fizeau, 146, 149, 309, 498, 504

Flammarion, 400, 406-7

flattening of the earth, 353

force, 72, 98, 444
direction of, 445
-flow, 284

forces, central, 297
equivalence of, 445
magnitude of, 445

Foucault's pendulum, 85, 109, 353

four dimensions, 78

Fourier, 298-9

Fourier's problem, 317
series, 286

Franklin, 513-4

Fresnel, 132, 140, 153, 174, 176, 181, 351, 498

Fuchsian, 387-8

function, 213
continuous, 218, 288


Galileo, 97, 331, 353-4

gaseous pressure, 141

gases, theory of, 400, 405, 523

Gauss, 384-5, 406

Gay-Lussac, 157

generalize, 342

geodesy, 535

geometer, 83, 210, 438

geometric space, 66

geometry, 72, 81, 125, 207, 380, 428, 442, 467
Euclidean, 65, 93
fourth, 62
non-Euclidean, 55
projective, 201
qualitative, 238
rational, 5, 467
Riemann's, 57
spheric, 59

Gibbs, 304

Goldstein, 492

Gouy, 152, 305, 410

gravitation, 512

Greeks, 93, 368


Hadamard, 459

Halsted, 3, 203, 464, 467

Hamilton, 115

helium, 294

Helmholtz, 56, 115, 118, 141, 190, 196

Hercules, 449

Hermite, 211, 220, 222, 285

Herschel, 528

Hertz, 102, 145, 194-5, 427, 488, 498, 502, 504, 510

Hertzian oscillator, 309, 317

Hilbert, 5, 11, 203, 433, 450-1, 464-8, 471, 475-7, 484

Himstedt, 195

Hipparchus, 291

homogeneity, 74, 423

homogeneous, 67

hydrodynamics, 284

hyperbola, 215

hypotheses, 6, 15, 127, 133

hysteresis, 151


identity of spaces, 268
of two points, 259

illusions, optical, 202

incommensurable numbers, 44

induction, complete, 40, 452-3, 467-8
electromagnetic, 188
mathematical, 40, 220
principle of, 481

inertia, 93, 486, 489, 507

infinite, 448

infinitesimals, 50

inquisitor, 331

integration, 139

interpolation, 131

intuition, 210, 213, 215

invariant, 333

Ionians, 127

ions, 152

irrational number, 44

irreversible phenomena, 151

isotropic, 67


Japanese mice, 277, 427

Jevons, 451

John Lackland, 128

Jules Verne, 111, 536

Jupiter, 131, 157, 231, 289


Kant, 16, 64, 202-3, 450-1, 471
[Pg 550]
Kauffman, 311, 490-1, 495, 506-7, 522, 545

Kazan, 203

Kelvin, 145, 523-4, 526-7

Kepler, 120, 133, 153, 282, 291-2

Kepler's laws, 136, 516

kinematics, 337

kinetic energy, 116
theory of gases, 141

Kirchhoff, 98-9, 103-5

Klein, 60, 211, 287

knowledge, 201

König, 144, 477

Kovalevski, 212, 286

Kronecker, 44


Lacombe, 543

La Condamine, 535, 537-8, 542-3

Lagrange, 98, 151, 179

Laisant, 383

Lallamand, 543

Langevin, 509

Laplace, 298, 398, 514-5, 518, 522, 538

Laplace's equation, 283, 287

Larmor, 145, 150

Lavoisier's principle, 301, 310, 312

law, 207, 291, 395

Leibnitz, 32, 450, 471

Le Roy, 28, 321-6, 332, 335, 337, 347-8, 354, 468

Lesage, 517-21

Liard, 440

Lie, 62-3, 212

light sensations, 252
theory of, 351
velocity of, 232, 312

Lindemann, 508

line, 203, 243

linkages, 144

Lippmann, 196

Lobachevski, 29, 56, 60, 62, 83, 86, 203

Lobachevski's space, 239

local time, 306-7, 499

logic, 214, 435, 448, 460-2, 464

logistic, 457, 472-4

logisticians, 472

Lorentz, 147, 149, 196-7, 306, 308, 311, 315, 415-6, 492, 498-502, 504-9, 512, 514-6, 521

Lotze, 264

luck, 399

Lumen, 407-8


MacCullagh, 150

Mach, 375

Mach-Delage, 276

magnetism, 149

magnitude, 49

Mariotte's law, 120, 132, 157, 342, 524

Maros, 203

mass, 98, 312, 446, 486, 489, 494, 515

mathematical analysis, 218
continuum, 46
creation, 383
induction, 40, 220
physics, 136, 297, 319

mathematics, 369, 448

matter, 492

Maupertuis, 535, 537-8

Maurain, 543

Maxwell, 140, 152, 175, 177, 181, 193, 282-3, 298, 301, 304-5, 351, 503, 524-5

Maxwell-Bartholi, 309, 503-4, 519, 521

Mayer, 119, 123, 300, 312, 318

measurement, 49

Méchain, 539-40

mechanical explanation, 177
mass, 312

mechanics, 92, 444, 486, 496, 512
anthropomorphic, 103
celestial, 279
statistical, 304

Méray, 211

metaphysician, 221

meteorology, 398

mice, 277

Michelson, 306, 309, 311, 316, 498, 500-1

milky way, 523-30

Mill, Stuart, 60-1, 453-4

Monist, 4, 89, 464

moons of Jupiter, 233

Morley, 309

motion of liquids, 283
[Pg 551]of moon, 28
of planets, 341
relative, 107, 487
without deformation, 236

multiplication, 36

muscular sensations, 69


Nagaoka, 317

nature, 127

navigation, 289

neodymium, 333

neomonics, 283

Neumann, 181

Newton, 85, 96, 98, 109, 153, 291, 370, 486, 516, 536, 538

Newton's argument, 108, 334, 343
law, 111, 118, 132, 136, 149, 157, 233, 282, 292, 512, 514-5, 518, 525
principle, 146, 300, 308-9, 312

no-class theory, 478

nominalism, 28, 125, 321, 333, 335

non-Euclidean geometry, 55, 59, 388
language, 127
space, 55, 235, 237
straight, 236, 470
world, 75

number, 31
big, 88
imaginary, 283
incommensurable, 44
transfinite, 448-9
whole, 44, 469


objectivity, 209, 347, 349, 408

optical illusions, 202

optics, 174, 496

orbit of Saturn, 341

order, 385

orientation, 83

osmotic, 141


Padoa, 463

Panthéon, 414

parallax, 470

parallels, 56, 443

Paris time, 233

parry, 419-22, 427

partition, 45

pasigraphy, 456-7

Pasteur, 128

Peano, 450, 456-9, 463, 472

Pender, 490

pendulum, 224

Perrier, 541

Perrin, 195

phosphorus, 333, 468, 470-1

physical continuum, 46

physics, 127, 140, 144, 279, 297

physics of central forces, 297
of the principles, 299

Pieri, 11, 203

Plato, 292

Poincaré, 473

point, 89, 244

Poncelet, 215

postulates, 382

potential energy, 116

praseodymium, 333

principle, 125, 299
Carnot's, 143, 151, 300, 303-5, 399
Clausius', 119, 123, 143
Hamilton's, 115
Lavoisier's, 300, 310
Mayer's, 119, 121, 123, 300, 312, 318
Newton's, 146, 300, 308-9, 312
of action and reaction, 300, 487, 502
of conservation of energy, 300
of degradation of energy, 300
of inertia, 93, 486, 507
of least action, 118, 300
of relativity, 300, 305, 498, 505

Prony, 445

psychologist, 383

Ptolemy, 110, 291, 353-4

Pythagoras, 292


quadrature of the circle, 161

qualitative geometry, 238
space, 207
time, 224

quaternions, 282
[Pg 552]

radiometer, 503

radium, 312, 318, 486-7

Rados, 201

Ramsay, 313

rational geometry, 5, 467

reaction, 502

reality, 217, 340, 349

Réaumur, 238

recurrence, 37

Regnault, 170

relativity, 83, 305, 417, 423, 498, 505

Richard, 477-8, 480-1

Riemann, 56, 62, 145, 212, 239, 243, 381, 432
surface, 211, 287

Roemer, 233

Röntgen, 511, 520

rotation of earth, 225, 331, 353

roulette, 403

Rowland, 194-7, 305, 489

Royce, 202

Russell, 201, 450, 460-2, 464-7, 471-4, 477-82, 484-5


St. Louis exposition, 208, 320

Sarcey, 442

Saturn, 231, 317

Schiller, 202

Schliemann, 19

science, 205, 321, 323, 340, 354

Science and Hypothesis, 205-7, 220, 240, 246-7, 319, 353, 452

semicircular canals, 276

series, development in, 287
Fourier's, 286

Sirius, 226, 229

solid bodies, 72

space, 55, 66, 89, 235, 256
absolute, 85, 93
amorphous, 417
Bolyai, 56
Euclidean, 65
geometric, 66
Lobachevski's, 239
motor, 69
non-Euclidean, 55, 235, 237
of four dimensions, 78
perceptual, 66, 69
tactile, 68, 264
visual, 67, 252

spectra, 316

spectroscope, 294

Spencer, 9

sponge, 219

Stallo, 10

stars, 292

statistical mechanics, 304

straight, 62, 82, 236, 433, 450, 470

Stratonoff, 531

surfaces, 58

systematic errors, 171


tactile space, 68, 264

Tait, 98

tangent, 51

Tannery, 43

teaching, 430, 437

thermodynamics, 115, 119

Thomson, 98, 488

thread, 104

time, 223
equality, 225
local, 306, 307
measure of, 223-4

Tisserand, 515-6

Tolstoi, 354, 362, 368

Tommasina, 519

Transylvania, 203

triangle, 58
angle sum of, 58

truth, 205

Tycho Brahe, 133, 153, 228


unity of nature, 130

universal invariant, 333

Uriel, 203


van der Waals, 153

Vauban, 210

Veblen, 203

velocity of light, 232, 312

Venus of Milo, 201

verification, 33

Virchow, 21
[Pg 553]
visual impressions, 252
space, 67, 252

Volga, 203

Voltaire, 537-8


Weber, 117, 515-6

Weierstrass, 11, 212, 432

Whitehead, 472, 481-2

whole numbers, 44

Wiechert, 145, 488


x-rays, 152, 511, 520


Zeeman effect, 152, 196, 317, 494

Zeno, 382

Zermelo, 477, 482-3

zigzag theory, 478

zodiac, 398, 404

aberration of light, 315, 496

Abraham, 311, 490-1, 505-7, 509, 515-6

absolute motion, 107
orientation, __A_TAG_PLACEHOLDER_0__
space, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__, __A_TAG_PLACEHOLDER_2__, __A_TAG_PLACEHOLDER_3__, __A_TAG_PLACEHOLDER_4__

acceleration, 94, 98, 486, 509

accidental constant, 112
errors, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__

accommodation of the eyes, 67-8

action at a distance, 137

addition, 34

aim of mathematics, 280

alchemists, 11

Alfaro, 543

algebra, 379

analogy, 220

analysis, 218-9, 279

analysis situs, 53, 239, 381

analyst, 210, 221

ancestral experience, 91

Andrade, 93, 104, 228

Andrews, 153

angle sum of triangle, 58

Anglo-Saxons, 3

antinomies, 449, 457, 477

Arago, 540-1

Aristotle, 205, 292, 460

arithmetic, 34, 379, 441, 463

associativity, 35

assumptions, 451, 453

astronomy, 81, 289, 315, 512

Atwood, 446

axiom, 60, 63, 65, 215


Bacon, 128

Bartholi, 503

Bassot, 542

beauty, 349, 368

Becquerel, 312

Beltrami, 56, 58

Bergson, 321

Berkeley, 4

Berthaut, 542

Bertrand, 156, 190, 211, 395

Betti, 239

Biot, 540

bodies, solid, 72

Boltzmann, 304

Bolyai, 56, 201, 203

Borel, 482

Bouguer, 537, 542

Bourgeois, 543

Boutroux, 390, 464

Bradley, 496

Briot, 298

Brownian movement, 152, 410

Bucherer, 507

Burali-Forti, 457-9, 477, 481-2


Caen, 387-8

Calinon, 228

canal rays, 491-2

canals, semicircular, 276

Cantor, 11, 448-9, 457, 459, 477

Cantorism, 381, 382, 480, 484

capillarity, 298

Carlyle, 128

Carnot's principle, 143, 151, 300, 303-5, 399

Cassini, 537

cathode rays, 487-92

cells, 217

center of gravity, 103

central forces, 297

Chaldeans, 290

chance, 395, 408

change of position, 70
state, __A_TAG_PLACEHOLDER_0__

chemistry of the stars, 295

circle-squarers, 11

Clairaut, 537-8

Clausius, 119, 123, 143

color sensation, 252

Columbus, 228

commutativity, 35-6
[Pg 548]
compensation, 72

complete induction, 40

Comte, 294

Condorcet, 411

contingence, 340

continuity, 173

continuum, 43
formless, __A_TAG_PLACEHOLDER_0__
math, __A_TAG_PLACEHOLDER_0__
physical, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__
3D, __A_TAG_PLACEHOLDER_0__

convention, 50, 93, 106, 125, 173, 208, 317, 440, 451

convergence, 67-8

coordinates, 244

Copernicus, 109, 291, 354

Coulomb, 143, 516

Couturat, 450, 453, 456, 460, 462-3, 467, 472-6

creation, mathematical, 383

creed, 1

Crémieu, 168-9, 490

crisis, 303

Crookes, 195, 488, 527-8

crude fact, 326, 330

Curie, 312-3, 318

current, 186

curvature, 58-9

curve, 213, 346

curves without tangents, 51

cut, 52, 256

cyclones, 353


d'Alembert, 538

Darwin, 518-9

De Cyon, 276, 427

Dedekind, 44-5

Defforges, 542

definitions, 430, 453

deformation, 73, 415

Delage, 277

Delambre, 539

Delbeuf, 414

Descartes, 127

determinism, 123, 340

dictionary, 59

didymium, 333

dilatation, 76

dimensions, 53, 68, 78, 241, 256, 426

direction, 69

Dirichlet, 213

dispersion, 141

displacement, 73, 77, 247, 256

distance, 59, 292

distributivity, 36

Du Bois-Reymond, 50


earth, rotation of, 326, 353

eclipse, 326

electricity, 174

electrified bodies, 117

electrodynamic attraction, 308
induction, __A_TAG_PLACEHOLDER_0__
mass, __A_TAG_PLACEHOLDER_0__

electrodynamics, 184, 282

electromagnetic theory of light, 301

electrons, 316, 492-4, 505-8, 510, 512-4

elephant, 217, 436

ellipse, 215

Emerson, 203

empiricism, 86, 271

Epimenides, 478-9

equation of Laplace, 283

Erdély, 203

errors, accidental, 171, 402
law of, __A_TAG_PLACEHOLDER_0__
systematic, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__
theory of, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__

ether, 145, 351

ethics, 205

Euclid, 62, 86, 202-3, 213

Euclidean geometry, 65, 235-6, 337

Euclid's postulate, 83, 91, 124, 353, 443, 453, 468, 470-1

experience, 90-1

experiment, 127, 317, 336, 446


fact, crude, 326, 330
in the wild, __A_TAG_PLACEHOLDER_0__
scientific, __A_TAG_PLACEHOLDER_0__

facts, 362, 371

Fahrenheit, 238

Faraday, 150, 192

Faye, 536

Fechner, 46, 52

Fehr, 383

finite, 57
[Pg 549]
Fitzgerald, 415-6, 500-1, 505

Fizeau, 146, 149, 309, 498, 504

Flammarion, 400, 406-7

flattening of the earth, 353

force, 72, 98, 444
direction of, __A_TAG_PLACEHOLDER_0__
-flow, __A_TAG_PLACEHOLDER_0__

forces, central, 297
equivalence of, __A_TAG_PLACEHOLDER_0__
magnitude of, __A_TAG_PLACEHOLDER_0__

Foucault's pendulum, 85, 109, 353

four dimensions, 78

Fourier, 298-9

Fourier's problem, 317
series, __A_TAG_PLACEHOLDER_0__

Franklin, 513-4

Fresnel, 132, 140, 153, 174, 176, 181, 351, 498

Fuchsian, 387-8

function, 213
continuous, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__


Galileo, 97, 331, 353-4

gaseous pressure, 141

gases, theory of, 400, 405, 523

Gauss, 384-5, 406

Gay-Lussac, 157

generalize, 342

geodesy, 535

geometer, 83, 210, 438

geometric space, 66

geometry, 72, 81, 125, 207, 380, 428, 442, 467
Euclidean, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__
fourth, __A_TAG_PLACEHOLDER_0__
non-Euclidean, __A_TAG_PLACEHOLDER_0__
projective, __A_TAG_PLACEHOLDER_0__
qualitative, __A_TAG_PLACEHOLDER_0__
rational, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__
Riemann's, __A_TAG_PLACEHOLDER_0__
spherical, __A_TAG_PLACEHOLDER_0__

Gibbs, 304

Goldstein, 492

Gouy, 152, 305, 410

gravitation, 512

Greeks, 93, 368


Hadamard, 459

Halsted, 3, 203, 464, 467

Hamilton, 115

helium, 294

Helmholtz, 56, 115, 118, 141, 190, 196

Hercules, 449

Hermite, 211, 220, 222, 285

Herschel, 528

Hertz, 102, 145, 194-5, 427, 488, 498, 502, 504, 510

Hertzian oscillator, 309, 317

Hilbert, 5, 11, 203, 433, 450-1, 464-8, 471, 475-7, 484

Himstedt, 195

Hipparchus, 291

homogeneity, 74, 423

homogeneous, 67

hydrodynamics, 284

hyperbola, 215

hypotheses, 6, 15, 127, 133

hysteresis, 151


identity of spaces, 268
of two points, __A_TAG_PLACEHOLDER_0__

illusions, optical, 202

incommensurable numbers, 44

induction, complete, 40, 452-3, 467-8
electromagnetic, __A_TAG_PLACEHOLDER_0__
mathematical, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__
principle of __A_TAG_PLACEHOLDER_0__

inertia, 93, 486, 489, 507

infinite, 448

infinitesimals, 50

inquisitor, 331

integration, 139

interpolation, 131

intuition, 210, 213, 215

invariant, 333

Ionians, 127

ions, 152

irrational number, 44

irreversible phenomena, 151

isotropic, 67


Japanese mice, 277, 427

Jevons, 451

John Lackland, 128

Jules Verne, 111, 536

Jupiter, 131, 157, 231, 289


Kant, 16, 64, 202-3, 450-1, 471
[Pg 550]
Kauffman, 311, 490-1, 495, 506-7, 522, 545

Kazan, 203

Kelvin, 145, 523-4, 526-7

Kepler, 120, 133, 153, 282, 291-2

Kepler's laws, 136, 516

kinematics, 337

kinetic energy, 116
gas theory, __A_TAG_PLACEHOLDER_0__

Kirchhoff, 98-9, 103-5

Klein, 60, 211, 287

knowledge, 201

König, 144, 477

Kovalevski, 212, 286

Kronecker, 44


Lacombe, 543

La Condamine, 535, 537-8, 542-3

Lagrange, 98, 151, 179

Laisant, 383

Lallamand, 543

Langevin, 509

Laplace, 298, 398, 514-5, 518, 522, 538

Laplace's equation, 283, 287

Larmor, 145, 150

Lavoisier's principle, 301, 310, 312

law, 207, 291, 395

Leibnitz, 32, 450, 471

Le Roy, 28, 321-6, 332, 335, 337, 347-8, 354, 468

Lesage, 517-21

Liard, 440

Lie, 62-3, 212

light sensations, 252
theory of, __A_TAG_PLACEHOLDER_0__
speed of, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__

Lindemann, 508

line, 203, 243

linkages, 144

Lippmann, 196

Lobachevski, 29, 56, 60, 62, 83, 86, 203

Lobachevski's space, 239

local time, 306-7, 499

logic, 214, 435, 448, 460-2, 464

logistic, 457, 472-4

logisticians, 472

Lorentz, 147, 149, 196-7, 306, 308, 311, 315, 415-6, 492, 498-502, 504-9, 512, 514-6, 521

Lotze, 264

luck, 399

Lumen, 407-8


MacCullagh, 150

Mach, 375

Mach-Delage, 276

magnetism, 149

magnitude, 49

Mariotte's law, 120, 132, 157, 342, 524

Maros, 203

mass, 98, 312, 446, 486, 489, 494, 515

mathematical analysis, 218
continuum, __A_TAG_PLACEHOLDER_0__
creation, __A_TAG_PLACEHOLDER_0__
induction, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__
physics, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__, __A_TAG_PLACEHOLDER_2__

mathematics, 369, 448

matter, 492

Maupertuis, 535, 537-8

Maurain, 543

Maxwell, 140, 152, 175, 177, 181, 193, 282-3, 298, 301, 304-5, 351, 503, 524-5

Maxwell-Bartholi, 309, 503-4, 519, 521

Mayer, 119, 123, 300, 312, 318

measurement, 49

Méchain, 539-40

mechanical explanation, 177
mass, __A_TAG_PLACEHOLDER_0__

mechanics, 92, 444, 486, 496, 512
anthropomorphic, __A_TAG_PLACEHOLDER_0__
celestial, __A_TAG_PLACEHOLDER_0__
stats, __A_TAG_PLACEHOLDER_0__

Méray, 211

metaphysician, 221

meteorology, 398

mice, 277

Michelson, 306, 309, 311, 316, 498, 500-1

milky way, 523-30

Mill, Stuart, 60-1, 453-4

Monist, 4, 89, 464

moons of Jupiter, 233

Morley, 309

motion of liquids, 283
[Pg 551]of the moon, __A_TAG_PLACEHOLDER_0__
of planets, __A_TAG_PLACEHOLDER_0__
relative, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__
without distortion, __A_TAG_PLACEHOLDER_0__

multiplication, 36

muscular sensations, 69


Nagaoka, 317

nature, 127

navigation, 289

neodymium, 333

neomonics, 283

Neumann, 181

Newton, 85, 96, 98, 109, 153, 291, 370, 486, 516, 536, 538

Newton's argument, 108, 334, 343
law, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__, __A_TAG_PLACEHOLDER_2__, __A_TAG_PLACEHOLDER_3__, __A_TAG_PLACEHOLDER_4__, __A_TAG_PLACEHOLDER_5__, __A_TAG_PLACEHOLDER_6__, __A_TAG_PLACEHOLDER_7__, __A_TAG_PLACEHOLDER_8__, __A_TAG_PLACEHOLDER_9__, __A_TAG_PLACEHOLDER_10__, __A_TAG_PLACEHOLDER_11__, __A_TAG_PLACEHOLDER_12__
principle, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__, __A_TAG_PLACEHOLDER_2__, __A_TAG_PLACEHOLDER_3__

no-class theory, 478

nominalism, 28, 125, 321, 333, 335

non-Euclidean geometry, 55, 59, 388
language, __A_TAG_PLACEHOLDER_0__
space, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__, __A_TAG_PLACEHOLDER_2__
straight, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__
world, __A_TAG_PLACEHOLDER_0__

number, 31
big, __A_TAG_PLACEHOLDER_0__
imaginary, __A_TAG_PLACEHOLDER_0__
incomparable, __A_TAG_PLACEHOLDER_0__
transfinite, __A_TAG_PLACEHOLDER_0__
whole, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__


objectivity, 209, 347, 349, 408

optical illusions, 202

optics, 174, 496

orbit of Saturn, 341

order, 385

orientation, 83

osmotic, 141


Padoa, 463

Panthéon, 414

parallax, 470

parallels, 56, 443

Paris time, 233

parry, 419-22, 427

partition, 45

pasigraphy, 456-7

Pasteur, 128

Peano, 450, 456-9, 463, 472

Pender, 490

pendulum, 224

Perrier, 541

Perrin, 195

phosphorus, 333, 468, 470-1

physical continuum, 46

physics, 127, 140, 144, 279, 297

physics of central forces, 297
of the principles, __A_TAG_PLACEHOLDER_0__

Pieri, 11, 203

Plato, 292

Poincaré, 473

point, 89, 244

Poncelet, 215

postulates, 382

potential energy, 116

praseodymium, 333

principle, 125, 299
Carnot's, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__, __A_TAG_PLACEHOLDER_2__, __A_TAG_PLACEHOLDER_3__, __A_TAG_PLACEHOLDER_4__
Clausius', __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__, __A_TAG_PLACEHOLDER_2__
Hamilton's, __A_TAG_PLACEHOLDER_0__
Lavoisier's, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__
Mayer's, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__, __A_TAG_PLACEHOLDER_2__, __A_TAG_PLACEHOLDER_3__, __A_TAG_PLACEHOLDER_4__, __A_TAG_PLACEHOLDER_5__
Newton's, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__, __A_TAG_PLACEHOLDER_2__, __A_TAG_PLACEHOLDER_3__
of action and reaction, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__, __A_TAG_PLACEHOLDER_2__
of energy conservation, __A_TAG_PLACEHOLDER_0__
of energy degradation, __A_TAG_PLACEHOLDER_0__
of inertia, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__, __A_TAG_PLACEHOLDER_2__
of least action, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__
of relativity, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__, __A_TAG_PLACEHOLDER_2__, __A_TAG_PLACEHOLDER_3__

Prony, 445

psychologist, 383

Ptolemy, 110, 291, 353-4

Pythagoras, 292


quadrature of the circle, 161

qualitative geometry, 238
space, __A_TAG_PLACEHOLDER_0__
time, __A_TAG_PLACEHOLDER_0__

quaternions, 282
[Pg 552]

radiometer, 503

radium, 312, 318, 486-7

Rados, 201

Ramsay, 313

rational geometry, 5, 467

reaction, 502

reality, 217, 340, 349

Réaumur, 238

recurrence, 37

Regnault, 170

relativity, 83, 305, 417, 423, 498, 505

Richard, 477-8, 480-1

Riemann, 56, 62, 145, 212, 239, 243, 381, 432
surface, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__

Roemer, 233

Röntgen, 511, 520

rotation of earth, 225, 331, 353

roulette, 403

Rowland, 194-7, 305, 489

Royce, 202

Russell, 201, 450, 460-2, 464-7, 471-4, 477-82, 484-5


St. Louis exposition, 208, 320

Sarcey, 442

Saturn, 231, 317

Schiller, 202

Schliemann, 19

science, 205, 321, 323, 340, 354

Science and Hypothesis, 205-7, 220, 240, 246-7, 319, 353, 452

semicircular canals, 276

series, development in, 287
Fourier's, __A_TAG_PLACEHOLDER_0__

Sirius, 226, 229

solid bodies, 72

space, 55, 66, 89, 235, 256
absolute, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__
amorphous, __A_TAG_PLACEHOLDER_0__
Bolyai, __A_TAG_PLACEHOLDER_0__
Euclidean geometry, __A_TAG_PLACEHOLDER_0__
geometric, __A_TAG_PLACEHOLDER_0__
Lobachevski's, __A_TAG_PLACEHOLDER_0__
motor, __A_TAG_PLACEHOLDER_0__
non-Euclidean, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__, __A_TAG_PLACEHOLDER_2__
of four dimensions, __A_TAG_PLACEHOLDER_0__
perceptual, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__
tactile, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__
visual, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__

spectra, 316

spectroscope, 294

Spencer, 9

sponge, 219

Stallo, 10

stars, 292

statistical mechanics, 304

straight, 62, 82, 236, 433, 450, 470

Stratonoff, 531

surfaces, 58

systematic errors, 171


tactile space, 68, 264

Tait, 98

tangent, 51

Tannery, 43

teaching, 430, 437

thermodynamics, 115, 119

Thomson, 98, 488

thread, 104

time, 223
equality, __A_TAG_PLACEHOLDER_0__
local, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__
measure of, __A_TAG_PLACEHOLDER_0__

Tisserand, 515-6

Tolstoi, 354, 362, 368

Tommasina, 519

Transylvania, 203

triangle, 58
angle sum of, __A_TAG_PLACEHOLDER_0__

truth, 205

Tycho Brahe, 133, 153, 228


unity of nature, 130

universal invariant, 333

Uriel, 203


van der Waals, 153

Vauban, 210

Veblen, 203

velocity of light, 232, 312

Venus of Milo, 201

verification, 33

Virchow, 21
[Pg 553]
visual impressions, 252
space, __A_TAG_PLACEHOLDER_0__, __A_TAG_PLACEHOLDER_1__

Volga, 203

Voltaire, 537-8


Weber, 117, 515-6

Weierstrass, 11, 212, 432

Whitehead, 472, 481-2

whole numbers, 44

Wiechert, 145, 488


x-rays, 152, 511, 520


Zeeman effect, 152, 196, 317, 494

Zeno, 382

Zermelo, 477, 482-3

zigzag theory, 478

zodiac, 398, 404


FOOTNOTES

[1] See Le Roy, 'Science et Philosophie,' Revue de Métaphysique et de Morale, 1901.

[1] See Le Roy, 'Science and Philosophy,' Review of Metaphysics and Morality, 1901.

[2] With those contained in the special conventions which serve to define addition and of which we shall speak later.

[2] Along with those included in the specific agreements that help clarify addition and that we will discuss later.

[3] Revue de Métaphysique et de Morale, t. VI., pp. 1-13 (January, 1898).

[3] Review of Metaphysics and Morality, Vol. VI, pp. 1-13 (January, 1898).

[4] The following lines are a partial reproduction of the preface of my book Thermodynamique.

[4] The following lines are a partial reproduction of the preface of my book Thermodynamics.

[5] This chapter is a partial reproduction of the prefaces of two of my works: Théorie mathématique de la lumière (Paris, Naud, 1889), and Électricité et optique (Paris, Naud, 1901).

[5] This chapter is a partial reproduction of the prefaces of two of my works: Mathematical Theory of Light (Paris, Naud, 1889), and Electricity and Optics (Paris, Naud, 1901).

[6] We add that U will depend only on the parameters q, that T will depend on the parameters q and their derivatives with respect to the time and will be a homogeneous polynomial of the second degree with respect to these derivatives.

[6] We add that U will rely solely on the parameters q, while T will depend on the parameters q and their derivatives in relation to time, and will be a homogeneous polynomial of the second degree concerning these derivatives.

[7] Etude sur les diverses grandeurs, Paris, Gauthier-Villars, 1897.

[7] Study on Various Sizes, Paris, Gauthier-Villars, 1897.

[8] In place of saying that we refer space to axes rigidly bound to our body, perhaps it would be better to say, in conformity to what precedes, that we refer it to axes rigidly bound to the initial situation of our body.

[8] Instead of saying that we relate space to axes firmly attached to our body, it might be more accurate to say, consistent with what was mentioned earlier, that we relate it to axes firmly attached to our body's original position.

[9] Because bodies would oppose an increasing inertia to the causes which would tend to accelerate their motion; and this inertia would become infinite when one approached the velocity of light.

[9] Because objects would fight against any increase in their motion due to inertia; and this inertia would become infinite as one got closer to the speed of light.

[10] These considerations on mathematical physics are borrowed from my St. Louis address.

[10] These thoughts on mathematical physics come from my St. Louis speech.

[11] I here use the word real as a synonym of objective; I thus conform to common usage; perhaps I am wrong, our dreams are real, but they are not objective.

[11] I'm using the word real to mean objective; this aligns with common usage. Maybe I'm mistaken, since our dreams are real, but they aren't objective.

[12] See Science and Hypothesis, chapter I.

__A_TAG_PLACEHOLDER_0__ See Science and Hypothesis, chapter 1.

[13] 'The Foundations of Logic and Arithmetic,' Monist, XV., 338-352.

[13] 'The Foundations of Logic and Arithmetic,' Monist, XV., 338-352.

[14] Second ed., 1907, p. 86; French ed., 1911, p. 97. G. B. H.

[14] 2nd ed., 1907, p. 86; French ed., 1911, p. 97. G. B. H.

[15] Revue générale des sciences, June 30, 1905.

[15] General Review of the Sciences, June 30, 1905.

[16] In his article 'Le classi finite,' Atti di Torino, Vol. XXXII.

[16] In his article 'Finite Classes,' Proceedings of Turin, Vol. XXXII.

[17] At the moment of going to press we learn that M. Bucherer has repeated the experiment, taking new precautions, and that he has obtained, contrary to Kaufmann, results confirming the views of Lorentz.

[17] As we go to print, we find out that M. Bucherer has conducted the experiment again, taking new precautions, and he has obtained results that, unlike Kaufmann's, support Lorentz's views.




        
        
    
Download ePUB

If you like this ebook, consider a donation!