Science and Hypothesis/PART IV
| ←PART III | Science and Hypothesis by Part IV: Nature |
NATURE.
Contents |
CHAPTER IX.
HYPOTHESES IN PHYSICS.
The Rôle of Experiment and Generalisation. — Experiment is the sole source of truth. It alone can teach us something new; it alone can give us certainty. These are two points that cannot be questioned. But then, if experiment is every thing, what place is left for mathematical physics? What can experimental physics do with such an auxiliary — an auxiliary, moreover, which seems useless, and even may be dangerous?
However, mathematical physics exists. It has rendered undeniable service, and that is a fact which has to be explained. It is not sufficient merely to observe; we must use our observations, and for that purpose we must generalise. This is what has always been done, only as the recollection of past errors has made man more and more circumspect, he has observed more and more and generalised less and less. Every age has scoffed at its predecessor, accusing it of having generalised too boldly and too naïvely. Descartes used to commiserate the Ionians. Descartes in his turn makes us smile, and no doubt some day our children will laugh at us. Is there no way of getting at once to the gist of the matter, and thereby escaping the raillery which we foresee? Cannot we be content with experiment alone? No, that is impossible; that would be a complete misunderstanding of the true character of science. The man of science must work with method. Science is built up of facts, as a house is built of stones; but an accumulation of facts is no more a science than a heap of stones is a house. Most important of all, the man of science must exhibit foresight. Carlyle has written somewhere some thing after this fashion. "Nothing but facts are of importance. John Lackland passed by here. Here is something that is admirable. Here is a reality for which I would give all the theories in the world."[1] Carlyle was a compatriot of Bacon, and, like him, he wished to proclaim his worship of the God of Things as they are.
But Bacon would not have said that. That is the language of the historian. The physicist would most likely have said: "John Lackland passed by here. It is all the same to me, for he will not pass this way again."
We all know that there are good and bad experiments. The latter accumulate in vain. Whether there are a hundred or a thousand, one single piece of work — by a real master by a Pasteur, for example — will be sufficient to sweep them into oblivion. Bacon would have thoroughly understood that, for he invented the phrase experimentum crucis; but Carlyle would not have under stood it. A fact is a fact. A student has read such and such a number on his thermometer. He has taken no precautions. It does not matter; he has read it, and if it is only the fact which counts, this is a reality that is as much entitled to be called a reality as the peregrinations of King John Lackland. What, then, is a good experiment? It is that which teaches us something more than an isolated fact. It is that which enables us to predict, and to generalise. Without generalisation, prediction is impossible. The circumstances under which one has operated will never again be reproduced simultaneously. The fact observed will never be repeated. All that can be affirmed is that under analogous circumstances an analogous fact will be produced. To predict it, we must therefore invoke the aid of analogy — that is to say, even at this stage, we must generalise. However timid we may be, there must be interpolation. Experiment only gives us a certain number of isolated points. They must be connected by a continuous line, and this is a true generalisation. But more is done. The curve thus traced will pass between and near the points observed; it will not pass through the points themselves. Thus we are not restricted to generalising our experiment, we correct it; and the physicist who would abstain from these corrections, and really content himself with experiment pure and simple, would be compelled to enunciate very extra ordinary laws indeed. Detached facts cannot therefore satisfy us, and that is why our science must be ordered, or, better still, generalised.
It is often said that experiments should be made without preconceived ideas. That is impossible. Not only would it make every experiment fruitless, but even if we wished to do so, it could not be done. Every man has his own conception of the world, and this he cannot so easily lay aside. We must, for example, use language, and our language is necessarily steeped in preconceived ideas. Only they are unconscious preconceived ideas, which are a thousand times the most dangerous of all. Shall we say, that if we cause others to intervene of which we are fully conscious, that we shall only aggravate the evil? I do not think so. I am inclined to think that they will serve as ample counterpoises — I was almost going to say antidotes. They will generally disagree, they will enter into conflict one with another, and ipso facto, they will force us to look at things under different aspects. This is enough to free us. He is no longer a slave who can choose his master.
Thus, by generalisation, every fact observed enables us to predict a large number of others; only, we ought not to forget that the first alone is certain, and that all the others are merely probable. However solidly founded a prediction may appear to us, we are never absolutely sure that experiment will not prove it to be baseless if we set to work to verify it. But the probability of its accuracy is often so great that practically we may be content with it. It is far better to predict without certainty, than never to have predicted at all. We should never, therefore, disdain to verify when the opportunity presents itself. But every experiment is long and difficult, and the labourers are few, and the number of facts which we require to predict is enormous; and besides this mass, the number of direct verifications that we can make will never be more than a negligible quantity. Of this little that we can directly attain we must choose the best. Every experiment must enable us to make a maximum number of predictions having the highest possible degree of probability. The problem is, so to speak, to increase the output of the scientific machine. I may be permitted to compare science to a library which must go on increasing indefinitely; the librarian has limited funds for his purchases, and he must, therefore, strain every nerve not to waste them. Experimental physics has to make the purchases, and experimental physics alone can enrich the library. As for mathematical physics, her duty is to draw up the catalogue. If the catalogue is well done the library is none the richer for it; but the reader will be enabled to utilise its riches; and also by showing the librarian the gaps in his collection, it will help him to make a judicious use of his funds, which is all the more important, inasmuch as those funds are entirely inadequate. That is the rôle of mathematical physics. It must direct generalisation, so as to increase what I called just now the output of science. By what means it does this, and how it may do it without danger, is what we have now to examine.
The Unity of Nature. — Let us first of all observe that every generalisation supposes in a certain measure a belief in the unity and simplicity of Nature. As far as the unity is concerned, there can be no difficulty. If the different parts of the universe were not as the organs of the same body, they would not re-act one upon the other; they would mutually ignore each other, and we in particular should only know one part. We need not, therefore, ask if Nature is one, but how she is one.
As for the second point, that is not so clear. It is not certain that Nature is simple. Can we without danger act as if she were?
There was a time when the simplicity of Mariotte's law was an argument in favour of its accuracy: when Fresnel himself, after having said in a conversation with Laplace that Nature cares naught for analytical difficulties, was compelled to explain his words so as not to give offence to current opinion. Nowadays, ideas have changed considerably; but those who do not believe that natural laws must be simple, are still often obliged to act as if they did believe it. They cannot entirely dispense with this necessity without making all generalisation, and therefore all science, impossible. It is clear that any fact can be generalised in an infinite number of ways, and it is a question of choice. The choice can only be guided by considerations of simplicity. Let us take the most ordinary case, that of interpolation. We draw a continuous line as regularly as possible between the points given by observation. Why do we avoid angular points and inflexions that are too sharp? Why do we not make our curve describe the most capricious zigzags? It is because we know beforehand, or think we know, that the law we have to express cannot be so complicated as all that. The mass of Jupiter may be deduced either from the movements of his satellites, or from the perturbations of the major planets, or from those of the minor planets. If we take the mean of the determinations obtained by these three methods, we find three numbers very close together, but not quite identical. This result might be interpreted by supposing that the gravitation constant is not the same in the three cases; the observations would be certainly much better represented. Why do we reject this interpretation? Not because it is absurd, but because it is uselessly complicated. We shall only accept it when we are forced to, and it is not imposed upon us yet. To sum up, in most cases every law is held to be simple until the contrary is proved.
This custom is imposed upon physicists by the reasons that I have indicated, but how can it be justified in the presence of discoveries which daily show us fresh details, richer and more complex? How can we even reconcile it with the unity of nature? For if all things are interdependent, the relations in which so many different objects intervene can no longer be simple.
If we study the history of science we see produced two phenomena which are, so to speak, each the inverse of the other. Sometimes it is simplicity which is hidden under what is apparently complex; sometimes, on the contrary, it is simplicity which is apparent, and which conceals extremely complex realities. What is there more complicated than the disturbed motions of the planets, and what more simple than Newton's law? There, as Fresnel said, Nature playing with analytical difficulties, only uses simple means, and creates by their combination I know not what tangled skein. Here it is the hidden simplicity which must be disentangled. Examples to the contrary abound. In the kinetic theory of gases, molecules of tremendous velocity are discussed, whose paths, deformed by incessant impacts, have the most capricious shapes, and plough their way through space in every direction. The result observable is Mariotte's simple law. Each individual fact was complicated. The law of great numbers has re-established simplicity in the mean. Here the simplicity is only apparent, and the coarseness of our senses alone prevents us from seeing the complexity.
Many phenomena obey a law of proportionality. But why? Because in these phenomena there is something which is very small. The simple law observed is only the translation of the general analytical rule by which the infinitely small increment of a function is proportional to the increment of the variable. As in reality our increments are not infinitely small, but only very small, the law of proportionality is only approximate, and simplicity is only apparent. What I have just said applies to the law of the superposition of small movements, which is so fruitful in its applications and which is the foundation of optics.
And Newton's law itself? Its simplicity, so long undetected, is perhaps only apparent. Who knows if it be not due to some complicated mechanism, to the impact of some subtle matter animated by irregular movements, and if it has not become simple merely through the play of averages and large numbers? In any case, it is difficult not to suppose that the true law contains complementary terms which may become sensible at small distances. If in astronomy they are negligible, and if the law thus regains its simplicity, it is solely on account of the enormous distances of the celestial bodies. No doubt, if our means of investigation became more and more penetrating, we should discover the simple beneath the complex, and then the complex from the simple, and then again the simple beneath the complex, and so on, without ever being able to predict what the last term will be. We must stop somewhere, and for science to be possible we must stop where we have found simplicity. That is the only ground on which we can erect the edifice of our generalisations. But, this simplicity being only apparent, will the ground be solid enough? That is what we have now to discover.
For this purpose let us see what part is played in our generalisations by the belief in simplicity. We have verified a simple law in a considerable number of particular cases. We refuse to admit that this coincidence, so often repeated, is a result of mere chance, and we conclude that the law must be true in the general case.
Kepler remarks that the positions of a planet observed by Tycho are all on the same ellipse. Not for one moment does he think that, by a singular freak of chance, Tycho had never looked at the heavens except at the very moment when the path of the planet happened to cut that ellipse. What does it matter then if the simplicity be real or if it hide a complex truth? Whether it be due to the influence of great numbers which reduces individual differences to a level, or to the greatness or the smallness of certain quantities which allow of certain terms to be neglected — in no case is it due to chance. This simplicity, real or apparent, has always a cause. We shall therefore always be able to reason in the same fashion, and if a simple law has been observed in several particular cases, we may legitimately suppose that it still will be true in analogous cases. To refuse to admit this would be to attribute an in admissible rôle to chance. However, there is a difference. If the simplicity were real and pro found it would bear the test of the increasing precision of our methods of measurement. If, then, we believe Nature to be profoundly simple, we must conclude that it is an approximate and not a rigorous simplicity. This is what was formerly done, but it is what we have no longer the right to do. The simplicity of Kepler's laws, for instance, is only apparent; but that does not prevent them from being applied to almost all systems analogous to the solar system, though that prevents them from being rigorously exact.
Rôle of Hypothesis. — Every generalisation is a hypothesis. Hypothesis therefore plays a necessary role, which no one has ever contested. Only, it should always be as soon as possible submitted to verification. It goes without saying that, if it cannot stand this test, it must be abandoned without any hesitation. This is, indeed, what is generally done; but sometimes with a certain impatience. Ah well! this impatience is not justified. The physicist who has just given up one of his hypotheses should, on the contrary, rejoice, for he found an unexpected opportunity of discovery. His hypothesis, I imagine, had not been lightly adopted, It took into account all the known factors which seem capable of intervention in the phenomenon. If it is not verified, it is because there is something unexpected and extra ordinary about it, because we are on the point of finding something unknown and new. Has the hypothesis thus rejected been sterile? Far from it. It may be even said that it has rendered more service than a true hypothesis. Not only has it been the occasion of a decisive experiment, but if this experiment had been made by chance, without the hypothesis, no conclusion could have been drawn; nothing extraordinary would have been seen; and only one fact the more would have been catalogued, without deducing from it the remotest consequence.
Now, under what conditions is the use of hypothesis without danger? The proposal to submit all to experiment is not sufficient. Some hypotheses are dangerous, — first and foremost those which are tacit and unconscious. And since we make them without knowing them, we cannot get rid of them. Here again, there is a service that mathematical physics may render us. By the precision which is its characteristic, we are compelled to formulate all the hypotheses that we would unhesitatingly make without its aid. Let us also notice that it is important not to multiply hypotheses indefinitely. If we construct a theory based upon multiple hypo theses, and if experiment condemns it, which of the premisses must be changed? It is impossible to tell. Conversely, if the experiment succeeds, must we suppose that it has verified all these hypotheses at once? Can several unknowns be determined from a single equation?
We must also take care to distinguish between the different kinds of hypotheses. First of all, there are those which are quite natural and necessary. It is difficult not to suppose that the influence of very distant bodies is quite negligible, that small movements obey a linear law, and that effect is a continuous function of its cause. I will say as much for the conditions imposed by symmetry. All these hypotheses affirm, so to speak, the common basis of all the theories of mathematical physics. They are the last that should be abandoned. There is a second category of hypotheses which I shall qualify as indifferent. In most questions the analyst assumes, at the beginning of his calculations, either that matter is continuous, or the reverse, that it is formed of atoms. In either case, his results would have been the same. On the atomic supposition he has a little more difficulty in obtaining them — that is all. If, then, experiment confirms his conclusions, will he suppose that he has proved, for example, the real existence of atoms?
In optical theories two vectors are introduced, one of which we consider as a velocity and the other as a vortex. This again is an indifferent hypothesis, since we should have arrived at the same conclusions by assuming the former to be a vortex and the latter to be a velocity. The success of the experiment cannot prove, therefore, that the first vector is really a velocity. It only proves one thing — namely, that it is a vector; and that is the only hypothesis that has really been introduced into the premisses. To give it the concrete appearance that the fallibility of our minds demands, it was necessary to consider it either as a velocity or as a vortex. In the same way, it was necessary to represent it by an x or a y, but the result will not prove that we were right or wrong in regarding it as a velocity; nor will it prove we are right or wrong in calling it x and not y.
These indifferent hypotheses are never dangerous provided their characters are not misunderstood. They may be useful, either as artifices for calculation, or to assist our understanding by concrete images, to fix the ideas, as we say. They need not therefore be rejected. The hypotheses of the third category are real generalisations. They must be confirmed or invalidated by experiment. Whether verified or condemned, they will always be fruitful; but, for the reasons I have given, they will only be so if they are not too numerous.
Origin of Mathematical Physics. — Let us go further and study more closely the conditions which have assisted the development of mathematical physics. We recognise at the outset that the efforts of men of science have always tended to resolve the complex phenomenon given directly by experiment into a very large number of elementary phenomena, and that in three different ways.
First, with respect to time. Instead of embracing in its entirety the progressive development of a phenomenon, we simply try to connect each moment with the one immediately preceding. We admit that the present state of the world only depends on the immediate past, without being directly influenced, so to speak, by the recollection of a more distant past. Thanks to this postulate, instead of studying directly the whole succession of phenomena, we may confine ourselves to writing down its differential equation; for the laws of Kepler we substitute the law of Newton.
Next, we try to decompose the phenomena in space. What experiment gives us is a confused aggregate of facts spread over a scene of consider able extent. We must try to deduce the elementary phenomenon, which will still be localised in a very small region of space.
A few examples perhaps will make my meaning clearer. If we wished to study in all its complexity the distribution of temperature in a cooling solid, we could never do so. This is simply be cause, if we only reflect that a point in the solid can directly impart some of its heat to a neighbouring point, it will immediately impart that heat only to the nearest points, and it is but gradually that the flow of heat will reach other portions of the solid. The elementary phenomenon is the interchange of heat between two contiguous points. It is strictly localised and relatively simple if, as is natural, we admit that it is not influenced by the temperature of the molecules whose distance apart is small.
I bend a rod: it takes a very complicated form, the direct investigation of which would be impossible. But I can attack the problem, however, if I notice that its flexure is only the resultant of the deformations of the very small elements of the rod, and that the deformation of each of these elements only depends on the forces which are directly applied to it, and not in the least on those which may be acting on the other elements.
In all these examples, which may be increased without difficulty, it is admitted that there is no action at a distance or at great distances. That is an hypothesis. It is not always true, as the law of gravitation proves. It must therefore be verified. If it is confirmed, even approximately, it is valuable, for it helps us to use mathematical physics, at any rate by successive approximations. If it does not stand the test, we must seek something else that is analogous, for there are other means of arriving at the elementary phenomenon. If several bodies act simultaneously, it may happen that their actions are independent, and may be added one to the other, either as vectors or as scalar quantities. The elementary phenomenon is then the action of an isolated body. Or suppose, again, it is a question of small movements, or more generally of small variations which obey the well- known law of mutual or relative independence. The movement observed will then be decomposed into simple movements — for example, sound into its harmonics, and white light into its monochromatic components. When we have discovered in which direction to seek for the elementary phenomena, by what means may we reach it? First, it will often happen that in order to predict it, or rather in order to predict what is useful to us, it will not be necessary to know its mechanism. The law of great numbers will suffice. Take for example the propagation of heat. Each molecule radiates towards its neighbour — we need not inquire according to what law; and if we make any supposition in this respect, it will be an indifferent hypothesis, and therefore useless and unverifiable. In fact, by the action of averages and thanks to the symmetry of the medium, all differences are levelled, and, whatever the hypothesis may be, the result is always the same.
The same feature is presented in the theory of elasticity, and in that of capillarity. The neighbouring molecules attract and repel each other, we need not inquire by what law. It is enough for us that this attraction is sensible at small distances only, and that the molecules are very numerous, that the medium is symmetrical, and we have only to let the law of great numbers come into play.
Here again the simplicity of the elementary phenomenon is hidden beneath the complexity of the observable resultant phenomenon; but in its turn this simplicity was only apparent and disguised a very complex mechanism. Evidently the best means of reaching the elementary phenomenon would be experiment. It would be necessary by experimental artifices to dissociate the complex system which nature offers for our investigations and carefully to study the elements as dissociated as possible; for example, natural white light would be decomposed into monochromatic lights by the aid of the prism, and into polarised lights by the aid of the polariser. Unfortunately, that is neither always possible nor always sufficient, and sometimes the mind must run ahead of experiment. I shall only give one example which has always struck me rather forcibly. If I de compose white light, I shall be able to isolate a portion of the spectrum, but however small it may be, it will always be a certain width. In the same way the natural lights which are called monochromatic give us a very fine array, but a y which is not, however, infinitely fine. It might be supposed that in the experimental study of the properties of these natural lights, by operating with finer and finer rays, and passing on at last to the limit, so to speak, we should eventually obtain the properties of a rigorously monochromatic light. That would not be accurate. I assume that two rays emanate from the same source, that they are first polarised in planes at right angles, that they are then brought back again to the same plane of polarisation, and that we try to obtain interference. If the light were rigorously monochromatic, there would be interference; but with our nearly monochromatic lights, there will be no interference, and that, however narrow the ray may be. For it to be otherwise, the ray would have to be several million times finer than the finest known rays.
Here then we should be led astray by proceeding to the limit. The mind has to run ahead of the experiment, and if it has done so with success, it is because it has allowed itself to be guided by the instinct of simplicity. The knowledge of the elementary fact enables us to state the problem in the form of an equation. It only remains to deduce from it by combination the observable and verifiable complex fact. That is what we call integration, and it is the province of the mathematician. It might be asked, why in physical science generalisation so readily takes the mathematical form. The reason is now easy to see. It is not only because we have to express numerical laws; it is because the observable phenomenon is due to the superposition of a large number of elementary phenomena which are all similar to each other; and in this way differential equations are quite naturally introduced. It is not enough that each elementary phenomenon should obey simple laws: all those that we have to combine must obey the same law; then only is the intervention of mathematics of any use. Mathematics teaches us, in fact, to combine like with like. Its object is to divine the result of a combination without having to reconstruct that combination element by element. If we have to repeat the same operation several times, mathematics enables us to avoid this repetition by telling the result beforehand by a kind of induction. This I have explained before in the chapter on mathematical reasoning. But for that purpose all these operations must be similar; in the contrary case we must evidently make up our minds to working them out in full one after the other, and mathematics will be useless. It is therefore, thanks to the approximate homogeneity of the matter studied by physicists, that mathematical physics came into existence. In the natural sciences the following conditions are no longer to be found: — homogeneity, relative independence of remote parts, simplicity of the elementary fact; and that is why the student of natural science is compelled to have recourse to other modes of generalisation.
THE THEORIES OF MODERN PHYSICS.
Significance of Physical Theories. — The ephemeral nature of scientific theories takes by surprise the man of the world. Their brief period of prosperity ended, he sees them abandoned one after another; he sees ruins piled upon ruins; he predicts that the theories in fashion to-day will in a short time succumb in their turn, and he concludes that they are absolutely in vain. This is what he calls the bankruptcy of science.
His scepticism is superficial; he does not take into account the object of scientific theories and the part they play, or he would understand that the ruins may be still good for something. No theory seemed established on firmer ground than Fresnel's, which attributed light to the movements of the ether. Then if Maxwell's theory is to-day preferred, does that mean that Fresnel's work was in vain? No; for Fresnel's object was not to know whether there really is an ether, if it is or is not formed of atoms, if these atoms really move in this way or that; his object was to predict optical phenomena.
This Fresnel's theory enables us to do to-day as well as it did before Maxwell's time. The differential equations are always true, they may be always integrated by the same methods, and the results of this integration still preserve their value. It cannot be said that this is reducing physical theories to simple practical recipes; these equations express relations, and if the equations remain true, it is because the relations preserve their reality. They teach us now, as they did then, that there is such and such a relation between this thing and that; only, the something which we then called motion, we now call electric current. But these are merely names of the images we substituted for the real objects which Nature will hide for ever from our eyes. The true relations between these real objects are the only reality we can attain, and the sole condition is that the same relations shall exist between these objects as between the images we are forced to put in their place. If the relations are known to us, what does it matter if we think it convenient to replace one image by another?
That a given periodic phenomenon (an electric oscillation, for instance) is really due to the vibration of a given atom, which, behaving like a pendulum, is really displaced in this manner or that, all this is neither certain nor essential. But that there is between the electric oscillation, the movement of the pendulum, and all periodic phenomena an intimate relationship which corresponds to a profound reality; that this relationship, this similarity, or rather this parallelism, is continued in the details; that it is a consequence of more general principles such as that of the conservation of energy, and that of least action; this we may affirm; this is the truth which will ever remain the same in whatever garb we may see fit to clothe it.
Many theories of dispersion have been proposed. The first were imperfect, and contained but little truth. Then came that of Helmholtz, and this in its turn was modified in different ways; its author himself conceived another theory, founded on Maxwell's principles. But the remarkable thing is, that all the scientists who followed Helmholtz obtain the same equations, although their starting-points were to all appearance widely separated. I venture to say that these theories are all simultaneously true; not merely because they express a true relation — that between absorption and abnormal dispersion. In the premisses of these theories the part that is true is the part common to all: it is the affirmation of this or that relation between certain things, which some call by one name and some by another.
The kinetic theory of gases has given rise to many objections, to which it would be difficult to find an answer were it claimed that the theory is absolutely true. But all these objections do not alter the fact that it has been useful, particularly in revealing to us one true relation which would otherwise have remained profoundly hidden — the relation between gaseous and osmotic pressures. In this sense, then, it may be said to be true.
When a physicist finds a contradiction between two theories which are equally dear to him, he sometimes says: "Let us not be troubled, but let us hold fast to the two ends of the chain, lest we lose the intermediate links." This argument of the embarrassed theologian would be ridiculous if we were to attribute to physical theories the interpretation given them by the man of the world. In case of contradiction one of them at least should be considered false. But this is no longer the case if we only seek in them what should be sought. It is quite possible that they both express true relations, and that the contradictions only exist in the images we have formed to ourselves of reality. To those who feel that we are going too far in our limitations of the domain accessible to the scientist, I reply: These questions which we forbid you to investigate, and which you so regret, are not only insoluble, they are illusory and devoid of meaning.
Such a philosopher claims that all physics can be explained by the mutual impact of atoms. If he simply means that the same relations obtain between physical phenomena as between the mutual impact of a large number of billiard balls — well and good! this is verifiable, and perhaps is true. But he means something more, and we think we understand him, because we think we know what an impact is. Why? Simply because we have often watched a game of billiards. Are we to understand that God experiences the same sensations in the contemplation of His work that we do in watching a game of billiards? If it is not our intention to give his assertion this fantastic meaning, and if we do not wish to give it the more restricted meaning I have already mentioned, which is the sound meaning, then it has no meaning at all. Hypotheses of this kind have therefore only a metaphorical sense. The scientist should no more banish them than a poet banishes metaphor; but he ought to know what they are worth. They may be useful to give satisfaction to the mind, and they will do no harm as long as they are only indifferent hypotheses.
These considerations explain to us why certain theories, that were thought to be abandoned and definitively condemned by experiment, are suddenly revived from their ashes and begin a new life. It is because they expressed true relations, and had not ceased to do so when for some reason or other we felt it necessary to enunciate the same relations in another language. Their life had been latent, as it were.
Barely fifteen years ago, was there anything more ridiculous, more quaintly old-fashioned, than the fluids of Coulomb? And yet, here they are re-appearing under the name of electrons. In what do these permanently electrified molecules differ from the electric molecules of Coulomb? It is true that in the electrons the electricity is supported by a little, a very little matter; in other words, they have mass. Yet Coulomb did not deny mass to his fluids, or if he did, it was with reluctance. It would be rash to affirm that the belief in electrons will not also undergo an eclipse, but it was none the less curious to note this unexpected renaissance.
But the most striking example is Carnot's principle. Carnot established it, starting from false hypotheses. When it was found that heat was indestructible, and may be converted into work, his ideas were completely abandoned; later, Clausius returned to them, and to him is due their definitive triumph. In its primitive form, Carnot's theory expressed in addition to true relations, other inexact relations, the débris of old ideas; but the presence of the latter did not alter the reality of the others. Clausius had only to separate them, just as one lops off dead branches.
The result was the second fundamental law of thermodynamics. The relations were always the same, although they did not hold, at least to all appearance, between the same objects. This was sufficient for the principle to retain its value. Nor have the reasonings of Carnot perished on this account; they were applied to an imperfect conception of matter, but their form — i.e., the essential part of them, remained correct. What I have just said throws some light at the same time on the rôle of general principles, such as those of the principle of least action or of the conservation of energy. These principles are of very great value. They were obtained in the search for what there was in common in the enunciation of numerous physical laws; they thus represent the quintessence of innumerable observations. However, from their very generality results a consequence to which I have called attention in Chapter VIII. — namely, that they are no longer capable of verification. As we cannot give a general definition of energy, the principle of the conservation of energy simply signifies that there is a something which remains constant. Whatever fresh notions of the world may be given us by future experiments, we are certain beforehand that there is something which remains constant, and which may be called energy. Does this mean that the principle has no meaning and vanishes into a tautology? Not at all. It means that the different things to which we give the name of energy are connected by a true relationship; it affirms between them a real relation. But then, if this principle has a meaning, it may be false; it may be that we have no right to extend indefinitely its applications, and yet it is certain beforehand to be verified in the strict sense of the word. How, then, shall we know when it has been extended as far as is legitimate? Simply when it ceases to be useful to us — i.e., when we can no longer use it to predict correctly new phenomena. We shall be certain in such a case that the relation affirmed is no longer real, for otherwise it would be fruitful; experiment without directly contradicting a new extension of the principle will nevertheless have condemned it.
Physics and Mechanism. — Most theorists have a constant predilection for explanations borrowed from physics, mechanics, or dynamics. Some would be satisfied if they could account for all phenomena by the motion of molecules attracting one another according to certain laws. Others are more exact: they would suppress attractions acting at a distance; their molecules would follow rectilinear paths, from which they would only be deviated by impacts. Others again, such as Hertz, suppress the forces as well, but suppose their molecules subjected to geometrical connections analogous, for instance, to those of articulated systems; thus, they wish to reduce dynamics to a kind of kinematics. In a word, they all wish to bend nature into a certain form, and unless they can do this they cannot be satisfied. Is Nature flexible enough for this?
We shall examine this question in Chapter XII., àpropos of Maxwell's theory. Every time that the principles of least action and energy are satisfied, we shall see that not only is there always a mechanical explanation possible, but that there is an unlimited number of such explanations. By means of a well-known theorem due to Königs, it may be shown that we can explain everything in an unlimited number of ways, by connections after the manner of Hertz, or, again, by central forces. No doubt it may be just as easily demonstrated that everything may be explained by simple impacts. For this, let us bear in mind that it is not enough to be content with the ordinary matter of which we are aware by means of our senses, and the movements of which we observe directly. We may conceive of ordinary matter as either composed of atoms, whose internal movements escape us, our senses being able to estimate only the displacement of the whole; or we may imagine one of those subtle fluids, which under the name of ether or other names, have from all time played so important a rôle in physical theories. Often we go further, and regard the ether as the only primitive, or even as the only true matter. The more moderate consider ordinary matter to be condensed ether, and there is nothing startling in this conception; but others only reduce its importance still further, and see in matter nothing more than the geometrical locus of singularities in the ether. Lord Kelvin, for instance, holds what we call matter to be only the locus of those points at which the ether is animated by vortex motions. Riemann believes it to be locus of those points at which ether is constantly destroyed; to Wiechert or Larmor, it is the locus of the points at which the ether has undergone a kind of torsion of a very particular kind. Taking any one of these points of view, I ask by what right do we apply to the ether the mechanical properties observed in ordinary matter, which is but false matter? The ancient fluids, caloric, electricity, etc., were abandoned when it was seen that heat is not indestructible. But they were also laid aside for another reason. In materialising them, their individuality was, so to speak, emphasised — gaps were opened between them; and these gaps had to be filled in when the sentiment of the unity of Nature became stronger, and when the intimate relations which connect all the parts were perceived. In multiplying the fluids, not only did the ancient physicists create unnecessary entities, but they destroyed real ties. It is not enough for a theory not to affirm false relations; it must not conceal true relations.
Does our ether actually exist? We know the origin of our belief in the ether. If light takes several years to reach us from a distant star, it is no longer on the star, nor is it on the earth. It must be somewhere, and supported, so to speak, by some material agency.
The same idea may be expressed in a more mathematical and more abstract form. What we note are the changes undergone by the material molecules. We see, for instance, that the photographic plate experiences the consequences of a phenomenon of which the incandescent mass of a star was the scene several years before. Now, in ordinary mechanics, the state of the system under consideration depends only on its state at the moment immediately preceding; the system therefore satisfies certain differential equations. On the other hand, if we did not believe in the ether, the state of the material universe would depend not only on the state immediately preceding, but also on much older states; the system would satisfy equations of finite differences. The ether was invented to escape this breaking down of the laws of general mechanics.
Still, this would only compel us to fill the interplanetary space with ether, but not to make it penetrate into the midst of the material media. Fizeau's experiment goes further. By the interference of rays which have passed through the air or water in motion, it seems to show us two different media penetrating each other, and yet being displaced with respect to each other. The ether is all but in our grasp. Experiments can be conceived in which we come closer still to it. Assume that Newton's principle of the equality of action and re-action is not true if applied to matter alone, and that this can be proved. The geometrical sum of all the forces applied to all the molecules would no longer be zero. If we did not wish to change the whole of the science of mechanics, we should have to introduce the ether, in order that the action which matter apparently undergoes should be counterbalanced by the re-action of matter on something.
Or again, suppose we discover that optical and electrical phenomena are influenced by the motion of the earth. It would follow that those phenomena might reveal to us not only the relative motion of material bodies, but also what would seem to be their absolute motion. Again, it would be necessary to have an ether in order that these so-called absolute movements should not be their displacements with respect to empty space, but with respect to something concrete.
Will this ever be accomplished? I do not think so, and I shall explain why; and yet, it is not absurd, for others have entertained this view. For instance, if the theory of Lorentz, of which I shall speak in more detail in Chapter XIII., were true, Newton's principle would not apply to matter alone, and the difference would not be very far from being within reach of experiment. On the other hand, many experiments have been made on the influence of the motion of the earth. The results have always been negative. But if these experiments have been undertaken, it is because we have not been certain beforehand; and indeed, according to current theories, the compensation would be only approximate, and we might expect to find accurate methods giving positive results. I think that such a hope is illusory; it was none the less interesting to show that a success of this kind would, in a certain sense, open to us a new world.
And now allow me to make a digression; I must explain why I do not believe, in spite of Lorentz, that more exact observations will ever make evident anything else but the relative displacements of material bodies. Experiments have been made that should have disclosed the terms of the first order; the results were nugatory. Could that have been by chance? No one has admitted this; a general explanation was sought, and Lorentz found it. He showed that the terms of the first order should cancel each other, but not the terms of the second order. Then more exact experiments were made, which were also negative; neither could this be the result of chance. An explanation was necessary, and was forthcoming; they always are; hypotheses are what we lack the least. But this is not enough. Who is there who does not think that this leaves to chance far too important a role? Would it not also be a chance that this singular concurrence should cause a certain circumstance to destroy the terms of the first order, and that a totally different but very opportune circumstance should cause those of the second order to vanish? No; the same explanation must be found for the two cases, and everything tends to show that this explanation would serve equally well for the terms of the higher order, and that the mutual destruction of these terms will be rigorous and absolute.
The Present State of Physics. — Two opposite tendencies may be distinguished in the history of the development of physics. On the one hand, new relations are continually being discovered between objects which seemed destined to remain for ever unconnected; scattered facts cease to be strangers to each other and tend to be marshalled into an imposing synthesis. The march of science is towards unity and simplicity.
On the other hand, new phenomena are continually being revealed; it will be long before they can be assigned their place — sometimes it may happen that to find them a place a corner of the edifice must be demolished. In the same way, we are continually perceiving details ever more varied in the phenomena we know, where our crude senses used to be unable to detect any lack of unity. What we thought to be simple becomes complex, and the march of science seems to be towards diversity and complication.
Here, then, are two opposing tendencies, each of which seems to triumph in turn. Which will win? If the first wins, science is possible; but nothing proves this à priori, and it may be that after unsuccessful efforts to bend Nature to our ideal of unity in spite of herself, we shall be submerged by the ever-rising flood of our new riches and compelled to renounce all idea of classification — to abandon our ideal, and to reduce science to the mere recording of innumerable recipes.
In fact, we can give this question no answer. All that we can do is to observe the science of to-day, and compare it with that of yesterday. No doubt after this examination we shall be in a position to offer a few conjectures.
Half-a-century ago hopes ran high indeed. The unity of force had just been revealed to us by the discovery of the conservation of energy and of its transformation. This discovery also showed that the phenomena of heat could be explained by molecular movements. Although the nature of these movements was not exactly known, no one doubted but that they would be ascertained before long. As for light, the work seemed entirely completed. So far as electricity was concerned, there was not so great an advance. Electricity had just annexed magnetism. This was a considerable and a definitive step towards unity. But how was electricity in its turn to be brought into the general unity, and how was it to be included in the general universal mechanism? No one had the slightest idea. As to the possibility of the inclusion, all were agreed; they had faith. Finally, as far as the molecular properties of material bodies are concerned, the inclusion seemed easier, but the details were very hazy. In a word, hopes were vast and strong, but vague.
To-day, what do we see? In the first place, a step in advance — immense progress. The relations between light and electricity are now known; the three domains of light, electricity, and magnetism, formerly separated, are now one; and this annexation seems definitive.
Nevertheless the conquest has caused us some sacrifices. Optical phenomena become particular cases in electric phenomena; as long as the former remained isolated, it was easy to explain them by movements which were thought to be known in all their details. That was easy enough; but any explanation to be accepted must now cover the whole domain of electricity. This cannot be done without difficulty.
The most satisfactory theory is that of Lorentz; it is unquestionably the theory that best explains the known facts, the one that throws into relief the greatest number of known relations, the one in which we find most traces of definitive construction. That it still possesses a serious fault I have shown above. It is in contradiction with Newton's law that action and re-action are equal and opposite — or rather, this principle according to Lorentz cannot be applicable to matter alone; if it be true, it must take into account the action of the ether on matter, and the re-action of the matter on the ether. Now, in the new order, it is very likely that things do not happen in this way.
However this may be, it is due to Lorentz that the results of Fizeau on the optics of moving bodies, the laws of normal and abnormal dispersion and of absorption are connected with each other and with the other properties of the ether, by bonds which no doubt will not be readily severed. Look at the ease with which the new Zeeman phenomenon found its place, and even aided the classification of Faraday's magnetic rotation, which had defied all Maxwell's efforts. This facility proves that Lorentz's theory is not a mere artificial combination which must eventually find its solvent. It will probably have to be modified, but not destroyed.
The only object of Lorentz was to include in a single whole all the optics and electro-dynamics of moving bodies; he did not claim to give a mechanical explanation. Larmor goes further; keeping the essential part of Lorentz's theory, he grafts upon it, so to speak, MacCullagh's ideas on the direction of the movement of the ether. MacCullagh held that the velocity of the ether is the same in magnitude and direction as the magnetic force. Ingenious as is this attempt, the fault in Lorentz's theory remains, and is even aggravated. According to Lorentz, we do not know what the movements of the ether are; and because we do not know this, we may suppose them to be movements compensating those of matter, and re-affirming that action and re-action are equal and opposite. According to Larmor we know the movements of the ether, and we can prove that the compensation does not take place.
If Larmor has failed, as in my opinion he has, does it necessarily follow that a mechanical explanation is impossible? Far from it. I said above that as long as a phenomenon obeys the two principles of energy and least action, so long it allows of an unlimited number of mechanical explanations. And so with the phenomena of optics and electricity.
But this is not enough. For a mechanical explanation to be good it must be simple; to choose it from among all the explanations that are possible there must be other reasons than the necessity of making a choice. Well, we have no theory as yet which will satisfy this condition and consequently be of any use. Are we then to complain? That would be to forget the end we seek, which is not the mechanism; the true and only aim is unity.
We ought therefore to set some limits to our ambition. Let us not seek to formulate a mechanical explanation; let us be content to show that we can always find one if we wish. In this we have succeeded. The principle of the conservation of energy has always been confirmed, and now it has a fellow in the principle of least action, stated in the form appropriate to physics. This has also been verified, at least as far as concerns the reversible phenomena which obey Lagrange's equations — in other words, which obey the most general laws of physics. The irreversible phenomena are much more difficult to bring into line; but they, too, are being co-ordinated and tend to come into the unity. The light which illuminates them comes from Carnot's principle. For a long time thermo-dynamics was confined to the study of the dilatations of bodies and of their change of state. For some time past it has been growing bolder, and has considerably extended its domain. We owe to it the theories of the voltaic cell and of their thermo-electric phenomena; there is not a corner in physics which it has not explored, and it has even attacked chemistry itself. The same laws hold good; everywhere, disguised in some form or other, we find Carnot's principle; everywhere also appears that eminently abstract concept of entropy which is as universal as the concept of energy, and like it, seems to conceal a reality. It seemed that radiant heat must escape, but recently that, too, has been brought under the same laws.
In this way fresh analogies are revealed which may be often pursued in detail; electric resistance resembles the viscosity of fluids; hysteresis would rather be like the friction of solids. In all cases friction appears to be the type most imitated by the most diverse irreversible phenomena, and this relationship is real and profound.
A strictly mechanical explanation of these phenomena has also been sought, but, owing to their nature, it is hardly likely that it will be found. To find it, it has been necessary to suppose that the irreversibility is but apparent, that the elementary phenomena are reversible and obey the known laws of dynamics. But the elements are extremely numerous, and become blended more and more, so that to our crude sight all appears to tend towards uniformity — i.e., all seems to progress in the same direction, and that without hope of return. The apparent irreversibility is therefore but an effect of the law of great numbers. Only a being of infinitely subtle senses, such as Maxwell's demon, could unravel this tangled skein and turn back the course of the universe.
This conception, which is connected with the kinetic theory of gases, has cost great effort and has not, on the whole, been fruitful; it may become so. This is not the place to examine if it leads to contradictions, and if it is in conformity with the true nature of things.
Let us notice, however, the original ideas of M. Gouy on the Brownian movement. According to this scientist, this singular movement does not obey Carnot's principle. The particles which it sets moving would be smaller than the meshes of that tightly drawn net; they would thus be ready to separate them, and thereby to set back the course of the universe. One can almost see Maxwell's demon at work.[2]
To resume, phenomena long known are gradually being better classified, but new phenomena come to claim their place, and most of them, like the Zeeman effect, find it at once. Then we have the cathode rays, the X-rays, uranium and radium rays; in fact, a whole world of which none had suspected the existence. How many unexpected guests to find a place for! No one can yet predict the place they will occupy, but I do not believe they will destroy the general unity; I think that they will rather complete it. On the one hand, indeed, the new radiations seem to be connected with the phenomena of luminosity; not only do they excite fluorescence, but they sometimes come into existence under the same conditions as that property; neither are they unrelated to the cause which produces the electric spark under the action of ultra-violet light. Finally, and most important of all, it is believed that in all these phenomena there exist ions, animated, it is true, with velocities far greater than those of electrolytes. All this is very vague, but it will all become clearer.
Phosphorescence and the action of light on the spark were regions rather isolated, and consequently somewhat neglected by investigators. It is to be hoped that a new path will now be made which will facilitate their communications with the rest of science. Not only do we discover new phenomena, but those we think we know are revealed in unlooked-for aspects. In the free ether the laws preserve their majestic simplicity, but matter properly so called seems more and more complex; all we can say of it is but approximate, and our formulae are constantly requiring new terms.
But the ranks are unbroken, the relations that we have discovered between objects we thought simple still hold good between the same objects when their complexity is recognised, and that alone is the important thing. Our equations become, it is true, more and more complicated, so as to embrace more closely the complexity of nature; but nothing is changed in the relations which enable these equations to be derived from each other. In a word, the form of these equations persists. Take for instance the laws of reflection. Fresnel established them by a simple and attractive theory which experiment seemed to confirm. Subsequently, more accurate researches have shown that this verification was but approximate; traces of elliptic polarisation were detected everywhere. But it is owing to the first approximation that the cause of these anomalies was found in the existence of a transition layer, and all the essentials of Fresnel's theory have remained. We cannot help reflecting that all these relations would never have been noted if there had been doubt in the first place as to the complexity of the objects they connect. Long ago it was said: If Tycho had had instruments ten times as precise, we would never have had a Kepler, or a Newton, or Astronomy. It is a misfortune for a science to be born too late, when the means of observation have become too perfect. That is what is happening at this moment with respect to physical chemistry; the founders are hampered in their general grasp by third and fourth decimal places; happily they are men of robust faith. As we get to know the properties of matter better we see that continuity reigns. From the work of Andrews and Van der Waals, we see how the transition from the liquid to the gaseous state is made, and that it is not abrupt. Similarly, there is no gap between the liquid and solid states, and in the proceedings of a recent Congress we see memoirs on the rigidity of liquids side by side with papers on the flow of solids.
With this tendency there is no doubt a loss of simplicity. Such and such an effect was represented by straight lines; it is now necessary to connect these lines by more or less complicated curves. On the other hand, unity is gained. Separate categories quieted but did not satisfy the mind.
Finally, a new domain, that of chemistry, has been invaded by the method of physics, and we see the birth of physical chemistry. It is still quite young, but already it has enabled us to connect such phenomena as electrolysis, osmosis, and the movements of ions.
From this cursory exposition what can we conclude? Taking all things into account, we have approached the realisation of unity. This has not been done as quickly as was hoped fifty years ago, and the path predicted has not always been followed; but, on the whole, much ground has been gained.
THE CALCULUS OF PROBABILITIES.
No doubt the reader will be astonished to find reflections on the calculus of probabilities in such a volume as this. What has that calculus to do with physical science? The questions I shall raise — without, however, giving them a solution — are naturally raised by the philosopher who is examining the problems of physics. So far is this the case, that in the two preceding chapters I have several times used the words "probability" and "chance." " Predicted facts," as I said above, "can only be probable." However solidly founded a prediction may appear to be, we are never absolutely certain that experiment will not prove it false; but the probability is often so great that practically it may be accepted. And a little farther on I added: — "See what a part the belief in simplicity plays in our generalisations. We have verified a simple law in a large number of particular cases, and we refuse to admit that this so-often-repeated coincidence is a mere effect of chance." Thus, in a multitude of circumstances the physicist is often in the same position as the gambler who reckons up his chances. Every time that he reasons by induction, he more or less consciously requires the calculus of probabilities, and that is why I am obliged to open this chapter parenthetically, and to interrupt our discussion of method in the physical sciences in order to examine a little closer what this calculus is worth, and what dependence we may place upon it. The very name of the calculus of probabilities is a paradox. Probability as opposed to certainty is what one does not know, and how can we calculate the unknown? Yet many eminent scientists have devoted themselves to this calculus, and it cannot be denied that science has drawn there from no small advantage. How can we explain this apparent contradiction? Has probability been defined? Can it even be defined? And if it can not, how can we venture to reason upon it? The definition, it will be said, is very simple. The probability of an event is the ratio of the number of cases favourable to the event to the total number of possible cases. A simple example will show how incomplete this definition is: — I throw two dice. What is the probability that one of the two at least turns up a 6? Each can turn up in six different ways; the number of possible cases is
. The number of favourable cases is 11; the probability is
. That is the correct solution. But why cannot we just as well proceed as follows? — The points which turn up on the two dice form
different combinations. Among these combinations, six are favourable; the probability is
. Now why is the first method of calculating the number of possible cases more legitimate than the second? In any case it is not the definition that tells us. We are therefore bound to complete the definition by saying, "... to the total number of possible cases, provided the cases are equally probable." So we are compelled to define the probable by the probable. How can we know that two possible cases are equally probable? Will it be by a convention? If we insert at the beginning of every problem an explicit convention, well and good! We then have nothing to do but to apply the rules of arithmetic and algebra, and we complete our calculation, when our result cannot be called in question. But if we wish to make the slightest application of this result, we must prove that our convention is legitimate, and we shall find ourselves in the presence of the very difficulty we thought we had avoided. It may be said that common-sense is enough to show us the convention that should be adopted. Alas! M. Bertrand has amused himself by discussing the following simple problem: — "What is the probability that a chord of a circle may be greater than the side of the inscribed equilateral triangle?" The illustrious geometer successively adopted two conventions which seemed to be equally imperative in the eyes of common-sense, and with one convention he finds ½, and with the other ⅓. The conclusion which seems to follow from this is that the calculus of probabilities is a useless science, that the obscure instinct which we call common-sense, and to which we appeal for the legitimisation of our conventions, must be distrusted. But to this conclusion we can no longer subscribe. We cannot do without that obscure instinct. Without it, science would be impossible, and without it we could neither discover nor apply a law. Have we any right, for instance, to enunciate Newton's law? No doubt numerous observations are in agreement with it, but is not that a simple fact of chance? and how do we know, besides, that this law which has been true for so many generations will not be untrue in the next? To this objection the only answer you can give is: It is very improbable. But grant the law. By means of it I can calculate the position of Jupiter in a year from now. Yet have I any right to say this? Who can tell if a gigantic mass of enormous velocity is not going to pass near the solar system and produce unforeseen perturbations? Here again the only answer is: It is very improbable. From this point of view all the sciences would only be unconscious applications of the calculus of probabilities. And if this calculus be condemned, then the whole of the sciences must also be condemned. I shall not dwell at length on scientific problems in which the intervention of the calculus of probabilities is more evident. In the forefront of these is the problem of interpolation, in which, knowing a certain number of values of a function, we try to discover the intermediary values. I may also mention the celebrated theory of errors of observation, to which I shall return later; the kinetic theory of gases, a well-known hypothesis wherein each gaseous molecule is supposed to describe an extremely complicated path, but in which, through the effect of great numbers, the mean phenomena which are all we observe obey the simple laws of Mariotte and Gay-Lussac. All these theories are based upon the laws of great numbers, and the calculus of probabilities would evidently involve them in its ruin. It is true that they have only a particular interest, and that, save as far as interpolation is concerned, they are sacrifices to which we might readily be resigned. But I have said above, it would not be these partial sacrifices that would be in question; it would be the legitimacy of the whole of science that would be challenged. I quite see that it might be said: We do not know, and yet we must act. As for action, we have not time to devote ourselves to an inquiry that will suffice to dispel our ignorance. Besides, such an inquiry would demand unlimited time. We must therefore make up our minds without knowing. This must be often done whatever may happen, and we must follow the rules although we may have but little confidence in them. What I know is, not that such a thing is true, but that the best course for me is to act as if it were true. The calculus of probabilities, and therefore science itself, would be no longer of any practical value.
Unfortunately the difficulty does not thus dis appear. A gambler wants to try a coup, and he asks my advice. If I give it him, I use the calculus of probabilities; but I shall not guarantee success. That is what I shall call subjective probability. In this case we might be content with the explanation of which I have just given a sketch. But assume that an observer is present at the play, that he knows of the coup, and that play goes on for a long time, and that he makes a summary of his notes. He will find that events have taken place in conformity with the laws of the calculus of probabilities. That is what I shall call objective probability, and it is this phenomenon which has to be explained. There are numerous Insurance Societies which apply the rules of the calculus of probabilities, and they distribute to their shareholders dividends, the objective reality of which cannot be contested. In order to explain them, we must do more than invoke our ignorance and the necessity of action. Thus, absolute scepticism is not admissible. We may distrust, but we cannot condemn en bloc. Discussion is necessary.
I. Classification of the Problems of Probability. — In order to classify the problems which are presented to us with reference to probabilities, we must look at them from different points of view, and first of all, from that of generality. I said above that probability is the ratio of the number of favourable to the number of possible cases. What for want of a better term I call generality will increase with the number of possible cases. This number may be finite, as, for instance, if we take a throw of the dice in which the number of possible cases is 36. That is the first degree of generality. But if we ask, for instance, what is the probability that a point within a circle is within the inscribed square, there are as many possible cases as there are points in the circle — that is to say, an infinite number. This is the second degree of generality. Generality can be pushed further still. We may ask the probability that a function will satisfy a given condition. There are then as many possible cases as one can imagine different functions. This is the third degree of generality, which we reach, for instance, when we try to find the most probable law after a finite number of observations. Yet we may place ourselves at a quite different point of view. If we were not ignorant there would be no probability, there could only be certainty. But our ignorance cannot be absolute, for then there would be no longer any probability at all. Thus the problems of probability may be classed according to the greater or less depth of this ignorance. In mathematics we may set ourselves problems in probability. What is the probability that the fifth decimal of a logarithm taken at random from a table is a 9. There is no hesitation in answering that this probability is 1-10th. Here we possess all the data of the problem. We can calculate our logarithm without having recourse to the table, but we need not give ourselves the trouble. This is the first degree of ignorance. In the physical sciences our ignorance is already greater. The state of a system at a given moment depends on two things — its initial state, and the law according to which that state varies. If we know both this law and this initial state, we have a simple mathematical problem to solve, and we fall back upon our first degree of ignorance. Then it often happens that we know the law and do not know the initial state. It may be asked, for instance, what is the present distribution of the minor planets? We know that from all time they have obeyed the laws of Kepler, but we do not know what was their initial distribution. In the kinetic theory of gases we assume that the gaseous molecules follow rectilinear paths and obey the laws of impact and elastic bodies; yet as we know nothing of their initial velocities, we know nothing of their present velocities. The calculus of probabilities alone enables us to predict the mean phenomena which will result from a combination of these velocities. This is the second degree of ignorance. Finally it is possible, that not only the initial conditions but the laws themselves are unknown. We then reach the third degree of ignorance, and in general we can no longer affirm anything at all as to the probability of a phenomenon. It often happens that instead of trying to discover an event by means of a more or less imperfect knowledge of the law, the events may be known, and we want to find the law; or that, instead of deducing effects from causes, we wish to deduce the causes from the effects. Now, these problems are classified as probability of causes, and are the most interesting of all from their scientific applications. I play at écarté with a gentleman whom I know to be perfectly honest. What is the chance that he turns up the king? It is 1/8. This is a problem of the probability of effects. I play with a gentleman whom I do not know. He has dealt ten times, and he has turned the king up six times. What is the chance that he is a sharper? This is a problem in the probability of causes. It may be said that it is the essential problem of the experimental method. I have observed n values of x and the corresponding values of y. I have found that the ratio of the latter to the former is practically constant. There is the event; what is the cause? Is it probable that there is a general law according to which y would be proportional to x, and that small divergencies are due to errors of observation? This is the type of question that we are ever asking, and which we unconsciously solve whenever we are engaged in scientific work. I am now going to pass in review these different categories of problems by discussing in succession what I have called subjective and objective probability.
II. Probability in Mathematics. — The impossibility of squaring the circle was shown in 1885, but before that date all geometers considered this im possibility as so "probable" that the Académie des Sciences rejected without examination the, alas! too numerous memoirs on this subject that a few unhappy madmen sent in every year. Was the Académie wrong? Evidently not, and it knew perfectly well that by acting in this manner it did not run the least risk of stifling a discovery of moment. The Académie could not have proved that it was right, but it knew quite well that its instinct did not deceive it. If you had asked the Academicians, they would have answered: "We have compared the probability that an unknown scientist should have found out what has been vainly sought for so long, with the probability that there is one madman the more on the earth, and the latter has appeared to us the greater." These are very good reasons, but there is nothing mathematical about them; they are purely psychological. If you had pressed them further, they would have added: "Why do you expect a particular value of a transcendental function to be an algebraical number; if π be the root of an algebraical equation, why do you expect this root to be a period of the function sin 2x, and why is it not the same with the other roots of the same equation?" To sum up, they would have invoked the principle of sufficient reason in its vaguest form. Yet what information could they draw from it? At most a rule of conduct for the employment of their time, which would be more usefully spent at their ordinary work than in reading a lucubration that inspired in them a legitimate distrust. But what I called above objective probability has nothing in common with this first problem. It is otherwise with the second. Let us consider the first 10,000 logarithms that we find in a table. Among these 10,000 logarithms I take one at random. What is the probability that its third decimal is an even number? You will say with out any hesitation that the probability is ½, and in fact if you pick out in a table the third decimals in these 10,000 numbers you will find nearly as many even digits as odd. Or, if you prefer it, let us write 10,000 numbers corresponding to our 10,000 logarithms, writing down for each of these numbers +1 if the third decimal of the corresponding logarithm is even, and -1 if odd; and then let us take the mean of these 10,000 numbers. I do not hesitate to say that the mean of these 10,000 units is probably zero, and if I were to calculate it practically, I would verify that it is extremely small. But this verification is needless. I might have rigorously proved that this mean is smaller than 0.003. To prove this result I should have had to make a rather long calculation for which there is no room here, and for which I may refer the reader to an article that I published in the Revue générale des Sciences, April 15th, 1899. The only point to which I wish to draw attention is the following. In this calculation I had occasion to rest my case on only two facts namely, that the first and second derivatives of the logarithm remain, in the interval considered, between certain limits. Hence our first conclusion is that the property is not only true of the logarithm but of any continuous function whatever, since the derivatives of every continuous function are limited. If I was certain beforehand of the result, it is because I have often observed analogous facts for other continuous functions; and next, it is because I went through in my mind in a more or less unconscious and imperfect manner the reasoning which led me to the preceding in equalities, just as a skilled calculator before finishing his multiplication takes into account what it ought to come to approximately. And besides, since what I call my intuition was only an incomplete summary of a piece of true reasoning, it is clear that observation has confirmed my predictions, and that the objective and subjective probabilities are in agreement. As a third example I shall choose the following: T— he number u is taken at random and n is a given very large integer. What is the mean value of sin nu? This problem has no meaning by itself. To give it one, a convention is required — namely, we agree that the probability for the number u to lie between a and a + da is Φ(a)da; that it is therefore proportional to the infinitely small interval da, and is equal to this multiplied by a function Φ(a), only depending on a. As for this function I choose it arbitrarily, but I must assume it to be continuous. The value of sin nu remaining the same when u increases by 2π, I may without loss of generality assume that u lies between 0 and 2π, and I shall thus be led to suppose that Φ(a) is a periodic function whose period is 2π. The mean value that we seek is readily expressed by a simple integral, and it is easy to show that this integral is smaller than
, MK being the maximum value of the Kth derivative of Φ(u). We see then that if the Kth derivative is finite, our mean value will tend towards zero when n increases indefinitely, and that more rapidly than
. The mean value of sin nu when n is very large is therefore zero. To define this value I required a convention, but the result remains the same whatever that convention may be. I have imposed upon myself but slight restrictions when I assumed that the function Φ(a) is continuous and periodic, and these hypotheses are so natural that we may ask ourselves how they can be escaped. Examination of the three preceding examples, so different in all respects, has already given us a glimpse on the one hand of the rôle of what philosophers call the principle of sufficient reason, and on the other hand of the importance of the fact that certain properties are common to all continuous functions. The study of probability in the physical sciences will lead us to the same result.
III. Probability in the Physical Sciences. — We now come to the problems which are connected with what I have called the second degree of ignorance — namely, those in which we know the law but do not know the initial state of the system. I could multiply examples, but I shall take only one. What is the probable present distribution of the minor planets on the zodiac? We know they obey the laws of Kepler. We may even, without changing the nature of the problem, suppose that their orbits are circular and situated in the same plane, a plane which we are given. On the other hand, we know absolutely nothing about their initial distribution. However, we do not hesitate to affirm that this distribution is now nearly uniform. Why? Let b be the longitude of a minor planet in the initial epoch that is to say, the epoch zero. Let a be its mean motion. Its longitude at the present time — i.e., at the time t will be at + b. To say that the present distribution is uniform is to say that the mean value of the sines and cosines of multiples of at + b is zero. Why do we assert this? Let us represent our minor planet by a point in a plane — namely, the point whose co-ordinates are a and b. All these representative points will be contained in a certain region of the plane, but as they are very numerous this region will appear dotted with points. We know nothing else about the distribution of the points. Now what do we do when we apply the calculus of probabilities to such a question as this? What is the probability that one or more representative points may be found in a certain portion of the plane? In our ignorance we are compelled to make an arbitrary hypothesis. To explain the nature of this hypothesis I may be allowed to use, instead of a mathematical formula, a crude but concrete image. Let us suppose that over the surface of our plane has been spread imaginary matter, the density of which is variable, but varies continuously. We shall then agree to say that the probable number of representative points to be found on a certain portion of the plane is proportional to the quantity of this imaginary matter which is found there. If there are, then, two regions of the plane of the same extent, the probabilities that a representative point of one of our minor planets is in one or other of these regions will be as the mean densities of the imaginary matter in one or other of the regions. Here then are two distributions, one real, in which the representative points are very numerous, very close together, but discrete like the molecules of matter in the atomic hypothesis; the other remote from reality, in which our representative points are replaced by imaginary continuous matter. We know that the latter cannot be real, but we are forced to adopt it through our ignorance. If, again, we had some idea of the real distribution of the representative points, we could arrange it so that in a region of some extent the density of this imaginary continuous matter may be nearly proportional to the number of representative points, or, if it is preferred, to the number of atoms which are contained in that region. Even that is impossible, and our ignorance is so great that we are forced to choose arbitrarily the function which defines the density of our imaginary matter. We shall be compelled to adopt a hypothesis from which we can hardly get away; we shall suppose that this function is continuous. That is sufficient, as we shall see, to enable us to reach our conclusion.
What is at the instant t the probable distribution of the minor planets — or rather, what is the mean value of the sine of the longitude at the moment t — i.e., of sin (at + b)? We made at the outset an arbitrary convention, but if we adopt it, this probable value is entirely defined. Let us decompose the plane into elements of surface. Consider the value of sin (at + b) at the centre of each of these elements. Multiply this value by the surface of the element and by the corresponding density of the imaginary matter. Let us then take the sum for all the elements of the plane. This sum, by definition, will be the probable mean value we seek, which will thus be expressed by a double integral. It may be thought at first that this mean value depends on the choice of the function Φ which defines the density of the imaginary matter, and as this function Φ is arbitrary, we can, according to the arbitrary choice which we make, obtain a certain mean value. But this is not the case. A simple calculation shows us that our double integral decreases very rapidly as t increases. Thus, I cannot tell what hypothesis to make as to the probability of this or that initial distribution, but when once the hypothesis is made the result will be the same, and this gets me out of my difficulty. Whatever the function Φ may be, the mean value tends towards zero as t increases, and as the minor planets have certainly accomplished a very large number of revolutions, I may assert that this mean value is very small. I may give to Φ any value I choose, with one restriction: this function must be continuous; and, in fact, from the point of view of subjective probability, the choice of a discontinuous function would have been unreasonable. What reason could I have, for instance, for supposing that the initial longitude might be exactly o°, but that it could not lie between o° and 1°?
The difficulty reappears if we look at it from the point of view of objective probability; if we pass from our imaginary distribution in which the supposititious matter was assumed to be continuous, to the real distribution in which our representative points are formed as discrete atoms. The mean value of sin (at + b) will be represented quite simply by

n being the number of minor planets. Instead of a double integral referring to a continuous function, we shall have a sum of discrete terms. However, no one will seriously doubt that this mean value is practically very small. Our representative points being very close together, our discrete sum will in general differ very little from an integral. An integral is the limit towards which a sum of terms tends when the number of these terms is indefinitely increased. If the terms are very numerous, the sum will differ very little from its limit — that is to say, from the integral, and what I said of the latter will still be true of the sum itself. But there are exceptions. If, for instance, for all the minor planets
, the longitude of all the planets at the time t would be
, and the mean value in question would be evidently unity. For this to be the case at the time o, the minor planets must have all been lying on a kind of spiral of peculiar form, with its spires very close together. All will admit that such an initial distribution is extremely improbable (and even if it were realised, the distribution would not be uniform at the present time — for example, on the 1st January 1900; but it would become so a few years later). Why, then, do we think this initial distribution improbable? This must be explained, for if we are wrong in rejecting as improbable this absurd hypothesis, our inquiry breaks down, and we can no longer affirm any thing on the subject of the probability of this or that present distribution. Once more we shall invoke the principle of sufficient reason, to which we must always recur. We might admit that at the beginning the planets were distributed almost in a straight line. We might admit that they were irregularly distributed. But it seems to us that there is no sufficient reason for the unknown cause that gave them birth to have acted along a curve so regular and yet so complicated, which would appear to have been expressly chosen so that the distribution at the present day would not be uniform.
IV. Rouge et Noir. — The questions raised by games of chance, such as roulette, are, fundamentally, quite analogous to those we have just treated. For example, a wheel is divided into thirty-seven equal compartments, alternately red and black. A ball is spun round the wheel, and after having moved round a number of times, it stops in front of one of these sub-divisions. The probability that the division is red is obviously ½. The needle describes an angle θ, including several complete revolutions. I do not know what is the probability that the ball is spun with such a force that this angle should lie between θ and θ+dθ, but I can make a convention. I can suppose that this probability is Φ(θ)dθ. As for the function Φ(θ), I can choose it in an entirely arbitrary manner. I have nothing to guide me in my choice, but I am naturally induced to suppose the function to be continuous. Let ε be a length (measured on the circumference of the circle of radius unity) of each red and black compartment. We have to calculate the integral of Φ(θ)dθ, extending it on the one hand to all the red, and on the other hand to all the black compartments, and to compare the results. Consider an interval 2ε comprising two consecutive red and black compartments. Let M and m be the maximum and minimum values of the function Φ(θ) in this interval. The integral extended to the red compartments will be smaller than Σmε; extended to the black it will be greater than Σmε. The difference will therefore be smaller than Σ(M-m)ε. But if the function Φ is supposed continuous, and if on the other hand the interval ε is very small with respect to the total angle described by the needle, the difference M-m will be very small. The difference of the two integrals will be therefore very small, and the probability will be very nearly ½. We see that without knowing anything of the function Φ we must act as if the probability were ½. And on the other hand it explains why, from the objective point of view, if I watch a certain number of coups, observation will give me almost as many black coups as red. All the players know this objective law; but it leads them into a remarkable error, which has often been exposed, but into which they are always falling. When the red has won, for example, six times running, they bet on black, thinking that they are playing an absolutely safe game, because they say it is a very rare thing for the red to win seven times running. In reality their probability of winning is still ½. Observation shows, it is true, that the series of seven consecutive reds is very rare, but series of six reds followed by a black are also very rare. They have noticed the rarity of the series of seven reds; if they have not remarked the rarity of six reds and a black, it is only because such series strike the attention less.
V. The Probability of Causes. — We now come to the problems of the probability of causes, the most important from the point of view of scientific applications. Two stars, for instance, are very close together on the celestial sphere. Is this apparent contiguity a mere effect of chance? Are these stars, although almost on the same visual ray, situated at very different distances from the earth, and therefore very far indeed from one another? or does the apparent correspond to a real contiguity? This is a problem on the probability of causes.
First of all, I recall that at the outset of all problems of probability of effects that have occupied our attention up to now, we have had to use a convention which was more or less justified; and if in most cases the result was to a certain extent independent of this convention, it was only the condition of certain hypotheses which enabled us à priori to reject discontinuous functions, for example, or certain absurd conventions. We shall again find something analogous to this when we deal with the probability of causes. An effect may be produced by the cause a or by the cause b. The effect has just been observed. We ask the probability that it is due to the cause a. This is an à posteriori probability of cause. But I could not calculate it, if a convention more or less justified did not tell me in advance what is the à priori probability for the cause a to come into play — I mean the probability of this event to some one who had not observed the effect. To make my meaning clearer, I go back to the game of écarté mentioned before. My adversary deals for the first time and turns up a king. What is the probability that he is a sharper? The formulae ordinarily taught give 8/9, a result which is obviously rather surprising. If we look at it closer, we see that the conclusion is arrived at as if, before sitting down at the table, I had considered that there was one chance in two that my adversary was not honest. An absurd hypothesis, because in that case I should certainly not have played with him; and this explains the absurdity of the conclusion. The function on the à priori probability was unjustified, and that is why the conclusion of the à posteriori probability led me into an inadmissible result. The importance of this preliminary convention is obvious. I shall even add that if none were made, the problem of the à posteriori probability would have no meaning. It must be always made either explicitly or tacitly.
Let us pass on to an example of a more scientific character. I require to determine an experimental law; this law, when discovered, can be represented by a curve. I make a certain number of isolated observations, each of which may be represented by a point. When I have obtained these different points, I draw a curve between them as carefully as possible, giving my curve a regular form, avoiding sharp angles, accentuated inflexions, and any sudden variation of the radius of curvature. This curve will represent to me the probable law, and not only will it give me the values of the functions intermediary to those which have been observed, but it also gives me the observed values more accurately than direct observation does; that is why I make the curve pass near the points and not through the points themselves.
Here, then, is a problem in the probability of causes. The effects are the measurements I have recorded; they depend on the combination of two causes — the true law of the phenomenon and errors of observation. Knowing the effects, we have to find the probability that the phenomenon shall obey this law or that, and that the observations have been accompanied by this or that error. The most probable law, therefore, corresponds to the curve we have traced, and the most probable error is represented by the distance of the corresponding point from that curve. But the problem has no meaning if before the observations I had an à priori idea of the probability of this law or that, or of the chances of error to which I am exposed. If my instruments are good (and I knew whether this is so or not before beginning the observations), I shall not draw the curve far from the points which represent the rough measurements. If they are inferior, I may draw it a little farther from the points, so that I may get a less sinuous curve; much will be sacrificed to regularity.
Why, then, do I draw a curve without sinuosities? Because I consider à priori a law represented by a continuous function (or function the derivatives of which to a high order are small), as more probable than a law not satisfying those conditions. But for this conviction the problem would have no meaning; interpolation would be impossible; no law could be deduced from a finite number of observations; science would cease to exist.
Fifty years ago physicists considered, other things being equal, a simple law as more probable than a complicated law. This principle was even invoked in favour of Mariotte's law as against that of Regnault. But this belief is now repudiated; and yet, how many times are we compelled to act as though we still held it! However that may be, what remains of this tendency is the belief in continuity, and as we have just seen, if the belief in continuity were to disappear, experimental science would become impossible.
VI. The Theory of Errors. — We are thus brought to consider the theory of errors which is directly connected with the problem of the probability of causes. Here again we find effects — to wit, a certain number of irreconcilable observations, and we try to find the causes which are, on the one hand, the true value of the quantity to be measured, and, on the other, the error made in each isolated observation. We must calculate the probable a posteriori value of each error, and therefore the probable value of the quantity to be measured. But, as I have just explained, we cannot undertake this calculation unless we admit à priori — i.e., before any observations are made — that there is a law of the probability of errors. Is there a law of errors? The law to which all calculators assent is Gauss's law, that is represented by a certain transcendental curve known as the "bell."
But it is first of all necessary to recall the classic distinction between systematic and accidental errors. If the metre with which we measure a length is too long, the number we get will be too small, and it will be no use to measure several times — that is a systematic error. If we measure with an accurate metre, we may make a mistake, and find the length sometimes too large and sometimes too small, and when we take the mean of a large number of measurements, the error will tend to grow small. These are accidental errors.
It is clear that systematic errors do not satisfy Gauss's law, but do accidental errors satisfy it? Numerous proofs have been attempted, almost all of them crude paralogisms. But starting from the following hypotheses we may prove Gauss's law: the error is the result of a very large number of partial and independent errors; each partial error is very small and obeys any law of probability whatever, provided the probability of a positive error is the same as that of an equal negative error. It is clear that these conditions will be often, but not always, fulfilled, and we may reserve the name of accidental for errors which satisfy them.
We see that the method of least squares is not legitimate in every case; in general, physicists are more distrustful of it than astronomers. This is no doubt because the latter, apart from the systematic errors to which they and the physicists are subject alike, have to contend with an extremely important source of error which is entirely accidental — I mean atmospheric undulations. So it is very curious to hear a discussion between a physicist and an astronomer about a method of observation. The physicist, persuaded that one good measurement is worth more than many bad ones, is pre-eminently concerned with the elimination by means of every precaution of the final systematic errors; the astronomer retorts: "But you can only observe a small number of stars, and accidental errors will not disappear."
What conclusion must we draw? Must we continue to use the method of least squares? We must distinguish. We have eliminated all the systematic errors of which we have any suspicion; we are quite certain that there are others still, but we cannot detect them; and yet we must make up our minds and adopt a definitive value which will be regarded as the probable value; and for that purpose it is clear that the best thing we can do is to apply Gauss's law. We have only applied a practical rule referring to subjective probability. And there is no more to be said.
Yet we want to go farther and say that not only the probable value is so much, but that the probable error in the result is so much. This is absolutely invalid: it would be true only if we were sure that all the systematic errors were eliminated, and of that we know absolutely nothing. We have two series of observations; by applying the law of least squares we find that the probable error in the first series is twice as small as in the second. The second series may, how ever, be more accurate than the first, because the first is perhaps affected by a large systematic error. All that we can say is, that the first series is probably better than the second because its accidental error is smaller, and that we have no reason for affirming that the systematic error is greater for one of the series than for the other, our ignorance on this point being absolute.
VII. Conclusions. In the preceding lines I have set several problems, and have given no solution. I do not regret this, for perhaps they will invite the reader to reflect on these delicate questions.
However that may be, there are certain points which seem to be well established. To undertake the calculation of any probability, and even for that calculation to have any meaning at all, we must admit, as a point of departure, an hypothesis or convention which has always something arbitrary about it. In the choice of this convention we can be guided only by the principle of sufficient reason. Unfortunately, this principle is very vague and very elastic, and in the cursory examination we have just made we have seen it assume different forms. The form under which we meet it most often is the belief in continuity, a belief which it would be difficult to justify by apodeictic reasoning, but without which all science would be impossible. Finally, the problems to which the calculus of probabilities may be applied with profit are those in which the result is independent of the hypothesis made at the outset, provided only that this hypothesis satisfies the condition of continuity.
OPTICS AND ELECTRICITY.
Fresnel's Theory. — The best example that can be chosen is the theory of light and its relations to the theory of electricity. It is owing to Fresnel that the science of optics is more advanced than any other branch of physics. The theory called the theory of undulations forms a complete whole, which is satisfying to the mind; but we must not ask from it what it cannot give us. The object of mathematical theories is not to reveal to us the real nature of things; that would be an unreasonable claim. Their only object is to co-ordinate the physical laws with which physical experiment makes us acquainted, the enunciation of which, without the aid of mathematics, we should be unable to effect. Whether the ether exists or not matters little — let us leave that to the metaphysicians; what is essential for us is, that everything happens as if it existed, and that this hypothesis is found to be suitable for the explanation of phenomena. After all, have we any other reason for believing in the existence of material objects? That, too, is only a convenient hypothesis; only, it will never cease to be so, while some day, no doubt, the ether will be thrown aside as useless.
But at the present moment the laws of optics, and the equations which translate them into the language of analysis, hold good — at least as a first approximation. It will therefore be always useful to study a theory which brings these equations into connection.
The undulatory theory is based on a molecular hypothesis; this is an advantage to those who think they can discover the cause under the law. But others find in it a reason for distrust; and this distrust seems to me as unfounded as the illusions of the former. These hypotheses play but a secondary role. They may be sacrificed, and the sole reason why this is not generally done is, that it would involve a certain loss of lucidity in the explanation. In fact, if we look at it a little closer we shall see that we borrow from molecular hypotheses but two things — the principle of the conservation of energy, and the linear form of the equations, which is the general law of small movements as of all small variations. This explains why most of the conclusions of Fresnel remain unchanged when we adopt the electro magnetic theory of light.
Maxwell's Theory. — We all know that it was Maxwell who connected by a slender tie two branches of physics — optics and electricity — until then unsuspected of having anything in common. Thus blended in a larger aggregate, in a higher harmony, Fresnel's theory of optics did not perish. Parts of it are yet alive, and their mutual relations are still the same. Only, the language which we use to express them has changed; and, on the other hand, Maxwell has revealed to us other relations, hitherto unsuspected, between the different branches of optics and the domain of electricity.
The first time a French reader opens Maxwell's book, his admiration is tempered with a feeling of uneasiness, and often of distrust.
It is only after prolonged study, and at the cost of much effort, that this feeling disappears. Some minds of high calibre never lose this feeling. Why is it so difficult for the ideas of this English scientist to become acclimatised among us? No doubt the education received by most enlightened Frenchmen predisposes them to appreciate precision and logic more than any other qualities. In this respect the old theories of mathematical physics gave us complete satisfaction. All our masters, from Laplace to Cauchy, proceeded along the same lines. Starting with clearly enunciated hypotheses, they deduced from them all their consequences with mathematical rigour, and then compared them with experiment. It seemed to be their aim to give to each of the branches of physics the same precision as to celestial mechanics.
A mind accustomed to admire such models is not easily satisfied with a theory. Not only will it not tolerate the least appearance of contradiction, but it will expect the different parts to be logically connected with one another, and will require the number of hypotheses to be reduced to a minimum.
This is not all; there will be other demands which appear to me to be less reasonable. Behind the matter of which our senses are aware, and which is made known to us by experiment, such a thinker will expect to see another kind of matter — the only true matter in its opinion — which will no longer have anything but purely geometrical qualities, and the atoms of which will be mathematical points subject to the laws of dynamics alone. And yet he will try to represent to himself, by an unconscious contradiction, these invisible and colourless atoms, and therefore to bring them as close as possible to ordinary matter.
Then only will he be thoroughly satisfied, and he will then imagine that he has penetrated the secret of the universe. Even if the satisfaction is fallacious, it is none the less difficult to give it up. Thus, on opening the pages of Maxwell, a French man expects to find a theoretical whole, as logical and as precise as the physical optics that is founded on the hypothesis of the ether. He is thus preparing for himself a disappointment which I should like the reader to avoid; so I will warn him at once of what he will find and what he will not find in Maxwell.
Maxwell does not give a mechanical explanation of electricity and magnetism; he confines himself to showing that such an explanation is possible. He shows that the phenomena of optics are only a particular case of electro-magnetic phenomena. From the whole theory of electricity a theory of light can be immediately deduced. Unfortunately the converse is not true; it is not always easy to find a complete explanation of electrical phenomena. In particular it is not easy if we take as our starting-point Fresnel's theory; to do so, no doubt, would be impossible; but none the less we must ask ourselves if we are compelled to surrender admirable results which we thought we had definitively acquired. That seems a step backwards, and many sound intellects will not willingly allow of this.
Should the reader consent to set some bounds to his hopes, he will still come across other difficulties. The English scientist does not try to erect a unique, definitive, and well-arranged building; he seems to raise rather a large number of provisional and independent constructions, between which communication is difficult and sometimes impossible. Take, for instance, the chapter in which electrostatic attractions are explained by the pressures and tensions of the dielectric medium. This chapter might be sup pressed without the rest of the book being thereby less clear or less complete, and yet it contains a theory which is self-sufficient, and which can be understood without reading a word of what precedes or follows. But it is not only independent of the rest of the book; it is difficult to reconcile it with the fundamental ideas of the volume. Maxwell does not even attempt to reconcile it; he merely says: "I have not been able to make the next step — namely, to account by mechanical considerations for these stresses in the dielectric."
This example will be sufficient to show what I mean; I could quote many others. Thus, who would suspect on reading the pages devoted to magnetic rotatory polarisation that there is an identity between optical and magnetic phenomena?
We must not flatter ourselves that we have avoided every contradiction, but we ought to make up our minds. Two contradictory theories, provided that they are kept from overlapping, and that we do not look to find in them the explanation of things, may, in fact, be very useful instruments of research; and perhaps the reading of Maxwell would be less suggestive if he had not opened up to us so many new and divergent ways. But the fundamental idea is masked, as it were. So far is this the case, that in most works that are popularised, this idea is the only point which is left completely untouched. To show the importance of this, I think I ought to explain in what this fundamental idea consists; but for that purpose a short digression is necessary.
The Mechanical Explanation of Physical Phenomena. — In every physical phenomenon there is a certain number of parameters which are reached directly by experiment, and which can be measured. I shall call them the parameters q. Observation next teaches us the laws of the variations of these parameters, and these laws can be generally stated in the form of differential equations which connect together the parameters q and time. What can be done to give a mechanical interpretation to such a phenomenon? We may endeavour to explain it, either by the movements of ordinary matter, or by those of one or more hypothetical fluids. These fluids will be considered as formed of a very large number of isolated molecules m. When may we say that we have a complete mechanical explanation of the phenomenon? It will be, on the one hand, when we know the differential equations which are satisfied by the co-ordinates of these hypothetical molecules m, equations which must, in addition, conform to the laws of dynamics; and, on the other hand, when we know the relations which define the co-ordinates of the molecules m as functions of the parameters q, attainable by experiment. These equations, as I have said, should conform to the principles of dynamics, and, in particular, to the principle of the conservation of energy, and to that of least action.
The first of these two principles teaches us that the total energy is constant, and may be divided into two parts:
(1) Kinetic energy, or vis viva, which depends on the masses of the hypothetical molecules m, and on their velocities. This I shall call T. (2) The potential energy which depends only on the co-ordinates of these molecules, and this I shall call U. It is the sum of the energies T and U that is constant.
Now what are we taught by the principle of least action? It teaches us that to pass from the initial position occupied at the instant t0 to the final position occupied at the instant t1, the system must describe such a path that in the interval of time between the instant t0 and t1, the mean value of the action — i.e., the difference between the two energies T and U, must be as small as possible. The first of these two principles is, moreover, a consequence of the second. If we know the functions T and U, this second principle is sufficient to determine the equations of motion.
Among the paths which enable us to pass from one position to another, there is clearly one for which the mean value of the action is smaller than for all the others. In addition, there is only such path; and it follows from this, that the principle of least action is sufficient to determine the path followed, and therefore the equations of motion. We thus obtain what are called the equations of Lagrange. In these equations the independent variables are the co-ordinates of the hypothetical molecules m; but I now assume that we take for the variables the parameters q, which are directly accessible to experiment.
The two parts of the energy should then be expressed as a function of the parameters q and their derivatives; it is clear that it is under this form that they will appear to the experimenter. The latter will naturally endeavour to define kinetic and potential energy by the aid of quantities he can directly observe.[4] If this be granted, the system will always proceed from one position to another by such a path that the mean value of the action is a minimum. It matters little that T and U are now expressed by the aid of the parameters q and their derivatives; it matters little that it is also by the aid of these parameters that we define the initial and final positions; the principle of least action will always remain true.
Now here again, of the whole of the paths which lead from one position to another, there is one and only one for which the mean action is a minimum. The principle of least action is therefore sufficient for the determination of the differential equations which define the variations of the parameters q. The equations thus obtained are another form of Lagrange's equations.
To form these equations we need not know the relations which connect the parameters q with the co-ordinates of the hypothetical molecules, nor the masses of the molecules, nor the expression of U as a function of the co-ordinates of these molecules. All we need know is the expression of U as a function of the parameters q, and that of T as a function of the parameters q and their derivatives — i.e., the expressions of the kinetic and potential energy in terms of experimental data.
One of two things must now happen. Either for a convenient choice of T and U the Lagrangian equations, constructed as we have indicated, will be identical with the differential equations deduced from experiment, or there will be no functions T and U for which this identity takes place. In the latter case it is clear that no mechanical explanation is possible. The necessary condition for a mechanical explanation to be possible is therefore this: that we may choose the functions T and U so as to satisfy the principle of least action, and of the conservation of energy. Besides, this condition is sufficient. Suppose, in fact, that we have found a function U of the parameters q, which represents one of the parts of energy, and that the part of the energy which we represent by T is a function of the parameters q and their derivatives; that it is a polynomial of the second degree with respect to its derivatives, and finally that the Lagrangian equations formed by the aid of these two functions T and U are in conformity with the data of the experiment. How can we deduce from this a mechanical explanation? U must be regarded as the potential energy of a system of which T is the kinetic energy. There is no difficulty as far as U is concerned, but can T be regarded as the vis viva of a material system?
It is easily shown that this is always possible, and in an unlimited number of ways. I will be content with referring the reader to the pages of the preface of my Électricité et Optique for further details. Thus, if the principle of least action cannot be satisfied, no mechanical explanation is possible; if it can be satisfied, there is not only one explanation, but an unlimited number, whence it follows that since there is one there must be an unlimited number.
One more remark. Among the quantities that may be reached by experiment directly we shall consider some as the co-ordinates of our hypothetical molecules, some will be our parameters q, and the rest will be regarded as dependent not only on the co-ordinates but on the velocities — or what comes to the same thing, we look on them as derivatives of the parameters q, or as combinations of these parameters and their derivatives.
Here then a question occurs: among all these quantities measured experimentally which shall we choose to represent the parameters q? and which shall we prefer to regard as the derivatives of these parameters? This choice remains arbitrary to a large extent, but a mechanical explanation will be possible if it is done so as to satisfy the principle of least action.
Next, Maxwell asks: Can this choice and that of the two energies T and U be made so that electric phenomena will satisfy this principle? Experiment shows us that the energy of an electro-magnetic field decomposes into electro-static and electro-dynamic energy. Maxwell recognised that if we regard the former as the potential energy U, and the latter as the kinetic energy T, and that if on the other hand we take the electro-static charges of the conductors as the parameters q, and the intensity of the currents as derivatives of other parameters q — under these conditions, Maxwell has recognised that electric phenomena satisfies the principle of least action. He was then certain of a mechanical explanation. If he had expounded this theory at the beginning of his first volume, instead of relegating it to a corner of the second, it would not have escaped the attention of most readers. If therefore a phenomenon allows of a complete mechanical explanation, it allows of an unlimited number of others, which will equally take into account all the particulars revealed by experiment. And this is confirmed by the history of every branch of physics. In Optics, for instance, Fresnel believed vibration to be perpendicular to the plane of polarisation; Neumann holds that it is parallel to that plane. For a long time an experimentum crucis was sought for, which would enable us to decide between these two theories, but in vain. In the same way, without going out of the domain of electricity, we find that the theory of two fluids and the single fluid theory equally account in a satisfactory manner for all the laws of electro-statics. All these facts are easily explained, thanks to the properties of the Lagrange equations.
It is easy now to understand Maxwell's fundamental idea. To demonstrate the possibility of a mechanical explanation of electricity we need not trouble to find the explanation itself; we need only know the expression of the two functions T and U, which are the two parts of energy, and to form with these two functions Lagrange's equations, and then to compare these equations with the experimental laws.
How shall we choose from all the possible explanations one in which the help of experiment will be wanting? The day will perhaps come when physicists will no longer concern themselves with questions which are inaccessible to positive methods, and will leave them to the metaphysicians. That day has not yet come; man does not so easily resign himself to remaining for ever ignorant of the causes of things. Our choice cannot be therefore any longer guided by considerations in which personal appreciation plays too large a part. There are, however, solutions which all will reject because of their fantastic nature, and others which all will prefer because of their simplicity. As far as magnetism and electricity are concerned, Maxwell abstained from making any choice. It is not that he has a systematic contempt for all that positive methods cannot reach, as may be seen from the time he has devoted to the kinetic theory of gases. I may add that if in his magnum opus he develops no complete explanation, he has attempted one in an article in the Philosophical Magazine. The strangeness and the complexity of the hypotheses he found himself compelled to make, led him afterwards to withdraw it.
The same spirit is found throughout his whole work. He throws into relief the essential — i.e., what is common to all theories; everything that suits only a particular theory is passed over almost in silence. The reader therefore finds himself in the presence of form nearly devoid of matter, which at first he is tempted to take as a fugitive and unassailable phantom. But the efforts he is thus compelled to make force him to think, and eventually he sees that there is often something rather artificial in the theoretical "aggregates" which he once admired.
ELECTRO-DYNAMICS.
THE history of electro-dynamics is very instructive from our point of view. The title of Ampère's immortal work is, Théorie des phéenomènes electro-dynamiques, uniquement fondée sur expérience. He therefore imagined that he had made no hypotheses; but as we shall not be long in recognising, he was mistaken; only, of these hypotheses he was quite unaware. On the other hand, his successors see them clearly enough, because their attention is attracted by the weak points in Ampère's solution. They made fresh hypotheses, but this time deliberately. How many times they had to change them before they reached the classic system, which is perhaps even now not quite definitive, we shall see.
I. Ampère's Theory. — In Ampère's experimental study of the mutual action of currents, he has operated, and he could operate only, with closed currents. This was not because he denied the existence or possibility of open currents. If two conductors are positively and negatively charged and brought into communication by a wire, a current is set up which passes from one to the other until the two potentials are equal. According to the ideas of Ampère's time, this was considered to be an open current; the current was known to pass from the first conductor to the second, but they did not know it returned from the second to the first. All currents of this kind were therefore considered by Ampère to be open currents — for instance, the currents of discharge of a condenser; he was unable to experiment on them, their duration being too short. Another kind of open current may be imagined. Suppose we have two conductors A and B connected by a wire AMB. Small conducting masses in motion are first of all placed in contact with the conductor B, receive an electric charge, and leaving B are set in motion along a path BNA, carrying their charge with them. On coming into contact with A they lose their charge, which then returns to B along the wire AMB. Now here we have, in a sense, a closed circuit, since the electricity describes the closed circuit BNAMB; but the two parts of the current are quite different. In the wire AMB the electricity is displaced through a fixed conductor like a voltaic current, overcoming an ohmic resistance and developing heat; we say that it is displaced by conduction. In the part BNA the electricity is carried by a moving conductor, and is said to be displaced by convection. If therefore the convection current is considered to be perfectly analogous to the conduction current, the circuit BNAMB is closed; if on the contrary the convection current is not a "true current," and, for instance, does not act on the magnet, there is only the conduction current AMB, which is open. For example, if we connect by a wire the poles of a Holtz machine, the charged rotating disc transfers the electricity by convection from one pole to the other, and it returns to the first pole by conduction through the wire. But currents of this kind are very difficult to produce with appreciable intensity; in fact, with the means at Ampère's disposal we may almost say it was impossible.
To sum up, Ampère could conceive of the existence of two kinds of open currents, but he could experiment on neither, because they were not strong enough, or because their duration was too short. Experiment therefore could only show him the action of a closed current on a closed current — or more accurately, the action of a closed current on a portion of current, because a current can be made to describe a closed circuit, of which part may be in motion and the other part fixed. The displacements of the moving part may be studied under the action of another closed current. On the other hand, Ampère had no means of studying the action of an open current either on a closed or on another open current.
1. The Case of Closed Currents. — In the case of the mutual action of two closed currents, experiment revealed to Ampère remarkably simple laws. The following will be useful to us in the sequel: —
(1) If the intensity of the currents is kept constant, and if the two circuits, after having undergone any displacements and deformations whatever, return finally to their initial positions, the total work done by the electro-dynamical actions is zero. In other words, there is an electro-dynamical potential of the two circuits proportional to the product of their intensities, and depending on the form and relative positions of the circuits; the work done by the electro-dynamical actions is equal to the change of this potential.
(2) The action of a closed solenoid is zero.
(3) The action of a circuit C on another voltaic circuit C' depends only on the "magnetic field" developed by the circuit C. At each point in space we can, in fact, define in magnitude and direction a certain force called "magnetic force," which enjoys the following properties: —
(a) The force exercised by C on a magnetic pole is applied to that pole, and is equal to the magnetic force multiplied by the magnetic mass of the pole.
(b) A very short magnetic needle tends to take the direction of the magnetic force, and the couple to which it tends to reduce is proportional to the product of the magnetic force, the magnetic moment of the needle, and the sine of the dip of the needle.
(c) If the circuit C is displaced, the amount of the work done by the electro-dynamic action of C on C' will be equal to the increment of "flow of magnetic force" which passes through the circuit.
2. Action of a Closed Current on a Portion of Current. — Ampère being unable to produce the open current properly so called, had only one way of studying the action of a closed current on a portion of current. This was by operating on a circuit C composed of two parts, one movable and the other fixed. The movable part was, for instance, a movable wire αβ, the ends α and β of which could slide along a fixed wire. In one of the positions of the movable wire the end α rested on the point A, and the end β on the point B of the fixed wire. The current ran from α to β — i.e., from A to B along the movable wire, and then from B to A along the fixed wire. This current was therefore closed.
In the second position, the movable wire having slipped, the points α and β were respectively at A' and B' on the fixed wire. The current ran from α to β — i.e., from A' to B' on the movable wire, and returned from B' to B, and then from B to A, and then from A to A all on the fixed wire. This current was also closed. If a similar circuit be exposed to the action of a closed current C, the movable part will be dis placed just as if it were acted on by a force. Ampère admits that the force, apparently acting on the movable part A B, representing the action of C on the portion αβ of the current, remains the same whether an open current runs through αβ, stopping at α and β, or whether a closed current runs first to β and then returns to α through the fixed portion of the circuit. This hypothesis seemed natural enough, and Ampère innocently assumed it; nevertheless the hypothesis is not a necessity, for we shall presently see that Helmholtz rejected it. However that may be, it enabled Ampère, although he had never produced an open current, to lay down the laws of the action of a closed current on an open current, or even on an element of current. They are simple:
(1) The force acting on an element of current is applied to that element; it is normal to the element and to the magnetic force, and proportional to that component of the magnetic force which is normal to the element.
(2) The action of a closed solenoid on an element of current is zero. But the electro-dynamic potential has disappeared — i.e., when a closed and an open current of constant intensities return to their initial positions, the total work done is not zero.
3. Continuous Rotations. — The most remarkable electro-dynamical experiments are those in which continuous rotations are produced, and which are called unipolar induction experiments. A magnet may turn about its axis; a current passes first through a fixed wire and then enters the magnet by the pole N, for instance, passes through half the magnet, and emerges by a sliding con tact and re-enters the fixed wire. The magnet then begins to rotate continuously. This is Faraday's experiment. How is it possible? If it were a question of two circuits of invariable form, C fixed and C' movable about an axis, the latter would never take up a position of continuous rotation; in fact, there is an electro-dynamical potential; there must therefore be a position of equilibrium when the potential is a maximum. Continuous rotations are therefore possible only when the circuit C' is composed of two parts — one fixed, and the other movable about an axis, as in the case of Faraday's experiment. Here again it is convenient to draw a distinction. The passage from the fixed to the movable part, or vice versa, may take place either by simple contact, the same point of the movable part remaining constantly in contact with the same point of the fixed part, or by sliding contact, the same point of the movable part coming successively into con tact with the different points of the fixed part.
It is only in the second case that there can be continuous rotation. This is what then happens: — the system tends to take up a position of equilibrium; but, when at the point of reaching that position, the sliding contact puts the moving part in contact with a fresh point in the fixed part; it changes the connexions and therefore the conditions of equilibrium, so that as the position of equilibrium is ever eluding, so to speak, the system which is trying to reach it, rotation may take place indefinitely.
Ampère admits that the action of the circuit on the movable part of C' is the same as if the fixed part of C' did not exist, and therefore as if the current passing through the movable part were an open current. He concluded that the action of a closed on an open current, or vice versa, that of an open current on a fixed current, may give rise to continuous rotation. But this conclusion depends on the hypothesis which I have enunciated, and to which, as I said above, Helmholtz declined to subscribe.
4. Mutual Action of Two Open Currents. — As far as the mutual action of two open currents, and in particular that of two elements of current, is concerned, all experiment breaks down. Ampère falls back on hypothesis. He assumes: (1) that the mutual action of two elements reduces to a force acting along their join; (2) that the action of two closed currents is the resultant of the mutual actions of their different elements, which are the same as if these elements were isolated.
The remarkable thing is that here again Ampère makes two hypotheses without being aware of it. However that may be, these two hypotheses, together with the experiments on closed currents, suffice to determine completely the law of mutual action of two elements. But then, most of the simple laws we have met in the case of closed currents are no longer true. In the first place, there is no electro-dynamical potential; nor was there any, as we have seen, in the case of a closed current acting on an open current. Next, there is, properly speaking, no magnetic force; and we have above denned this force in three different ways: (1) By the action on a magnetic pole; (2) by the director couple which orientates the magnetic needle; (3) by the action on an element of current.
In the case with which we are immediately concerned, not only are these three definitions not in harmony, but each has lost its meaning: —
(1) A magnetic pole is no longer acted on by a unique force applied to that pole. We have seen, in fact, the action of an element of current on a pole is not applied to the pole but to the element; it may, moreover, be replaced by a force applied to the pole and by a couple.
(2) The couple which acts on the magnetic needle is no longer a simple director couple, for its moment with respect to the axis of the needle is not zero. It decomposes into a director couple, properly so called, and a supplementary couple which tends to produce the continuous rotation of which we have spoken above.
(3) Finally, the force acting on an element of a current is not normal to that element. In other words, the unity of the magnetic force has disappeared.
Let us see in what this unity consists. Two systems which exercise the same action on a magnetic pole will also exercise the same action on an indefinitely small magnetic needle, or on an element of current placed at the point in space at which the pole is. Well, this is true if the two systems only contain closed currents, and according to Ampère it would not be true if the systems contained open currents. It is sufficient to remark, for instance, that if a magnetic pole is placed at A and an element at B, the direction of the element being in AB produced, this element, which will exercise no action on the pole, will exercise an action either on a magnetic needle placed at A, or on an element of current at A.
5. Induction. — We know that the discovery of electro-dynamical induction followed not long after the immortal work of Ampère. As long as it is only a question of closed currents there is no difficulty, and Helmholtz has even remarked that the principle of the conservation of energy is sufficient for us to deduce the laws of induction from the electro-dynamical laws of Ampère. But on the condition, as Bertrand has shown, — that we make a certain number of hypotheses.
The same principle again enables this deduction to be made in the case of open currents, although the result cannot be tested by experiment, since such currents cannot be produced.
If we wish to compare this method of analysis with Ampère's theorem on open currents, we get results which are calculated to surprise us. In the first place, induction cannot be deduced from the variation of the magnetic field by the well-known formula of scientists and practical men; in fact, as I have said, properly speaking, there is no magnetic field. But further, if a circuit C is subjected to the induction of a variable voltaic system S, and if this system S be displaced and deformed in any way whatever, so that the intensity of the currents of this system varies according to any law whatever, then so long as after these variations the system eventually returns to its initial position, it seems natural to suppose that the mean electro-motive force induced in the current C is zero. This is true if the circuit C is closed, and if the system S only contains closed currents. It is no longer true if we accept the theory of Ampère, since there would be open currents. So that not only will induction no longer be the variation of the flow of magnetic force in any of the usual senses of the word, but it cannot be represented by the variation of that force whatever it may be.
II. Helmholtz's Theory. — I have dwelt upon the consequences of Ampère's theory and on his method of explaining the action of open currents. It is difficult to disregard the paradoxical and artificial character of the propositions to which we are thus led. We feel bound to think "it cannot be so." We may imagine then that Helmholtz has been led to look for something else. He rejects the fundamental hypothesis of Ampère — namely, that the mutual action of two elements of current reduces to a force along their join. He admits that an clement of current is not acted upon by a single force but by a force and a couple, and this is what gave rise to the celebrated polemic between Bertrand and Helmholtz. Helmholtz replaces Ampère's hypothesis by the following: — Two elements of current always admit of an electro-dynamic potential, depending solely upon their position and orientation; and the work of the forces that they exercise one on the other is equal to the variation of this potential. Thus Helmholtz can no more do without hypothesis than Ampère, but at least he does not do so without explicitly announcing it. In the case of closed currents, which alone are accessible to experiment, the two theories agree; in all other cases they differ. In the first place, contrary to what Ampère supposed, the force which seems to act on the movable portion of a closed current is not the same as that acting on the movable portion if it were isolated and if it constituted an open current. Let us return to the circuit C', of which we spoke above, and which was formed of a movable wire sliding on a fixed wire. In the only experiment that can be made the movable portion αβ is not isolated, but is part of a closed circuit. When it passes from AB to A'B', the total electro-dynamic potential varies for two reasons. First, it has a slight increment because the potential of A'B' with respect to the circuit C is not the same as that of AB; secondly, it has a second increment because it must be increased by the potentials of the elements AA' and B'B with respect to C. It is this double increment which represents the work of the force acting upon the portion AB. If, on the contrary, αβ be isolated, the potential would only have the first increment, and this first increment alone would measure the work of the force acting on AB. In the second place, there could be no continuous rotation without sliding contact, and in fact, that, as we have seen in the case of closed currents, is an immediate consequence of the existence of an electro-dynamic potential. In Faraday's experiment, if the magnet is fixed, and if the part of the current external to the magnet runs along a movable wire, that movable wire may undergo continuous rotation. But it does not mean that, if the contacts of the weir with the magnet were suppressed, and an open current were to run along the wire, the wire would still have a movement of continuous rotation. I have just said, in fact, that an isolated element is not acted on in the same way as a movable element making part of a closed circuit. But there is another difference. The action of a solenoid on a closed current is zero according to experiment and according to the two theories. Its action on an open current would be zero according to Ampère, and it would not be zero according to Helmholtz. From this follows an important consequence. We have given above three definitions of the magnetic force. The third has no meaning here, since an element of current is no longer acted upon by a single force. Nor has the first any meaning. What, in fact, is a magnetic pole? It is the extremity of an indefinite linear magnet. This magnet may be replaced by an indefinite solenoid. For the definition of magnetic force to have any meaning, the action exercised by an open current on an indefinite solenoid would only depend on the position of the extremity of that solenoid — i.e., that the action of a closed solenoid is zero. Now we have just seen that this is not the case. On the other hand, there is nothing to prevent us from adopting the second definition which is founded on the measurement of the director couple which tends to orientate the magnetic needle; but, if it is adopted, neither the effects of induction nor electro-dynamic effects will depend solely on the distribution of the lines of force in this magnetic field.
III. Difficulties raised by these Theories. — Helmholtz's theory is an advance on that of Ampère; it is necessary, however, that every difficulty should be removed. In both, the name "magnetic field" has no meaning, or, if we give it one by a more or less artificial convention, the ordinary laws so familiar to electricians no longer apply; and it is thus that the electro-motive force induced in a wire is no longer measured by the number of lines of force met by that wire. And our objections do not proceed only from the fact that it is difficult to give up deeply-rooted habits of language and thought. There is something more. If we do not believe in actions at a distance, electro-dynamic phenomena must be explained by a modification of the medium. And this medium is precisely what we call "magnetic field," and then the electro-magnetic effects should only depend on that field. All these difficulties arise from the hypothesis of open currents.
IV. Maxwell's Theory. — Such were the difficulties raised by the current theories, when Maxwell with a stroke of the pen caused them to vanish. To his mind, in fact, all currents are closed currents. Maxwell admits that if in a dielectric, the electric field happens to vary, this dielectric becomes the seat of a particular phenomenon acting on the galvanometer like a current and called the current of displacement. If, then, two conductors bearing positive and negative charges are placed in connection by means of a wire, during the discharge there is an open current of conduction in that wire; but there are produced at the same time in the surrounding dielectric currents of displacement which close this current of conduction. We know that Maxwell's theory leads to the explanation of optical phenomena which would be due to extremely rapid electrical oscillations. At that period such a conception was only a daring hypothesis which could be supported by no experiment; but after twenty years Maxwell's ideas received the confirmation of experiment. Hertz succeeded in producing systems of electric oscillations which reproduce all the properties of light, and only differ by the length of their wave — that is to say, as violet differs from red. In some measure he made a synthesis of light. It might be said that Hertz has not directly proved Maxwell's fundamental idea of the action of the current of displacement on the galvanometer. That is true in a sense. What he has shown directly is that electro-magnetic induction is not instantaneously propagated, as was supposed, but its speed is the speed of light. Yet, to suppose there is no current of displacement, and that induction is with the speed of light; or, rather, to suppose that the currents of displacement produce inductive effects, and that the induction takes place instantaneously — comes to the same thing. This cannot be seen at the first glance, but it is proved by an analysis of which I must not even think of giving even a summary here.
V. Rowland's Experiment. — But, as I have said above, there are two kinds of open conduction currents. There are first the currents of discharge of a condenser, or of any conductor whatever. There are also cases in which the electric charges describe a closed contour, being displaced by conduction in one part of the circuit and by convection in the other part. The question might be regarded as solved for open currents of the first kind; they were closed by currents of displacement. For open currents of the second kind the solution appeared still more simple.
It seemed that if the current were closed it could only be by the current of convection itself. For that purpose it was sufficient to admit that a "convection current" — i.e., a charged conductor in motion could act on the galvanometer. But experimental confirmation was lacking. It appeared difficult, in fact, to obtain a sufficient intensity even by increasing as much as possible the charge and the velocity of the conductors. Rowland, an extremely skilful experimentalist, was the first to triumph, or to seem to triumph, over these difficulties. A disc received a strong electrostatic charge and a very high speed of rotation. An astatic magnetic system placed beside the disc underwent deviations. The experiment was made twice by Rowland, once in Berlin and once at Baltimore. It was afterwards repeated by Himstedt. These physicists even believed that they could announce that they had succeeded in making quantitative measurements. For twenty years Rowland's law was admitted without objection by all physicists, and, indeed, everything seemed to confirm it. The spark certainly does produce a magnetic effect, and does it not seem extremely likely that the spark discharged is due to particles taken from one of the electrodes and transferred to the other electrode with their charge? Is not the very spectrum of the spark, in which we recognise the lines of the metal of the electrode, a proof of it? The spark would then be a real current of induction.
On the other hand, it is also admitted that in an electrolyte the electricity is carried by the ions in motion. The current in an electrolyte would therefore also be a current of convection; but it acts on the magnetic needle. And in the same way for cathodic rays; Crooks attributed these rays to very subtle matter charged with negative electricity and moving with very high velocity. He looked upon them, in other words, as currents of convection. Now, these cathodic rays are deviated by the magnet. In virtue of the principle of action and re-action, they should in their turn deviate the magnetic needle. It is true that Hertz believed he had proved that the cathodic rays do not carry negative electricity, and that they do not act on the magnetic needle; but Hertz was wrong. First of all, Perrin succeeded in collecting the electricity carried by these rays — electricity of which Hertz denied the existence; the German scientist appears to have been deceived by the effects due to the action of the X-rays, which were not yet discovered. Afterwards, and quite recently, the action of the cathodic rays on the magnetic needle has been brought to light. Thus all these phenomena looked upon as currents of convection, electric sparks, electrolytic currents, cathodic rays, act in the same manner on the galvanometer and in conformity to Rowland's law.
VI. Lorentz's Theory. We need not go much further. According to Lorentz's theory, currents of conduction would themselves be true convection currents. Electricity would remain indissolubly connected with certain material particles called electrons. The circulation of these electrons through bodies would produce voltaic currents, and what would distinguish conductors from insulators would be that the one could be traversed by these electrons, while the others would check the movement of the electrons. Lorentz's theory is very attractive. It gives a very simple explanation of certain phenomena, which the earlier theories — even Maxwell's in its primitive form — could only deal with in an unsatisfactory manner; for example, the aberration of light, the partial impulse of luminous waves, magnetic polarisation, and Zeeman's experiment.
A few objections still remained. The phenomena of an electric system seemed to depend on the absolute velocity of translation of the centre of gravity of this system, which is contrary to the idea that we have of the relativity of space. Supported by M. Crémieu, M. Lippman has presented this objection in a very striking form. Imagine two charged conductors with the same velocity of translation. They are relatively at rest. However, each of them being equivalent to a current of convection, they ought to attract one another, and by measuring this attraction we could measure their absolute velocity. "No!" replied the partisans of Lorentz. "What we could measure in that way is not their absolute velocity, but their relative velocity with respect to the ether, so that the principle of relativity is safe." Whatever there may be in these objections, the edifice of electro-dynamics seemed, at any rate in its broad lines, definitively constructed. Everything was presented under the most satisfactory aspect. The theories of Ampère and Helmholtz, which were made for the open currents that no longer existed, seem to have no more than purely historic interest, and the in extricable complications to which these theories led have been almost forgotten. This quiescence has been recently disturbed by the experiments of M. Crémieu, which have contradicted, or at least have seemed to contradict, the results formerly obtained by Rowland. Numerous investigators have endeavoured to solve the question, and fresh experiments have been undertaken. What result will they give? I shall take care not to risk a prophecy which might be falsified between the day this book is ready for the press and the day on which it is placed before the public.
[edit] Footnotes
- ↑ V. Past and Present, end of Chapter I., Book II. [TR.]
- ↑ Clerk Maxwell imagined some supernatural agency at work, sorting molecules in a gas of uniform temperature into (a) those possessing kinetic energy above the average, (b) those possessing kinetic energy below the average. [Tr.]
- ↑ This chapter is mainly taken from the prefaces of two of my books — Théorie Mathématique de la lumiére (Paris: Naud, 1889), and Électricité et Optique (Paris: Naud, 1901).
- ↑ We may add that U will depend only on the q parameters, that T will depend on them and their derivatives with respect to time, and will be a homogeneous polynomial of the second degree with respect to these derivatives.