The Relation of Mass to Energy
The Relation of Mass to Energy (1908) by 
Philosophical Magazine, January 1908, 6, 15 (85): 121, Online 
Contents
110[edit]
1. Whether the inertia of matter has or has not a complete electromagnetic explanation is a question that it will perhaps take many years to answer with any degree of certainty. The experiments of Kaufmann seem to prove that in the case of a single electron the mass is entirely of this origin; and it is impossible therefore to avoid the conclusion that at least a fraction of ordinary material inertia is also electromagnetic. Doubtless there is a psychological cause for our reluctance to accept the electromagnetic explanation as complete, constant familiarity with ponderable bodies having blinded us to the possibility of anything being more fundamental ; but certain it is, that if we free ourselves from prejudice as much as possible and adopt the welltried policy of choosing the simplest theory which adequately represents the phenomena, —the theory that is, which involves the least number of variables,— we must decide in favour of the complete electromagnetic explanation, which involves only the aether and its properties.
2. The complexity of the Zeeman effect and the relations between the wavelengths of the spectral lines, make it seem probable that if matter is to be considered as an electrical system, it must be much more complex than a system composed entirely of electrons separated by distances great in comparison to their size. It becomes therefore of interest to see whether any relations can be found between the mass of an electric system in general, and any of its other properties. It will be found that a general relation does exist, which is not only of considerable interest in itself, but also suggests other relations.
3. The straightforward calculation of the mass of an electric system possessing any distribution of charge and any internal velocities below that of light presents considerable difficulty; for such calculation involves the use of the scalar and vector potential, and these are not effective instantaneously at all parts of the system. Any expression for the mass of the system calculated in this way will therefore involve terms which vary in an extremely complicated way with the internal velocities when these are not very small. The same is true with respect to the velocity of the system as a whole. In the following discussion the problem is attacked in an entirely different way, which is not open to this objection.
As the constraints of the system are intimately involved, it will be well first to consider them.
4. The position of internal constraints in general electrical theory is a very fundamental one. By "constraints" are meant rigid connexions of any kind. These act merely as reactions to the electrical forces, and do not contribute to the virtual work. If the electrical laws are to hold universally, i. e., for minute distances as well as for greater ones, it is obvious that no electrical system can exist as such unless there are such constraints to balance the electrical forces. Even a single electron would dissipate itself through the mutual repulsion of its elements, were it not for some form of internal constraint. Besides holding the system together, as it were, these constraints also act in another important way. They may become, in common with all geometrical constraints, paths of energy flow. We are accustomed to think of the Poynting vector as representing completely the energy flow in a purely electrical system, but of course this is not in general true.
Take as a simple example the case of a large plane aircondenser moving in a direction perpendicular to the plane of its plates. If the condenser is charged there is obviously a transference of energy at a rate equal to the internal energy multiplied by the velocity of movement. The Poynting vector is, however, zero. The energy transfer is not through space in the ordinary sense, but is along the constraint which holds the condenserplates apart. The plate in the rear picks up, as we may say, the energy of the field, and after it has been transmitted to the forward plate by means of the constraint it is there set down again. On the other hand, when the condenser is moving parallel to the plane of the plates, there is no energy How along the constraint and the Poynting vector adequately represents the transfer of energy. So also in the case of a single moving electron, the rate of transfer of energy is not given by the integration of the Poynting vector through all space, but differs from this by an amount corresponding to the energyflow along the constraints in the body of the electron. This does not mean that there is any energy associated with the constraints, for of course rigid constraints can neither absorb nor give out energy; there is no storing up, but merely a transfer.
5. It is not difficult to find an expression for this rate of transfer. If the constraint is a simple linear one the transfer of energy along its direction is evidently
,
where () is the length of the constraint, () the velocity with which it is moving along its length, and () the tension along it. The amount of energy () per sec. is put at the forward end, and is instantly available at the rear end at a distance (). If the velocity () makes an angle () with the constraint, , and the transfer in the direction of is
.
Another type of transfer enters when there is shearing stress in the constraint, a transfer that is which is in a direction perpendicular to that of velocity. It must be remembered that, the constraints are described in no way except geometrically.
If we consider therefore the general case where the stress in the body of the constraint is represented mathematically by the nine stresses commonly used in the theory of elasticity, namely, , then there is a rate of transfer of energy in the xdirection through unit volume given by
.
,
where () is the element of volume.
In like manner the velocity () and the shearing stress () cause energy to be taken up at the surface () and given out at the surface (), and we have the rate of flow along the xaxis
;
and finally the velocity () and the shearing stress () give
.
Hence adding we have, if we call () the density of flow along ,
.
Obtaining the corresponding equations in similar way we have finally for the three components of the density of energyflow along the constraints in any system
(1) 
(2) 
These equations give the density of the total energyflow through any purely electrical system, in which the ordinary electrical laws hold universally.
6. Consider an isolated electrical system moving as a whole through space with the constant velocity (). A constant velocity will be possible if the system retains on the average the same internal structure. The total average rate of transfer of energy corresponding to the movement of such a system is evidently (), where is the total contained energy. Another expression for the same thing is to be obtained by integrating throughout the system the components along ( of ) given in equations (2). In order that the velocity () may appear explicitly, however, it is necessary that the velocity (), which was used in equations (2), be written as the sum of () and another velocity (). Then () is the velocity with respect to axes moving with the system.
If are the direction cosines of the constant velocity (), we have for the total energyflow () in the direction of (),
(3) 
The electromagnetic momentum corresponding to any electrical system is given by the components
(4) 
which, except for the factor , are the same as the integrals of the components of the Poynting vector throughout the system. Hence equation (3) may be written
(3A) 
Also if the electrical system here dealt with is to represent a material body, we may assume that the resultant momentum () is in the direction of the velocity, and hence
.
This may be considered as due to the fact that the lack of symmetry necessarily involved in the intimate structure of any electromagnetic system has become a symmetrical average in particles large enough to be dealt with. This symmetrical point may of course have been reached in the case of single atoms. We may now write (3A) in the form
(5) 
7. To reduce this expression further requires some relation to be established between the stresses and the electric and magnetic force intensities. This process is closely analogous to the derivation of the Maxwell stress in the free aether except that we here have to deal with, besides the forces in the constraint, only the electromagnetic force on electricity embedded in the constraint, and we have nothing to do with hypothetical stresses in the free aether.
If () represents the electric density and () the component of the total electromagnetic force on unit charge embedded in the constraint, we have
,  (6) 
where () is the density of convection current caused by the movement with velocity () of the electricity of density ().
Making use temporarily of the vector terminology for the sake of brevity and calling the electric force (), the magnetic (), and the sign [ ] denoting the vector product, we have
.
Since div and ,
.  (7) 
Now it is an easily verifiable identity that
;  (8) 
and hence, remembering that div , equation (7) becomes
(9) 
,
the last two terms in the bracket of (9) become together
,
which is minus the time rate of the density of momentum at the point. The time rate, however, refers to a point fixed in space, and to change to a point moving with the system we make use of the usual expression and write
,  (10) 
where is the component of the density of momentum and () are, as formerly, the direction cosines of the constant velocity (). The operator now refers to the rate of change at a point moving with the system.
Substituting (10) for the two last terms in (9) and noticing that
may be written ,
where the integration is to be taken from (meaning merely from a point outside the system where is zero) up to the point in question, account being taken of any discontinuity at the bounding surface, we have in place of (9)
(11) 
This equation gives us what we were seeking, namely, the values of , and in terms of the electric and magnetic forces and the density of the momentum.
Thus we may take
(12) 
Similar values are of course to be found for the other six components of stress in the constraint .
8. The values for these nine stresscomponents are now to be substituted in equation (5). In doing this it is to be noticed that the last term in each of equations (12) will, after substitution, furnish a term of the type
,  (13) 
and this, being a time derivative, gives an average value of zero when the time is allowed to increase indefinitely, since all quantities in the system remain finite. Also
.
Making the substitution in (5) and simplifying, we have as the value for ()
. 
(14) 
Now the first integral represents the total included energy, the two parts of the second integral represent the squares of the components of the electric and magnetic forces in the direction of the motion of the system, and the last integral represents the momentum in the direction of motion, which in this case is the whole momentum , since we have assumed that and are in the same direction.
Calling
the transverse energy of the system, we may rewrite equation (14) as
,  (15) 
and hence
(16) 
This gives the total momentum of any isolated, purely electrical system, which has on the average the same internal structure, in terms of its transverse energy, i. e., the energy represented by the components of the electric and magnetic forces which are perpendicular to the velocity of the system. The mass of the system is then
(17) 
If, as we have assumed in deriving this expression, the system possesses the same momentum for uniform translation in any direction, this formula for the mass can contain terms of even powers only in the ratio of the velocity of the system to the velocity of light. If we neglect terms of the second and higher orders has the same value as for , which from symmetry of the system must be twothirds the total energy . Therefore
,  (18) 
if second order terms he neglected. This formula would apply with extreme accuracy for the electromagnetic mass of ponderable bodies, for no such bodies have in nature a velocity large enough to make appreciable.
It should be noticed that in equation (17) second order terms may enter in either () or its derivative with respect to . In fact such terms do enter for two reasons. In the first place, the setting of the body in motion requires work and hence adds new energy, through a second order term; and secondly there is an effect due to the change which motion causes in the velocity of propagation through the system of electrical disturbances. This is seen in the simple case of a moving electron where the crowding of the lines towards the equator with increase of velocity is only partly due to the added energy. It is evident, therefore, that for velocities so great that the second order terms cannot be neglected, the mass depends on complicated terms which vary with the internal structure and motions of the system, and it does not appear as if a general expression for the mass of a system for such high velocities could be found.
The second order terms may in the future make themselves experimentally manifest through an increase of mass of rapidly moving aparticles.
9. Expression (18) may readily be verified for simple symmetrical systems. For a single charged conducting sphere of radius (a) the mass for slow velocities is wellknown to be
(e. Potential) .
An interesting verification of equation (16) for the special case of a general, rigid, electrostatic system in translatory motion has been furnished me privately by Mr. G. F. C. Searle. He obtains for such a system (Phil. Mag. Jan. 1907, p. 129) the expression
,  (19) 
where is the momentum of the entire system along the direction () of motion, () is the total magnetic energy due to this motion, and () is the common translatory velocity possessed by all parts of the system.
Now it is well known that where the Faraday tubes move through space uniformly, as in the present case, the magnetic force () is given in terms of the electric force () by the expression
,
() representing the angle between () and the velocity of motion (), and () being in a direction perpendicular both to () and (). In the present notation
,
and hence we have
.  (20) 
(21) 
and remembering that
we have finally
(22) 
which, since () is along (), is identical with (16). Thus (16) is verified for the case where the moving system possesses no internal motion.
Perhaps the simplest symmetrical system containing magnetic as well as electric energy is that formed by a great, number of charged spheres moving in straight lines out from a common centre, with velocities small enough so that the fourth and higher powers may be neglected. They are to he at distances from each other great in comparison with their size, and at equal distances from the common centre. If the system be now given the slow velocity () as a whole the total momentum accompanying this motion may be determined. Because of limited space the calculation will not be here given, but if it be carried out along established lines it will be found that the mass is times the sum of the electric and magnetic energies, thus verifying equation (18).
10. We conclude that, if ordinary material mass has an electromagnetic basis, such mass for slow velocities is proportional to the total electromagnetic energycontent of the body, and the laws of conservation of mass and energy become closely related if not identical. In any case the expression given represents the electromagnetic part of the total mass whatever that may be.
Considerations suggested by the Foregoing.[edit]
The Atomic Weights.[edit]
11. If the conclusion of the last article is correct a diminution in mass should follow a loss of energy in material transformations. Calculation shows, however, that in the case of the powerful reaction between hydrogen and oxygen forming water, the change of mass would only he of the order gram. In the case of radioactivity, however, the energy change is very much greater and an appreciable effect is to be expected. Thus if a radium atom gives off an particle of mass () with velocity (), then there should be a diminution in the sum of the masses of the \alpha particle and the remaining atom equal to
,
since represents the energy lost, and this, calling (using gramatomic weight) and , gives
;
an amount large enough to cause discrepancies in calculating the atomic weights of radioactive substances from the number of particles lost. Since (Mass) is proportional to the square of the velocity of the particle, its value would be greatly increased by a slight error in the determination of () and the effect could easily be much larger.
12. A consideration of some interest is the following. If we adopt the disintegration theory, we are obliged to think of the various atoms as combinations or groups, more or less modified, of the lighter atoms. If there were perfect conservation of mass this would introduce a certain uniformity in the relations between the atomic weights, a uniformity which apparently does not exist. On the other hand, if we take into consideration the inevitable change of mass when the electromagnetic energy of the system is modified, the atomic weights will involve a correction term depending upon the change in this energy, and hence they will no longer bear simple, exact relations to each other. In a highly important paper (Zeitschrift fur Anorg. Chemie, xiv. p. 66, 1897) Rydberg has shown that the atomic weights of the first twenty seven elements of the periodic system approximate to whole numbers very much more closely than chance could bring about. He has also shown that the atomic weights of these elements are best considered as the sum of two parts () where is an integer and is a fraction, in general positive and smaller than unity. If is the number of the element in the system (called by Rydberg the "Ordnungszahl"), then is equal to for the elements of even valence and for the elements of odd valence. Below is given a table showing the various quantities. I have used, however, the International Atomic Weight values for 1907 instead of those Rydberg used.
Sign  M. 

Atomic Weight 
D.  Sign.  M. 

Atomic Weight 
D.  

He Li 
2 3 
4 8 
7 11 
4 7,03 
– ,03 
P S 
15 16 
32 36 
31 35 
31,0 32,06 
,0 ,06 
The orderly arrangement of the series is striking. It will be noticed that in three cases only are the 's greater than unity and only in two cases are they negative.
Rydberg points out that although the heavier elements do not conform well to this scheme, i. c., do not in general give the small fractional values of () noticed above, yet this is in reality no valid objection, for the numerical values of the weights of heavier elements depend much more on the value of the arbitrary unit chosen than do those of the lighter weight elements, and hence they can have little influence one way or the other in estimating the validity of the curious relations he sets forth.
The whole question is of course whether these differences represent real physical deviations from something or whether they are merely mathematical remainders. Rydberg certainly believes them to represent physical realities, and considering the beforementioned overwhelming improbability that the approximation of the atomic weights to whole numbers is due to chance, we can hardly doubt that he is right.
13. Now it is to be noticed that these deviations find a ready explanation when the conclusions of the present paper are combined with the theory, so much favoured recently, that one element breaks down into two or more others with an accompanying expulsion of energy. The deviations are then to be explained as resulting from loss of mass accompanying the dissipation of energy. On the other hand, if no such loss of mass takes place, the existence of these deviations in the table of atomic weights becomes a wellnigh insuperable difficulty in the path of the evolutionary theory of the elements.
If we follow the present suggestion, we must search for the components of an element, not by comparing atomic weights, but by comparing the corresponding values of , for the atomic weights deviate because of the lost mass accompanying the dissipation of internal energy.
Very recently Sir W. Ramsay has announced several striking discoveries which seem to add much weight to the disintegration theory, and, indirectly, to the views here set forth. Ho found helium, neon, or argon appearing as a product of radium emanation according to the exterior conditions imposed, and he found lithium appearing when a coppersulphate solution was left in the presence of the emanation. Prof. Ramsay states. I believe, that every source of error was eliminated and that the results were obtained many times.
14. It should be noticed that this theory of loss of mass and its consequences does not require that the whole material mass should be of electromagnetic nature. It only requires that the energy lost in the transformations, explosive or otherwise, should be at the expense of internal electromagnetic energy, i. e., that the forces which expel the particles should be electric or magnetic.
Respecting gravitation.[edit]
15. The experiments of many investigators have shown that up to a high degree of accuracy the ratio of mass to weight for different substances is the same. Now if the mass is proportional to the internal energy as here suggested, instead of being proportional to the number of electric nuclei as might be supposed, the conclusion is apparently forced upon us that gravitational attraction is between quantities of confined energy, and not between quantities of "matter" in any other sense.
On this basis, the weight of a calorie at the earth's surface would be of order dyne. This is apparently too small to explain the temperature gradient in the earth although the calculation, depending as it docs on the mechanical force on confined energy due to a temperature gradient, would certainly depend to a large degree on the medium.
If we assume this gravitational effect, it is interesting to ask whether free energy would also show an attraction for itself. If so, the energy radiated from a gravitational centre like the sun would leave some of itself behind along its path as it moved through space, and it might be possible to account in this way for some of the energy which is ordinarily thought of as totally dissipated.
Another conclusion which is suggested by the foregoing is that, assuming the loss of mass accompanying dissipation of energy, the sun's mass must have decreased steadily through millions of years. If too, our conclusion respecting the gravitating quality of confined energy be correct, the gravitation constant of the sun has also decreased and the distances of the planets must have increased accordingly. This last increase of planetary distance can be calculated by making the angular momentum of the planet about the sun a constant, and allowing the mass of the planet, together with the gravities of both sun and planet, to grow less with time.
So little is known as to the former radiating power of the sun that no even approximate calculation can be made, but it is not difficult to show that the order of magnitude is such as might make the increase in the planetary distances not altogether negligible during great lapses of time.
A Proof from a different Point of View.[edit]
16. The proof of expression (17) which has been given has the advantage of entering intimately into the structure of the general system and showing the part that nonelectrical forces in the form of constraints must play if the fundamental laws of electrical action are to hold for every infinitesimal element of the finite volume occupied by any electrical system. Although this is assumed in every mathematical derivation of the mass of an electron, and in fact in all problems of a similar nature, many will doubtless object to this assumption on the ground that probably the ordinary electrical laws do not apply when the distance between "elements of charge," so called, is comparable with the diameter of an electron.
Although it is difficult to see how a coherent mathematical theory of electricity can at present be formed without this assumption, yet it was thought best to add a more general proof of (17). The following is therefore given as avoiding the explicit use of constraints.
17. The statement of the law of the conservation of energy for an element of volume in any electrical system possessing electrical charges in motion, is the wellknown expression
(23) 
,
where , and , are the electric and magnetic force intensities respectively, and
,
,
.
Equation (23) states merely that the rate of increase of energy in an elementary volume is equal to the activity of any foreign (i. e., nonelectrical) forces which may act therein minus the outward flow of energy.
Now suppose we consider an electromagnetic system bounded by a rigid surface (), which moves uniformly through space with the velocity () along the axis of (); and further suppose that the volume inside this closed surface is divided into two parts by the plane partition () which is perpendicular to the xaxis and which, although fixed in the moving system, coincides at a given instant with the plane () fixed in space. If this system be considered as isolated, then no disturbance passes through the bounding surface ().
In equation (23) the time derivative of the energydensity refers to a point fixed in space, and if we wish it to refer to a point moving with the system we must write as usual
,  (24) 
where now means the rate of change measured from the moving point. Likewise, if we wish the velocities which enter into (23) to be expressed in terms of velocities relative to axes moving with the system, we must write
(25) 
where , and are the components of these relative velocities.
Substituting (24) and (25) in (23), and remembering the simple proportionality between , and and the density of momentum , and , we easily obtain
. 
(26) 
Now () may be expressed in terms of the electric and magnetic force intensities, together with the density of the momentum. This involves only the fundamental equations of electromagnetic theory and has been done in paragraph 7, reference to which will show that with the present notation
. 
(27) 
Substituting this for the () which occurs on the lefthand side of (26), rearranging the latter, and putting
and
,
(28) 
Now if this expression is integrated through the part of the volume () which lies on the side () of the partition (), the terms on the left of (28) become equal to
,
where the integral is taken over the part of the plane () which is included in the surface. This follows from the fact that the rest of the surface of part belongs to the surface () and outside of () there is no disturbance whatever. The terms on the right (of 28) give an average value of zero. This last will be evident if they are considered separately. The first two give directly a time average of zero, after great elapse of time, since neither () nor () ever becomes infinite. The terms involving the () and () derivatives when integrated may be written as surface integrals over the bounding surface (of part ), and they then represent the flux of energy through this surface in a direction perpendicular to (), i. e., in a direction perpendicular to the axis. This flux being everywhere zero over the surface, these terms vanish.
The terms involving also give a time average of zero because they represent what might be called the internal activity of the forces which act on the charges in the part of the volume (), and since this part is isolated on all sides except on the side ( the activity of these forces really means the rate at which the part () by means of them is doing work on . In the long run 's work on must be equal to 's work on , if the system is to be conservative and the internal motions are to be stationary.
Thus we learn finally from equation (28) that the average value of
,
where the integral is taken over the enclosed part of the plane (). Since this is true for any position of the plane () so long as it is perpendicular to () we evidently have on the average, integrating along () throughout the entire system,
,  (29) 
This gives, using the former notation, and remembering that on the average the internal structure is assumed to remain the same,
(30) 
which, since () is here along (), is precisely the result of equation (16), and becomes (17) on differentiation.
Conclusion.[edit]
It has been shown in the foregoing that the electromagnetic mass of an isolated, symmetrical, purely electric system possessing any structure which on the average remains the same, and any internal motions or constraints, is expressible in terms of its velocity as a whole through space together with its "transverse energy" and the derivative of the latter with respect to the velocity. If secondorder terms in the velocity be neglected, the mass is a simple constant multiplied by the total included electromagnetic energy.
If the mass of ponderable bodies has an electromagnetic origin, then the inertia of matter is to be considered merely as a manifestation of confined energy. From this point of view, matter and energy are thus very closely related and the laws of the conservation of mass and energy become practically identical.
It has been pointed out that the loss of mass, inevitable on this view, which takes place when energy is lost to the system, is large enough to be detected in the case of radioactive changes. If we assume the disintegration theory of the elements, this loss of mass affords a ready explanation of the general, small irregularities to be found in the list of atomic weights, and thus removes a serious difficulty from the path of the disintegration theory. For this loss of mass to take place however, it is not necessary that the whole of the mass be electromagnetic.
It has been shown that if material mass be electromagnetic and if lighter elements are formed from heavier ones through violent energy changes, it follows that gravity acts between quantities of confined energy and not between masses in any other sense. Several speculations are indulged in as to the results of assuming gravitation between quantities of energy.
Finally, the fundamental proposition is dealt with mathematically from an entirely different point of view and the same result obtained.
In conclusion I wish to express my thanks to Prof. J. J. Thomson and to Mr. G. F. C. Searle for several valuable criticisms and suggestions.
Cambridge, England,
 August 14th, 1907.
This work is in the public domain in the United States because it was published before January 1, 1923.
The author died in 1970, so this work is also in the public domain in countries and areas where the copyright term is the author's life plus 30 years or less. This work may also be in the public domain in countries and areas with longer native copyright terms that apply the rule of the shorter term to foreign works. 