On Einstein's Theory of gravitation

On Einstein's Theory of gravitation I-IV  (1916) 
by Hendrik Lorentz
Proceedings of the Royal Netherlands Academy of Arts and Sciences, 1917, 19 (2):1341-1361, 20 (1):2-34

"On Einstein's Theory of gravitation." By Prof. H. A. Lorentz.

I.

(Communicated in the meeting of February 26, 1916).


§ 1. In pursuance of his important researches on gravitation Einstein has recently attained the aim which he had constantly kept in view; he has succeeded in establishing equations whose form is not changed by an arbitrarily chosen change of the system of coordinates[1]. Shortly afterwards, working out an idea that had been expressed already in one of Einstein's papers, Hilbert[2] has shown the use that may be made of a variation law that may be regarded as Hamilton's principle in a suitably generalized form. By these results the "general theory of relativity" may be said to have taken a definitive form, though much remains still to be done in further developing it and in applying it to special problems. It will also be desirable to present the fundamental ideas in a form as simple as possible.

In this communication it will be shown that a four-dimensional geometric representation may be of much use for this latter purpose; by means of it we shall be able to indicate for a system containing a number of material points and an electromagnetic field (or eventually only one of these) the quantity H, which occurs in the variation theorem, and which we may call the principal function. This quantity consists of three parts, of which the first relates to the material points, the second to the electromagnetic field and the third to the gravitation field itself.

As to the material points, it will be assumed that the only connexion between them is that which results from their mutual gravitational attraction.


§ 2. We shall be concerned with a four-dimensional extension R_{4} , in which "space" and "time" are combined, so that each point P in it indicates a definite place A and at the same time a definite moment of time t. If we say that P refers to a material point we mean that at the time t this point is found at the place A. In the course of time the material point is represented every moment by a new point P; all these points lie on the "world-line", which represents the state of motion (or eventually the state of rest) of the material point[3]. In the same sense we may speak of the world-line of a propagated light-vibration. An intersection of two world-lines means that the two objects to which they belong meet at a certain moment, that a "coincidence" takes place[4]. Now Einstein has made the striking remark[5] that the only thing we can learn from our observations and with which our theories are essentially concerned, is the existence of these coincidences. Let us suppose e.g. that we have observed an occultation of a star by the moon or rather the reappearance of a star at the moon's border. Then the world-line of a certain light-vibration starting from a point on the world-line of the star has in its further course intersected the world-line of a point of the border of the moon and finally that of the observer's eye. A similar remark may be made when the moment of reappearance is read on a clock. Let us suppose that the light-vibration itself lights the dial-plate, reaching it when the hand is at the point a; then we may say that three world-lines, viz. that of the light-vibration, that of the hand and that of the point a intersect.


§ 3. We may imagine that, in order to investigate a gravitation field as e.g. that of the sun, a great number of material points, moving in all directions and with different velocities, are thrown into it, that light-beams are also made to traverse the field and that all coincidences are noted[6]. It would be possible to represent the results of these observations by world-lines in a four-dimensional figure — let us say in a "field-figure" — the lines being drawn in such a way that each observed coincidence is represented by an intersection of two lines and that the points of intersection of one line with a number of the others succeed each other in the right order.

Now, as we have to attend only to the intersections, we have a great degree of liberty in the construction of the "field-figure". If, independently of each other, two persons were to describe the same observations, their figures would probably look quite different and if these figures were deformed in an arbitrary way, without break of continuity, they would not cease to serve the purpose.

After having constructed a field-figure F we may introduce "coordinates", by which we mean that to each point P we ascribe four numbers x_{1},x_{2},x_{3},x_{4}, in such a way that along any line in the field-figure these numbers change continuously and that never two different points get the same four numbers. Having done this we may for each point P seek a point P' in a four-dimensional extension R'_{4} , in which the numbers x_{1},\dots x_{4} ascribed to P are the Cartesian coordinates of the point P'. In this way we obtain in R'_{4} a figure F', which just as well as F can serve as field-figure and which of course may be quite different according to the choice of the numbers x_{1},\dots x_{4} , that have been ascribed to the points of F.

If now it is true that the coincidences only are of importance it must be possible to express the fundamental laws of the phenomena by geometric considerations referring to the field-figure, in such a way that this mode of expression is the same for all possible field-figures; from our point of view all these figures can be considered as being the same. In such a geometric treatment the introduction of coordinates will be of secondary importance; with a single exception (§ 13) it only serves for short calculations which we have to intercalate (for the proof of certain geometric propositions) and for establishing the final equations, which have to be used for the solution of special problems. In the discussion of the general principles coordinates play no part; and it is thus seen that the formulation of these principles can take place in the same way whatever be our choice of coordinates. So we are sure beforehand of the general covariancy of the equations that was postulated by Einstein.


§ 4. Einstein ascribes to a line-element PQ in the field-figure a length ds defined by the equation

\begin{array}{c}
ds^{2}=\sum(ab)g_{ab}dx_{a}dx_{b}\\
\\
\left(g_{ab}=g_{ba}\right)
\end{array} (1)

Here dx_{1},\dots dx_{4} are the changes of the coordinates when we pass from P to Q, while the coefficients g_{ab} depend in one way or another on the coordinates. The gravitation field is known when these 10 quantities are given as functions of x_{1},\dots x_{4} . Here it must be remarked that in all real cases the coordinates can be chosen in such a way that for one point arbitrarily chosen (1) becomes

ds^{2}=-dx_{1}^{2}-dx_{2}^{2}-dx_{3}^{2}+dx_{4}^{2}

This requires that the determinant g of the coefficients of (1) be always negative. The minor of this determinant corresponding to the coefficient g_{ab} will be denoted by G_{ab} .

Around each point P of the field-figure as a centre we may now construct an infinitesimal surface[7], which, when P is chosen as origin of coordinates, is determined by the equation

\sum(ab)g_{ab}x_{a}x_{b}=\epsilon^{2} (2)

where \epsilon is an infinitely small positive constant which we shall fix once for all. This surface, which we shall call the indicatrix, is a hyperboloid with one real axis and three imaginary ones. We shall also introduce the surface determined by the equation

\sum(ab)g_{ab}x_{a}x_{b}=-\epsilon^{2} (3)

which differs from (2) only by the sign of \epsilon^{2} . We shall call this the conjugate indicatrix. It is to be understood that the indicatrices and conjugate indicatrices take part in the changes to which the field-figure may be subjected. As these surfaces are infinitely small, they always remain hyperboloids of the said kind. The gravitation field will now be determined by these indicatrices, which we can imagine to have been constructed in the field-figure without the introduction of coordinates. When we have occasion to use these latter, we shall so choose them that the "axes" x_{1},x_{2},x_{3} intersect the conjugate indicatrix constructed around their starting point, while the indicatrix itself is intersected by the axis x_{4} . This involves that the coefficients g_{11},g_{22},g_{33} are negative and that g_{44} is positive.


§ 5. The indicatrices will give us the units in which we shall express the length of lines in the field-figure and the magnitude of two-, three or four-dimensional extensions. When we use these units we shall say that the quantities in question are expressed in natural measure.

In the case of a line-element PQ the unit might simply be the radius-vector in the direction PQ of the indicatrix or the conjugate indicatrix described about P. It is however desirable to distinguish the two cases that PQ intersects the indicatrix itself or the conjugate indicatrix. In the latter case we shall ascribe an imaginary length to the line-element[8]. Besides, by taking as unit not the radius-vector itself but a length proportional to it, the numerical value of a line-element may be made to be independent of the choice of the quantity \epsilon .

These considerations lead us to define the length that will be ascribed to line-elements by the assumption that each radius-vector of the indicatrix has in natural measure the length \epsilon , while each radius-vector of the conjugate indicatrix has the length i\epsilon .[9]

It will now be clear that the length of an arbitrary line in the field-figure can be found by integration, each of its elements being measured by means of the indicatrix or the conjugate indicatrix belonging to the position of the element. In virtue of our definitions a deformation of the field-figure will not change the length of lines expressed in natural measure and a geodetic line will remain a geodetic line.
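The rule of § 5 can be tried out numerically. The following is only a sketch under a strong assumption: the coefficients g_{ab} are taken constant and equal to -1, -1, -1, +1, the special form quoted in § 4, rather than those of a real gravitation field. The names `ds_squared`, `natural_length` and `line_length` are illustrative and do not occur in the paper.

```python
import cmath

# Assumed constant coefficients g_ab: g11 = g22 = g33 = -1, g44 = +1 (see § 4).
G_FLAT = [
    [-1.0, 0.0, 0.0, 0.0],
    [0.0, -1.0, 0.0, 0.0],
    [0.0, 0.0, -1.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
]

def ds_squared(g, dx):
    """Quadratic form (1): sum over a and b of g_ab dx_a dx_b."""
    return sum(g[a][b] * dx[a] * dx[b] for a in range(4) for b in range(4))

def natural_length(g, dx):
    """Length in natural measure: real when the element cuts the indicatrix,
    positive imaginary when it cuts the conjugate indicatrix."""
    return cmath.sqrt(ds_squared(g, dx))

def line_length(g, points):
    """Add the natural lengths of the straight elements joining successive points."""
    total = 0j
    for p, q in zip(points, points[1:]):
        total += natural_length(g, [q[a] - p[a] for a in range(4)])
    return total

# An element cutting the indicatrix and one cutting the conjugate indicatrix.
real_element = natural_length(G_FLAT, [0.6, 0.0, 0.0, 1.0])
imaginary_element = natural_length(G_FLAT, [1.0, 0.0, 0.0, 0.0])

# A straight world-line built from five equal elements.
world_line = [[0.6 * k, 0.0, 0.0, 1.0 * k] for k in range(6)]
world_line_length = line_length(G_FLAT, world_line)
```

With these assumed coefficients the first element comes out real and the second purely imaginary, in agreement with the distinction drawn above; the length of the world-line is the sum of the lengths of its elements.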


§ 6. We are now in a position to indicate the first part H_{1} of the principal function (§ 1). Let \sigma be a closed surface in the field-figure and let us confine ourselves to the principal function so far as it belongs to the space \Omega enclosed by that surface. Then the quantity H_{1} is the sum, taken with the negative sign, of the lengths of all world-lines of material points so far as they lie within \Omega , each length multiplied by a constant m, characteristic of the point in question and to be called its mass.[10]

It must be remarked that the elements of the world-lines of material points intersect the corresponding indicatrices themselves. The lengths of these lines are therefore real positive quantities.

A deformation of the field-figure leaves H_{1} unchanged.


§ 7. We shall now pass on to the part of the principal function belonging to the gravitation field. The mathematical expression for this part was communicated to me by Einstein in our correspondence. It is also to be found in Hilbert's paper in which it is remarked that the quantity in question may be regarded as the measure of the curvature of the four-dimensional extension to which (1) relates. Here we have to speak only of the interpretation of this quantity. To find this the following geometrical considerations may be used.

Let PQ and PR be two line-elements starting from a point P of the field-figure, QR the line-element joining the extremities Q and R. If then the lengths of these elements in natural measure are

PQ=ds',\ PR=ds'',\ QR=ds

we define the angle (s',s'') between PQ and PR by the well known trigonometric formula

\begin{array}{c}
ds^{2}=ds'^{2}+ds''^{2}-2ds'ds''\cos(s',s'')\\
\\
\cos(s',s'')=\frac{ds'^{2}+ds''^{2}-ds^{2}}{2ds'ds''}
\end{array} (4)

from which one can derive

\cos(s',s'')=\sum(ab)g_{ab}\frac{dx'_{a}}{ds'}\frac{dx''_{b}}{ds''} (5)

By means of this formula we are able to determine the angle between any two intersecting lines. Of course the two other angles of the triangle PQR can be calculated in the same way.
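The step from (4) to (5) may be sketched as follows. It is assumed here that the coefficients g_{ab} may be treated as having the same values at P, Q and R, which is legitimate to the order considered for infinitely small elements. The components of QR are then dx''_{a}-dx'_{a}.

```latex
\begin{array}{c}
ds^{2}=\sum(ab)g_{ab}\left(dx''_{a}-dx'_{a}\right)\left(dx''_{b}-dx'_{b}\right)\\
\\
ds^{2}=ds''^{2}+ds'^{2}-2\sum(ab)g_{ab}dx'_{a}dx''_{b}\qquad\left(g_{ab}=g_{ba}\right)\\
\\
ds'^{2}+ds''^{2}-ds^{2}=2\sum(ab)g_{ab}dx'_{a}dx''_{b}
\end{array}
```

Dividing the last line by 2ds'ds'' and comparing with the second equation of (4) gives (5).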

Now two cases must be distinguished.

a. The plane of the triangle PQR cuts the conjugate indicatrix, but not the indicatrix itself. Then the three sides have positive imaginary values. Moreover each of them proves to be smaller than the sum of the others, from which one finds that the angles have real values and that their sum is \pi .

b. The plane PQR cuts both the indicatrix and the conjugate indicatrix. In this case different positions of the triangle are still possible. We can however confine ourselves to triangles the three sides of which are real. These are really possible, for in the plane of a hyperbola we can draw triangles the sides of which are parallel to radius-vectors drawn from the centre to points of the curve (and not of the conjugate hyperbola).

By a closer consideration of the triangles now in question it is found however that by the choice of our "natural" units one side is necessarily longer than the sum of the other two. Formula (4) then shows that the cosines of the angles are real quantities, greater than 1 in absolute value, two of them being positive, and the third negative. We must therefore ascribe to the angles imaginary or complex values. If for p>+1 we put

\arccos p=i\log\left(p+\sqrt{p^{2}-1}\right)

and


\arccos(-p)=\pi-\arccos p

we find for the three angles expressions of the form

i\alpha,\ i\beta and \pi-i(\alpha+\beta)

so that the sum is again \pi .
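The statement about the sum of the angles in case b can be checked on a numerical example. The sketch below assumes a plane with coordinates (x_{1}, x_{4}) and constant coefficients g_{11}=-1, g_{44}=+1; the triangle P, Q, R and all function names are chosen for illustration only.

```python
import cmath
import math

def form(u, v):
    """Form (1) restricted to the plane (x1, x4), with g11 = -1, g44 = +1 assumed."""
    return -u[0] * v[0] + u[1] * v[1]

def length(u):
    """Natural length of a side; real for the triangles of case b."""
    return cmath.sqrt(form(u, u))

def arccos_ext(p):
    """arccos for real p, continued beyond |p| > 1 by the two formulas of the text."""
    if p > 1:
        return 1j * math.log(p + math.sqrt(p * p - 1))
    if p < -1:
        return math.pi - arccos_ext(-p)
    return complex(math.acos(p))

def angle(u, v):
    """Angle between two sides issuing from the same vertex, from formula (5)."""
    cosine = form(u, v) / (length(u) * length(v))
    return arccos_ext(cosine.real)

def sub(a, b):
    return (a[0] - b[0], a[1] - b[1])

# Illustrative triangle whose three sides all have real lengths.
P, Q, R = (0.0, 0.0), (0.3, 1.0), (0.0, 3.0)
angles = [
    angle(sub(Q, P), sub(R, P)),  # at P
    angle(sub(P, Q), sub(R, Q)),  # at Q
    angle(sub(P, R), sub(Q, R)),  # at R
]
angle_sum = sum(angles)
side_PQ = length(sub(Q, P)).real
side_QR = length(sub(R, Q)).real
side_PR = length(sub(R, P)).real
```

In this example the side PR is longer than PQ and QR together, two of the angles are purely imaginary, and the imaginary parts cancel in the sum.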

From the cosine calculated by (4) or (5) the sine can be derived by means of the formula

\sin\varphi=\sqrt{1-\cos^{2}\varphi}

where for the case \cos^{2}\varphi>1 we can confine ourselves to the value

\sin\varphi=i\sqrt{\cos^{2}\varphi-1}

with the positive sign.

It deserves special notice that two conjugate radius-vectors of the indicatrix and the conjugate indicatrix are perpendicular to each other and that a deformation of the field-figure does not change the angle between two intersecting lines determined according to our definitions.


§ 8. Before proceeding further we must now indicate the natural units (§ 5) for two-, three-, or four-dimensional extensions in the field-figure. Like the unit of length, these are defined for each point separately, so that the numerical value of a finite extension is found by dividing it into infinitely small parts.

A two-dimensional extension cuts the conjugate indicatrix in an ellipse, or the indicatrix itself and the conjugate indicatrix in two conjugate hyperbolae. In both cases we derive our unit from the area of a parallelogram described on conjugate radius-vectors.

A three-dimensional extension cuts the conjugate indicatrix in an ellipsoid, or the indicatrix and its conjugate in two conjugate hyperboloids. Now our unit will be derived from the volume of a parallelepiped described on three conjugate radius-vectors.

In a similar way the magnitude of four-dimensional extensions will be determined by comparison with a parallelepiped the edges of which are four conjugate radius-vectors of the indicatrix and the conjugate indicatrix.

It must here be kept in mind that, according to well known theorems, the area of the parallelogram and the volume of the parallelepipeds in question are independent of the special choice of the conjugate radius-vectors.

We shall further specify the units in such a way (comp. § 5) that the numerical magnitude of a parallelogram or a parallelepiped described on conjugate radius-vectors is found by multiplying the numbers by which the edges are expressed in natural measure.

From what has been said it follows that the area of the parallelogram described on two line-elements is given by the product of the lengths of these elements and the sine of the enclosed angle. Similarly the area of an infinitely small triangle is determined by half the product of two sides and the sine of the angle between them.

We need hardly add that the numerical value of any two-, three- or four-dimensional domain expressed in natural measure is not changed by a deformation of the field-figure.


§ 9. Let, at any point P of the field-figure, 1, 2, 3, 4 be four arbitrarily chosen conjugate radius-vectors of the indicatrix. Two of these determine an infinitely small part V of a two-dimensional extension. We may prolong this part to finite distances from P by drawing from this point geodetic lines whose initial directions lie in the plane V. In this way we obtain six two-dimensional extensions (1,2), (2,3), (3,1), (1,4), (2,4) and (3,4). Let us now consider in one of these, e.g. (a, b), an infinitesimal triangle near the point P, the sides of which are geodetic lines (viz. geodetic lines in (a, b)). If in calculating the angles of this triangle we go to quantities of the second order with respect to the sides and to the distances from P, the sum s of the angles proves to have no longer the value \pi (comp. § 7). The "excess" e=s-\pi is proportional to the area \Delta of the triangle, independently of the length of the sides, of their ratios and of the position of the triangle in the extension (a, b). For the three extensions (1,2), (2,3), (3,1), which do not intersect the indicatrix itself but the conjugate indicatrix, this proposition follows from a well-known theorem of Gauss in the theory of curvature of surfaces; for the other three (1,4), (2,4), (3,4), which cut the indicatrix itself, the proof can be given by direct calculation. The considerations necessary for this, and some other calculations with which we shall be concerned further on, will be communicated in a later paper.

In considering the three last-mentioned extensions I have confined myself to triangles with real sides (§ 7, b).

The quotient

\frac{e}{\Delta}=K_{ab}

is now for each extension a definite number, which we may consider as a measure of the curvature of the two-dimensional extension (a, b); the sum K of the six numbers K_{ab} may be called the curvature of the field-figure at the point P in question. This quantity is the same that has been introduced by Hilbert; this results from the calculation of its value, which at the same time shows K to be independent of the special choice of the directions 1, 2, 3, 4 introduced in the beginning of this §.
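The quotient e/\Delta may be illustrated by the familiar case to which the theorem of Gauss applies directly. The sketch below is not taken from the paper: it uses an ordinary sphere of radius r, on which the quotient has the same value for finite triangles as for infinitesimal ones, and takes the triangle covering one eighth of the surface, whose three angles are right angles.

```python
import math

def curvature_from_excess(angle_sum, area):
    """K = e / Delta, with the excess e = s - pi."""
    return (angle_sum - math.pi) / area

r = 2.0                                   # illustrative radius
octant_angle_sum = 3 * (math.pi / 2)      # three right angles
octant_area = 4 * math.pi * r ** 2 / 8    # one eighth of the sphere
K_sphere = curvature_from_excess(octant_angle_sum, octant_area)
```

The result agrees with the value 1/r^{2} of the Gaussian curvature of the sphere.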

The numbers K_{ab} are all real and have a meaning that can be indicated without the introduction of coordinates; moreover their sum K is not changed by a deformation of the field-figure.

If now d\Omega is an element of the four-dimensional extension of the field-figure, expressed in natural measure, the part of the principal function belonging to the gravitation field is

H_{3}=\frac{i}{\varkappa}\int Kd\Omega (6)

where the integration is extended to the domain considered (§ 6) while \varkappa is the gravitation constant. H_{3} too is not changed by a deformation of the field-figure.

The factor i has been introduced in order to obtain a real value for H_3, the element d\Omega being represented in natural measure by a negative imaginary number (§ 8).


§ 10. What we have to say of the electromagnetic field must be preceded by some considerations belonging to what may be called the "vector theory" of the field-figure.

A line-element PQ, taken in a definite direction (indicated by the order of the letters), may be called a vector. Such vectors can be compounded or decomposed by means of parallelograms or parallelepipeds. Especially, when coordinates x_{1},\dots x_{4} have been chosen, a vector may be resolved into four components which have the directions of the coordinates, viz. such directions that a shift along the first e.g. changes x_{1} , while x_{2},x_{3},x_{4} remain constant. The four components in question are determined by the differentials dx_{1},\dots dx_{4} corresponding to PQ. We shall say that by these they are expressed in "x-measure". Their values in natural measure are found by multiplying dx_{1},\dots dx_{4} by certain factors. If we keep in mind that the radius-vectors of the conjugate indicatrix and the indicatrix in the directions of the axes are expressed in "x-measure" by

\frac{\epsilon}{\sqrt{-g_{11}}},\ \frac{\epsilon}{\sqrt{-g_{22}}},\ \frac{\epsilon}{\sqrt{-g_{33}}},\ \frac{\epsilon}{\sqrt{g_{44}}},

and in natural units by

i\epsilon,\ i\epsilon,\ i\epsilon,\ \epsilon

we find for the reducing factors

l_{1}=i\sqrt{-g_{11}},\ l_{2}=i\sqrt{-g_{22}},\ l_{3}=i\sqrt{-g_{33}},\ l_{4}=\sqrt{g_{44}}. (7)
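Each reducing factor is the quotient of the natural length of a radius-vector by its length in x-measure. The sketch below forms these quotients from the two lists given above and then checks, for a hypothetical diagonal set of coefficients (all g_{ab} with a \neq b assumed to vanish, so that the axes are mutually conjugate), that the squares of the components in natural measure add up to the quadratic form (1). The numerical values and the function names are illustrative only.

```python
import cmath

EPS = 1e-3  # the small constant epsilon of (2); it drops out of the quotients

def reducing_factors(g_diag):
    """Quotient natural measure / x-measure for the radius-vectors along the axes."""
    g11, g22, g33, g44 = g_diag
    x_measure = [
        EPS / cmath.sqrt(-g11),
        EPS / cmath.sqrt(-g22),
        EPS / cmath.sqrt(-g33),
        EPS / cmath.sqrt(g44),
    ]
    natural = [1j * EPS, 1j * EPS, 1j * EPS, EPS]
    return [n / x for n, x in zip(natural, x_measure)]

def natural_components(g_diag, X):
    """Components of a vector in natural measure from its components in x-measure."""
    return [l * x for l, x in zip(reducing_factors(g_diag), X)]

g_diag = (-4.0, -1.0, -9.0, 16.0)   # hypothetical g11, g22, g33, g44
X = (1.0, 2.0, 0.5, 3.0)            # hypothetical components in x-measure

factors = reducing_factors(g_diag)
components = natural_components(g_diag, X)
S2_from_components = sum(c * c for c in components)
S2_from_form = sum(g * x * x for g, x in zip(g_diag, X))
```

The first three factors come out purely imaginary and the fourth real, and the two expressions for the square of the magnitude agree.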

In the language of vector-analysis the vector obtained by the composition of two or more vectors is also called the sum of these vectors.

We shall also speak of finite vectors, i.e. of directed quantities which can be represented on an infinitely reduced scale by line-elements in the field-figure. If \omega is the constant "reduction factor" chosen for this purpose, a vector \mathrm{A} will be represented by a line-element \omega\mathrm{A} , the direction of which is also ascribed to \mathrm{A} . It will now be evident that two finite vectors, as well as two infinitely small ones, determine an infinitesimal two-dimensional extension and that finite vectors can be compounded and resolved by means of parallelograms and parallelepipeds. Also that we may speak of the "magnitude" of such figures, that e.g. the rule given in § 8 applies to the parallelogram described on two vectors.

The components of a vector in the directions of the coordinates expressed in x-measure will be called X_{1},X_{2},X_{3},X_{4} . This means that \omega X_{1},\dots\omega X_{4} are equal to the differentials dx_{1},\dots dx_{4} corresponding to the infinitely small vector \omega\mathrm{A} .

If we want to know the components of \mathrm{A} in natural units we must multiply X_{1},\dots X_{4} by the factors (7).


§ 11. Two vectors \mathrm{A} and \mathrm{B} starting from a point P of the field-figure and lying in a plane V, determine what we shall call a rotation \mathrm{R} in that plane. We ascribe to it the direction indicated by the order \mathrm{AB} and a value given by the parallelogram described on \mathrm{A} and \mathrm{B} and expressed in natural measure[11]. This involves that the same rotation may be represented in many different ways by two vectors in the plane V.

For the rotation \mathrm{R} we shall also use the symbol [\mathrm{A\cdot B]} .

By the vector product [\mathrm{A\cdot B\cdot C]} of three vectors \mathrm{A,B,C} at a point of the field-figure and not lying in one plane we shall understand a vector \mathrm{D} the direction of which is conjugate with each of the three vectors (and therefore with the three-dimensional extension \mathrm{A,B,C} ), the direction of \mathrm{D} corresponding to those of \mathrm{A,B} and \mathrm{C} in a way presently to be indicated, while the magnitude of \mathrm{D}, expressed in natural measure, is equal to that of the parallelepiped described on \mathrm{A}, \mathrm{B} and \mathrm{C} and expressed in the same measure. This definition involves that the value zero is ascribed to the vector product of three vectors lying in one and the same plane.

A further statement about the direction of \mathrm{D} is necessary because two opposite directions are conjugate with \mathrm{A,B,C} . For one set of three directions \mathrm{A_{0},B_{0},C_{0}} we shall choose arbitrarily which of its two conjugate directions will be said to correspond to it. If this is the direction \mathrm{D}_{0} , then the direction \mathrm{D} corresponding to \mathrm{A,B,C} will be determined by the rule that \mathrm{D}_{0} passes into \mathrm{D} by a gradual passage of the first three vectors from \mathrm{A_{0},B_{0},C_{0}} into \mathrm{A,B,C} , this latter passage being effected in such a way that during the change the vectors never come to lie in one plane.

The vector product [\mathrm{A\cdot B\cdot C]} takes the opposite direction when one of the vectors is reversed as well as when two of them are interchanged. We must therefore always attend to the order of the symbols in [\mathrm{A\cdot B\cdot C]} .

The vector product possesses the distributive property with respect to each of the three vectors, so that e.g. if \mathrm{A}_{1} and \mathrm{A}_{2} are vectors,

\left[\left(\mathrm{A}_{1}+\mathrm{A}_{2}\right)\cdot\mathrm{B\cdot C}\right]=\mathrm{\left[A_{1}\cdot B\cdot C\right]+\left[A_{2}\cdot B\cdot C\right]}

From this we can infer that [\mathrm{A\cdot B\cdot C]} depends only on \mathrm{C} and the rotation \mathrm{R} determined by \mathrm{A} and \mathrm{B}. For this reason we write for the vector product also [\mathrm{R\cdot C]} ; in calculating it we are free to replace the rotation \mathrm{R} by any two vectors by means of which it can be represented.

If \mathrm{R}, \mathrm{R}_{1} and \mathrm{R}_{2} are rotations in the same plane, such that the value and direction of \mathrm{R} are found by adding \mathrm{R}_{1} and \mathrm{R}_{2} algebraically, we have, in virtue of the distributive property

[\mathrm{R_{1}\cdot C]}+[\mathrm{R_{2}\cdot C]}=[\mathrm{R\cdot C]}

§ 12. In what precedes we were concerned with the volumes of parallelepipeds expressed in natural units. When we have introduced coordinates x_{1},\dots x_{4} we may also express these volumes in the "x-units" corresponding to the coordinates chosen.

Let us consider e.g. the three-dimensional extension x_{4}=const. , which cuts the conjugate indicatrix in the ellipsoid

g_{11}x_{1}^{2}+g_{22}x_{2}^{2}+g_{33}x_{3}^{2}+2g_{12}x_{1}x_{2}+2g_{23}x_{2}x_{3}+2g_{31}x_{3}x_{1}=-\epsilon^{2}

If we agree that in x-measure spaces in this extension will be represented by positive numbers and that a parallelepiped with the positive edges dx_{1},dx_{2},dx_{3} will have the volume dx_{1}\ dx_{2}\ dx_{3} , we find for that of the parallelepiped on three conjugate radius-vectors

\frac{\epsilon^{3}}{\sqrt{-G_{44}}}

where it has been taken into consideration that G_{44} is negative.

The volume of the same parallelepiped being expressed in natural measure by -i\epsilon^{3} (§ 8), we have to multiply by

l_{123}=-i\sqrt{-G_{44}}\, (8)

if we want to pass from the expression in x-measure to that in natural measure.

For the extension \left(x_{2},x_{3},x_{4}\right) , i.e. x_{1}=0 the corresponding factor is

l_{234}=-\sqrt{G_{11}} (9)
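Formulae (8) and (9) can be compared with the edge-by-edge rule of § 8 in a simple case. The sketch below assumes a hypothetical diagonal set of coefficients, for which the axes themselves are conjugate directions, and multiplies, for each edge, the quotient of its natural length (i\epsilon or \epsilon) by its length in x-measure. Only the minors G_{11} and G_{44} are needed; the helper names are illustrative.

```python
import cmath

def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def diagonal_minor(g, a):
    """G_aa: determinant left after deleting row a and column a (a counted from 0)."""
    keep = [k for k in range(4) if k != a]
    return det3([[g[r][c] for c in keep] for r in keep])

# Hypothetical coefficients with g11, g22, g33 negative and g44 positive.
g = [
    [-4.0, 0.0, 0.0, 0.0],
    [0.0, -1.0, 0.0, 0.0],
    [0.0, 0.0, -9.0, 0.0],
    [0.0, 0.0, 0.0, 16.0],
]

G44 = diagonal_minor(g, 3)
G11 = diagonal_minor(g, 0)
l_123 = -1j * cmath.sqrt(-G44)   # formula (8)
l_234 = -cmath.sqrt(G11)         # formula (9)

# Quotients natural length / x-measure length for the edges along the four axes.
edge = [1j * cmath.sqrt(-g[a][a]) for a in range(3)] + [cmath.sqrt(g[3][3])]
product_123 = edge[0] * edge[1] * edge[2]
product_234 = edge[1] * edge[2] * edge[3]
```

For these values G_{44} is negative and G_{11} positive, and each of the factors (8) and (9) coincides with the product of the three quotients belonging to the edges.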


§ 13. In the theory of electromagnetic phenomena we are concerned in the first place with the electric charge and the convection current. So far as these quantities belong to a definite element d\Omega of the field-figure they may be combined into

\mathrm{q}d\Omega

where \mathrm{q} is a vector which we may call the current vector. When it is resolved into four components having the directions of the axes, the first three components determine the convection current, while the fourth component gives the density of the electric charge.

As to the electric and the magnetic force, these two taken together can be represented at each point of the field-figure by two rotations

\mathrm{R}_{e} and \mathrm{R}_{h}

in definite, mutually conjugate two-dimensional extensions. These quantities are closely connected with the current vector, for after having introduced coordinates x_{1},\dots x_{4} we have for each closed surface \sigma the vector equation

\int\left\{ \left[\mathrm{R}_{e}\cdot\mathrm{N}\right]+\left[\mathrm{R}_{h}\cdot\mathrm{N}\right]\right\} _{x}d\sigma=i\int\{\mathrm{q}\}_{x}d\Omega (10)

where the second integral has to be taken over the domain \Omega enclosed by \sigma . On the left hand side d\sigma represents a three-dimensional surface-element expressed in natural units and \mathrm{N} a vector of the magnitude 1 in natural measure conjugate with or perpendicular to that element (§ 7) and directed towards the outside of the domain \Omega . The index x shows that the vector \left[\mathrm{R}_{e}\cdot\mathrm{N}\right]+\left[\mathrm{R}_{h}\cdot\mathrm{N}\right] must be expressed in x-measure. At each point of the surface we must resolve the vector along the four directions of the coordinates, express each component in x-measure (§ 10) and finally, after multiplication by d\sigma , we must add algebraically all x_{1} -components; similarly all x_{2} -components and so on.

It must be expressly remarked that if an equation like (10) in which we are concerned with the composition of vectors at different points of the field-figure, shall have a definite meaning we must know which components are to be considered as having the same direction, so that they can be added. This has been determined by the introduction of coordinates.

On the right hand side of the equation the index x means that the vector \mathrm{q} must be expressed in x-measure and the factor i had to be introduced because d\Omega is imaginary.

One can prove that equation (10) is equivalent to the differential equations which in Einstein's theory serve for the same purpose and further that when the equation holds for one choice of coordinates it will also be true for any other choice.


§ 14. The proof for these assertions must be deferred to the second part of this communication. For the present we shall only add that the part of the principal function referring to the electromagnetic field is given by

H_{2}=i\int\frac{1}{2}\left(\mathrm{R}_{e}^{2}+\mathrm{R}_{h}^{2}\right)d\Omega

where \mathrm{R}_{e} and \mathrm{R}_{h} are, expressed in natural units, the two rotations that are characteristic of the field. Like the two other parts of the principal function, H_{2} is not changed by a deformation of the field-figure. In this statement it is to be understood that the parallelograms by which \mathrm{R}_{e} and \mathrm{R}_{h} are represented take part in the deformation.

Some remarks on the way in which, starting from the principal function, we may obtain the fundamental equations of the theory must also be deferred. I shall conclude now by remarking that, as an immediate consequence of Hamilton's principle, the world-line of a material point which is acted on only by a given gravitation field, will be a geodetic line, and that the equations which determine the gravitation field caused by material and electromagnetic systems will be found by the consideration of infinitely small variations of the indicatrices, by which the numerical values of all quantities that are measured by means of these surfaces will be changed.

II.

(Communicated in the meeting of March 25, 1916).


§ 15. In the first part of this communication the connexion between the electric and the magnetic force on one hand and the charge and the convection current on the other was expressed by the equation

\int\left\{ \left[\mathrm{R}_{e}\cdot\mathrm{N}\right]+\left[\mathrm{R}_{h}\cdot\mathrm{N}\right]\right\} _{x}d\sigma=i\int\{\mathrm{q}\}_{x}d\Omega (10)

which has been discussed in § 13. It will now be shown that this formula is equivalent to the differential equations by which the connexion in question is expressed in the theory of Einstein. For this purpose some further geometrical considerations must first be developed. They refer to the special case that the quantities g_{ab} have the same values at every point of the field-figure.

If this condition is fulfilled, considerations which generally may be applied to infinitesimal extensions only are valid for finite extensions too.


§ 16. The factor required, in the measurement of four-dimensional domains, for the passage from x-units to natural units has now the same value at every point of the field-figure. Similarly, when any one-, two- or three-dimensional extension in the field-figure that is determined by linear equations ("linear extensions") is considered, the factor by means of which the said passage may be effected for parts of that extension, will be the same for all those parts. Moreover the factor in question will be the same for two "parallel" extensions of this kind, i.e. for two extensions the determining equations of which can be written in such a way that the coefficients of x_{1},\dots x_{4} are the same in them.

It is obvious that linear one-dimensional extensions can be called "straight lines"; it will also be clear what is to be understood by a "prism" (or "cylinder"). This latter is bounded by two mutually parallel linear three-dimensional extensions \sigma_{1} and \sigma_{2} and by a lateral surface which may be extended indefinitely to both sides and in which mutually parallel straight lines ("generating lines") can be drawn.

We need not dwell upon the elementary properties of the prism.


§ 17. A vector may now be represented by a straight line of finite length; the quantities X_{1},\dots X_{4}, which have been introduced in § 10, are the changes of the coordinates caused by a displacement along that line. The magnitude of the vector, expressed in natural units, will be denoted by S. It is given by a formula similar to (1), viz. by

S^{2}=\sum(ab)g_{ab}X_{a}X_{b} (11)

A vector may be regarded as being the same everywhere in the field-figure, if X_{1},\dots X_{4} have constant values. In the same way a rotation \mathrm{R} (§ 11) may be said to be the same everywhere, if it can be represented by two vectors of this kind.

If from a point P two vectors PQ and PR issue, denoted by X'_{1},\dots X'_{4}, S' and X''_{1},\dots X''_{4}, S'' resp., the angle between them (comp. (5)) is defined by

S'S''\cos(S',S'')=\sum(ab)g_{ab}X'_{a}X''_{b} (12)

We remark here that X'_{a},\ X''_{b} are real, positive or negative quantities and that S' and S'' are expressed in the way indicated in § 5 ("absolute" values). It is to be understood that S does not change when the signs of X_{1},\dots X_{4} are reversed at the same time.

If S''' is the value of the vector RQ and if the angle between this vector and RP is denoted by (S'', S'''), it follows further from (11) and (12) that

S''=S'\cos(S',S'')+S'''\cos(S'',S''')

In the special case of a right angle R we have

S''=S'\cos(S',S'')

an equation expressing the connexion between a vector PQ and its "projection" on a line PR. The angle (S', S'') is the angle between the vector and its projection, both reckoned from the same point P.
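The relations (11) and (12), and the projection formula derived from them, can be checked numerically. The following is only a minimal sketch and not part of the original paper: the matrix `G` is a hypothetical set of constant coefficients g_{ab}, chosen positive-definite (an assumption made solely so that all magnitudes come out real), and the function names are illustrative.

```python
import math

# Hypothetical constant, symmetric coefficients g_ab.
# Assumption: positive-definite, so that every magnitude S is real.
G = [
    [2.0, 0.5, 0.0, 0.0],
    [0.5, 1.0, 0.2, 0.0],
    [0.0, 0.2, 3.0, 0.1],
    [0.0, 0.0, 0.1, 1.5],
]

def inner(x, y, g=G):
    """Bilinear form sum(ab) g_ab x_a y_b, the right-hand side of (12)."""
    return sum(g[a][b] * x[a] * y[b] for a in range(4) for b in range(4))

def magnitude(x, g=G):
    """Magnitude S of a vector, from formula (11)."""
    return math.sqrt(inner(x, x, g))

def cos_angle(x, y, g=G):
    """Cosine of the angle between two vectors, from formula (12)."""
    return inner(x, y, g) / (magnitude(x, g) * magnitude(y, g))

def projection_identity(pq, pr):
    """Return (S'', S' cos(S',S'') + S''' cos(S'',S''')) for the triangle PQR."""
    rq = [q - r for q, r in zip(pq, pr)]   # vector from R to Q
    rp = [-r for r in pr]                  # vector from R to P
    s1, s2, s3 = magnitude(pq), magnitude(pr), magnitude(rq)
    return s2, s1 * cos_angle(pq, pr) + s3 * cos_angle(rq, rp)

lhs, rhs = projection_identity([1.0, 2.0, -1.0, 0.5], [0.3, -1.0, 2.0, 1.0])
print(abs(lhs - rhs) < 1e-12)
```

The two returned numbers should agree up to rounding for any pair of vectors, because the identity follows algebraically from the bilinearity of the right-hand side of (12).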


§ 18. Let us now return to the prism P mentioned in § 16. From a point A_{2} of the boundary of the "upper face" \sigma_{2}, we can draw a line perpendicular to \sigma_{2} and \sigma_{1}. Let B_{1} be the point where it cuts this last plane, the "base", and A_{1} the point where this plane is encountered by the generating line through A_{2}. If then \angle A_{1}A_{2}B_{1}=\vartheta, we have

\overline{A_{2}B_{1}}=\overline{A_{2}A_{1}}\cos\vartheta (13)

The strokes over the letters indicate the absolute values of the distances A_{2}B_{1} and A_{2}A_{1}.

It can be shown (§ 8) that, all quantities being expressed in natural units, the "volume" of the prism P is found by taking the product of the numerical values of the base \sigma_{1} and the "height" A_{2}B_{1}.

Let now linear three-dimensional extensions perpendicular to A_{1}A_{2} be made to pass through A_1 and A_2. From these extensions the lateral boundary of the prism cuts the parts \sigma'_{1} and \sigma'_{2} and these parts, together with the lateral surface, enclose a new prism P', the volume of which is equal to that of P. As now the volume of P' is given by the product of \overline{A_{2}A_{1}} and \sigma'_{1}, we have with regard to (13)

\sigma'_{1}=\sigma{}_{1}\cos\vartheta

If now we remember that, if a vector perpendicular to \sigma_{1} is projected on the generating line, the ratio between the projection and the vector itself (viz. between their absolute values) is given by \cos\vartheta and that a connexion similar to that which was found above between a normal section \sigma'_{1} of the prism and \sigma_{1}, also exists between \sigma'_{1} and any other oblique section, we easily find the following theorem:

Let \sigma and \bar{\sigma} be two arbitrarily chosen linear three-dimensional sections of the prism, \mathrm{N} and \bar{\mathrm{N}} two vectors, perpendicular to \sigma and \bar{\sigma} resp. and of the same length, S and \bar{S} the absolute values of the projections of \mathrm{N} and \bar{\mathrm{N}} on a generating line. Then we have

S\sigma=\bar{S}\bar{\sigma} (14)


§ 19. After these preliminaries we can show that the left hand side of (10) is equal to 0, if the numbers g_{ab} are constants and if moreover both the rotation \mathrm{R}_{e} and the rotation \mathrm{R}_{h} are everywhere the same. For the two parts of the integral the proof may be given in the same way, so that it suffices to consider the expression

\int\left[\mathrm{R}_{e}\cdot\mathrm{N}\right]_{x}d\sigma (15)

Let X_{1},\dots X_{4} be the components of the vector \mathrm{N}, expressed in x-units. From the distributive property of the vector product it then follows that each of the four components of

\left[\mathrm{R}_{e}\cdot\mathrm{N}\right]_{x}

is a homogeneous linear function of X_{1},\dots X_{4}. Under the special assumptions specified at the beginning of this § these are everywhere the same functions. Let us thus consider a definite component of (15), e.g. that which corresponds to the direction of the coordinate x_{a}. We can represent it by an expression of the form

\int\left(\alpha_{1}X_{1}+\dots+\alpha_{4}X_{4}\right)d\sigma

where \alpha_{1},\dots\alpha_{4} are constants. It will therefore be sufficient to prove that the four integrals

\int X_{1}d\sigma\dots\int X_{4}d\sigma (16)

vanish.

In order to calculate \int X_{1}d\sigma we consider an infinitely small prism, the edges of which have the direction x_1. This prism cuts from the boundary surface \sigma two elements d\sigma and \overline{d\sigma}. Proceeding along a generating line in the direction of the positive x_{1} we shall enter the extension \Omega bounded by \sigma through one of these elements and leave it through the other. Now the vectors perpendicular to \sigma, which occur in (15) and which we shall denote by \mathrm{N} and \bar{\mathrm{N}} for the two elements, have the same value.[12] If, therefore, S and \bar{S} are the absolute values of the projections of \mathrm{N} and \bar{\mathrm{N}} on a line in the direction x_1, we have according to (14)

Sd\sigma=\bar{S}\overline{d\sigma} (17)

Let first the four directions of coordinates be perpendicular to one another. Then the components of the vector obtained by projecting \mathrm{N} on the above mentioned line are X_{1},0,0,0 and similarly those of the projection of \bar{\mathrm{N}}: \bar{X}_{1},0,0,0. But as, proceeding in the direction of x_1, we enter \Omega through one element and leave it through the other, while \mathrm{N} and \bar{\mathrm{N}} are both directed outward, X_{1} and \overline{X_{1}} must have opposite signs. So we have

S:\bar{S}=X_{1}:-\bar{X}_{1}

and because of (17) we may now conclude that the elements X_{1}d\sigma and \overline{X_{1}}\overline{d\sigma} in the first of the integrals (16) annul each other. It will be clear now that the whole integral vanishes and that similar considerations may be applied to the other three.

So we have proved that under the special assumptions made the left hand side of (10) will vanish in the special case that the directions of the coordinates are perpendicular to each other. This conclusion likewise holds for another set of coordinates if only the assumption made at the beginning of this § is fulfilled. This is obvious, as we can pass from mutually perpendicular coordinates x_{1},\dots x_{4} to arbitrarily chosen other ones x'_{1},\dots x'_{4} which fulfil this latter condition by linear transformation formulae with constant coefficients. The x- and the x'-components of the vector

\left[\mathrm{R}_{e}\cdot\mathrm{N}\right]+\left[\mathrm{R}_{h}\cdot\mathrm{N}\right]

are then connected by homogeneous linear formulae with coefficients which have the same value at all points of the surface \sigma. Hence if, as has been shown above, the four x-components of the vector

\int\left\{ \left[\mathrm{R}_{e}\cdot\mathrm{N}\right]+\left[\mathrm{R}_{h}\cdot\mathrm{N}\right]\right\} d\sigma

vanish, the four x'-components are now seen to do so likewise.[13]
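The vanishing of the integrals (16) over a closed boundary has a familiar counterpart in ordinary three-dimensional Euclidean space: the outward normals of a closed polyhedron, each weighted with the area of its face, add up to zero. The sketch below illustrates this lower-dimensional analogue only; the tetrahedron and all function names are hypothetical and chosen for illustration, and nothing here reproduces the four-dimensional construction of the paper.

```python
def sub(u, v):
    return [a - b for a, b in zip(u, v)]

def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def outward_area_vectors(vertices):
    """For a tetrahedron, return one vector per face: outward normal times face area."""
    centroid = [sum(v[i] for v in vertices) / 4 for i in range(3)]
    vectors = []
    for skipped in range(4):
        a, b, c = [vertices[i] for i in range(4) if i != skipped]
        n = [0.5 * t for t in cross(sub(b, a), sub(c, a))]  # length equals the face area
        if dot(n, sub(a, centroid)) < 0:                    # make it point away from the interior
            n = [-t for t in n]
        vectors.append(n)
    return vectors

def total_area_vector(vertices):
    """Analogue of the integrals (16): the sum of N d(sigma) over the closed surface."""
    vectors = outward_area_vectors(vertices)
    return [sum(v[i] for v in vectors) for i in range(3)]

tetra = [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.3, 0.4, 3.0]]
print(all(abs(t) < 1e-12 for t in total_area_vector(tetra)))
```

As in the text, the cancellation happens pairwise along each coordinate direction: what enters the body through one part of the surface leaves it through another.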


§ 20. The above considerations were intended to prepare a corollary which will be of use in the treatment of the integral on the left hand side of (10), if we now leave the special assumptions made above and suppose the quantities g_{ab} to be functions of the coordinates while also the rotations \mathrm{R}_{e} and \mathrm{R}_{h} may change from point to point.

This corollary may be formulated as follows: If all dimensions of the limiting surface \sigma are infinitely small of the first order, the integral

\int\left\{ \left[\mathrm{R}_{e}\cdot\mathrm{N}\right]+\left[\mathrm{R}_{h}\cdot\mathrm{N}\right]\right\} _{x}d\sigma

will be of the fourth order.

In order to make this clear let us suppose that in the calculation of the integral we confine ourselves to quantities of the third order. The surface \sigma being already of that order we may then omit all infinitesimal values in the quantities by which d\sigma is multiplied; we may therefore neglect the infinitesimal changes of the quantities g_{ab} over the extension considered, and also those of \mathrm{R}_{e} and \mathrm{R}_{h}. By this we just come to the case considered in § 19. Thus it is evident that, as regards quantities of the third order, the left hand side of (10) is 0. From this it follows that in reality it is at least of the fourth order.


§ 21. Let us now return to the general case that the extension \Omega to which equation (10) refers, has finite dimensions. If by a surface \bar{\sigma} this extension is divided into two extensions \Omega_{1} and \Omega_{2}, the quantities on the two sides in (10) each consist of two parts referring to these extensions. For the right hand side this is immediately clear and as to the quantity on the left hand side, it follows from the consideration that the contributions of \bar{\sigma} to the integrals over the boundaries of \Omega_{1} and \Omega_{2} are equal with opposite signs. In the two cases namely we must take for \mathrm{N} equal but opposite vectors.

Also, if the extension \Omega is divided into an arbitrary number of parts, each term in (10) will be the sum of a number of integrals, each relating to one of these parts.

By surfaces with the equations x_{1}=\mathrm{const.},\dots x_{4}=\mathrm{const}. we can divide the extension \Omega into elements which we shall denote by \left(dx_{1},\dots dx_{4}\right). As a rule there will be left near the surface \sigma certain infinitely small extensions of a different form. From the preceding § it is evident that, in the calculation of the integrals, these latter extensions may be neglected and that only the extensions \left(dx_{1},\dots dx_{4}\right) have to be considered. From this we can conclude that equation (10) is valid for any finite extension, as soon as it holds for each of the elements \left(dx_{1},\dots dx_{4}\right).


§ 22. We shall now show what equation (10) becomes for one element \left(dx_{1},\dots dx_{4}\right). Besides the infinitesimal quantities x_{1},\dots x_{4}, occurring in the equation

F=\sum(ab)g_{ab}x_{a}x_{b}=\epsilon^{2}

of the indicatrix we introduce four other quantities \xi_{1},\dots\xi_{4}, which we define by

\xi_{a}=\frac{1}{2}\frac{\partial F}{\partial x_{a}} (18)

or

\left.\begin{array}{c}
\xi_{1}=g_{11}x_{1}+g_{12}x_{2}+\dots+g_{14}x_{4}\\
\cdots\cdots\cdots\cdots\cdots\cdots\\
\cdots\cdots\cdots\cdots\cdots\cdots\\
\xi_{4}=g_{41}x_{1}+g_{42}x_{2}+\dots+g_{44}x_{4}
\end{array}\right\} (19)

with the equalities g_{ba}=g_{ab}.

To each of these quantities corresponds a definite direction, viz. that in which we have to proceed in order to make the considered quantity change in positive sense while the other three remain constant. If we denote these directions by 1^{*},2^{*},3^{*},4^{*} and in the same way the directions of the coordinates x_{1},x_{2},x_{3},x_{4} by 1, 2, 3, 4, it is evident that 1^{*} is conjugate with 2, 3 and 4, 2^{*} with 3, 1 and 4, and so on; inversely 1 with 2^{*},3^{*},4^{*}; 2 with 3^{*},1^{*},4^{*}, and so on. From what has been said above about the algebraic signs of g_{11},g_{22},g_{33},g_{44} it follows further that, if directions opposite to 1, 1^{*} etc. are denoted by -1, -1^{*} etc., the directions -1 and 1^{*} will point to the same side of an extension x_{1}=\mathrm{const}. The same may be said of the directions -2 and 2^{*} or -3 and 3^{*} with respect to extensions x_{2}=\mathrm{const}., or x_{3}=\mathrm{const}., while with respect to an extension x_{4}=\mathrm{const}., the directions 4 and 4^{*} point to the same side.

Finally, we shall fix (§ 11) as far as is necessary, which direction corresponds to three others. For that purpose we shall imagine the directions of coordinates 1,\dots 4 to pass into mutually conjugate directions, which will also be called 1,\dots 4, by gradual changes, in such a way that never three of them come to lie in one plane. We shall agree that after this change -4 corresponds to 1, 2, 3.

Let a,b,c,d be the numbers 1, 2, 3, 4 in an order obtained from the natural one by an even number of permutations. Then the rule of § 11 teaches us that the direction -d corresponds to a,b,c. It is clear that this would be the case with d, if a,b,c,d were obtained from 1, 2, 3, 4 by an odd number of permutations. If further it is kept in mind that, always in the new case, the directions 1^{*},2^{*},3^{*},4^{*} coincide with -1, -2, -3, 4, we come to the conclusion that the directions 1, 2, 3 and 4 correspond to the sets 2^{*},3^{*},4^{*}; 3^{*},1^{*},4^{*}; 1^{*},2^{*},4^{*} and 1^{*},2^{*},3^{*} respectively. The rule of gradual change (§ 11) involves that this holds also for the original case, in which 1, 2, 3, 4 were not yet mutually conjugate.

This is all that has to be said about the relations between the different directions. It must only be kept in mind, that whenever two of the first three directions are interchanged, the fourth must be reversed.


§ 23. In the neighbourhood of a point P of the field-figure we may introduce as coordinates instead of x_{1},\dots x_{4} the quantities \xi_{1},\dots\xi_{4} defined by (19). Line-elements or finite vectors can be resolved in the directions of these coordinates, i.e. in the directions 1^{*},2^{*},3^{*},4^{*}. Their components and the magnitudes of different extensions can now be expressed in \xi-units in the same way as formerly in x-units. So the volume of a three-dimensional parallelepiped with the positive edges d\xi_{1},d\xi_{2},d\xi_{3} is represented by the product d\xi_{1}d\xi_{2}d\xi_{3}.

Solving x_{1},\dots x_{4} from (19) we obtain expressions of the form

\left.\begin{array}{c}
x_{1}=\gamma_{11}\xi_{1}+\gamma_{21}\xi_{2}+\dots+\gamma_{41}\xi_{4}\\
\cdots\cdots\cdots\cdots\cdots\cdots\\
\cdots\cdots\cdots\cdots\cdots\cdots\\
x_{4}=\gamma_{14}\xi_{1}+\gamma_{24}\xi_{2}+\dots+\gamma_{44}\xi_{4}\\
\gamma_{ba}=\gamma_{ab}
\end{array}\right\} (20)

If we use the coordinates \xi the coefficients \gamma_{ab} play the same part as the coefficients g_{ab} when the coordinates x are used. According to (18) and (20) we have namely

F=\sum(a)\xi_{a}x_{a}=\sum(ab)\gamma_{ab}\xi_{a}\xi_{b}

so that the equation of the indicatrix may be written

\sum(ab)\gamma_{ba}\xi_{a}\xi_{b}=\epsilon^{2}
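The passage from the quantities x_{a} to the quantities \xi_{a} by (19), and back again by (20), amounts to multiplication by the matrix of the g_{ab} and by its inverse. The following sketch, which is an illustration under stated assumptions and not the author's own computation, checks with exact rational arithmetic that the three expressions for F agree. The matrix `G` is a hypothetical symmetric set of coefficients with g_{11}, g_{22}, g_{33} negative and g_{44} positive; the helper names are likewise illustrative.

```python
from fractions import Fraction

def inverse(m):
    """Gauss-Jordan inverse with exact rational arithmetic."""
    n = len(m)
    aug = [[Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(m)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        pv = aug[col][col]
        aug[col] = [v / pv for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                f = aug[r][col]
                aug[r] = [vr - f * vc for vr, vc in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def apply(m, v):
    """Matrix times vector: xi_a = sum(b) g_ab x_b, as in (19)."""
    return [sum(m[a][b] * v[b] for b in range(len(v))) for a in range(len(m))]

def quadratic(m, v):
    """Quadratic form sum(ab) m_ab v_a v_b."""
    return sum(m[a][b] * v[a] * v[b] for a in range(len(v)) for b in range(len(v)))

# Hypothetical symmetric coefficients g_ab (an assumption for illustration).
G = [[-2, 1, 0, 0],
     [1, -3, 0, 1],
     [0, 0, -1, 0],
     [0, 1, 0, 4]]

x = [1, 2, -1, 3]
xi = apply(G, x)                          # formula (19)
gamma = inverse(G)                        # coefficients gamma_ab of (20)
f_x = quadratic(G, x)                     # sum(ab) g_ab x_a x_b
f_mixed = sum(a * b for a, b in zip(xi, x))   # sum(a) xi_a x_a
f_xi = quadratic(gamma, xi)               # sum(ab) gamma_ab xi_a xi_b
print(f_x == f_mixed == f_xi)
```

Exact fractions are used here so that the equality can be tested without any rounding tolerance.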


§ 24. Let the rotations \mathrm{R}_{e} and \mathrm{R}_{h} of which we spoke in § 13 be defined by the vectors \mathrm{A^{I},A^{II}} and \mathrm{A^{III},A^{IV}} respectively, the resultants of the vectors \mathrm{A_{1^{*}}^{I},\dots A_{4^{*}}^{I}}, etc. in the directions 1^{*},\dots4^{*}. Then, according to the properties of the vector product that were discussed in § 11,

\begin{array}{ll}
\left[\mathrm{R}_{e}\cdot\mathrm{N}\right] & =\left[\mathrm{\left(A_{1^{*}}^{I}+\dots+A_{4^{*}}^{I}\right)\cdot\left(A_{1^{*}}^{II}+\dots+A_{4^{*}}^{II}\right)\cdot N}\right]\\
 & =\sum(\overline{ab})\left\{ \left[\mathrm{A}_{a^{*}}^{I}\cdot\mathrm{A}_{b^{*}}^{II}\cdot\mathrm{N}\right]-\left[\mathrm{A}_{a^{*}}^{II}\cdot\mathrm{A}_{b^{*}}^{I}\cdot\mathrm{N}\right]\right\} 
\end{array}

where the stroke over ab indicates that each combination of two different numbers a, b contributes one term to the sum. For the vector product \left[\mathrm{R}_{h}\cdot\mathrm{N}\right] we have a similar equation. Now two or more rotations in one and the same plane, e.g. in the plane a^{*}b^{*}, may be replaced by one rotation, which can be represented by means of two vectors with arbitrarily chosen directions in that plane, e.g. the directions a^{*} and b^{*}. We may therefore introduce two vectors \mathrm{B}_{a^{*}} and \mathrm{B}_{b^{*}} directed along a^{*} and b^{*} resp., so that

\left[\mathrm{B}_{a^{*}}\cdot\mathrm{B}_{b^{*}}\right]=\left[\mathrm{A}_{a^{*}}^{I}\cdot\mathrm{A}_{b^{*}}^{II}\right]-\left[\mathrm{A}_{a^{*}}^{II}\cdot\mathrm{A}_{b^{*}}^{I}\right]+\left[\mathrm{A}_{a^{*}}^{III}\cdot\mathrm{A}_{b^{*}}^{IV}\right]-\left[\mathrm{A}_{a^{*}}^{IV}\cdot\mathrm{A}_{b^{*}}^{III}\right] (21)

Then we must substitute in (10)

\left[\mathrm{R}_{e}\cdot\mathrm{N}\right]+\left[\mathrm{R}_{h}\cdot\mathrm{N}\right]=\sum(\overline{ab})\left[\mathrm{B}_{a^{*}}\cdot\mathrm{B}_{b^{*}}\cdot\mathrm{N}\right] (22)

Here it must be remarked that the magnitude and the sense of one of the vectors \mathrm{B} may be chosen arbitrarily; when this has been done, the other vector is perfectly determined.

In the following calculations the vector \mathrm{N} has one of the directions 1^{*},\dots4^{*}. As this is also the case with the vectors \mathrm{B}_{a^{*}} and \mathrm{B}_{b^{*}}, the vector product occurring in (22) can easily be expressed in \xi-units. After that we may pass to natural units and finally, as is necessary for the substitution in (10), to x-units.

In order to pass from \xi-units to natural units we have to multiply a vector in the direction a^{*} by a certain coefficient \lambda_{a}, and a part of the extension a^{*},b^{*},c^{*} by a coefficient \lambda_{abc}. These coefficients correspond to l_{a} (§ 10) and l_{abc} (§ 12). The factors \lambda_{abc} e.g. can be expressed by means of the minors \Gamma_{ab} of the determinant \gamma of the quantities \gamma_{ab}. If this is worked out and if the equations

\gamma_{ab}=\frac{G_{ab}}{g},\ g_{ab}=\frac{\Gamma_{ab}}{\gamma},\ g\gamma=1

are taken into consideration, we obtain the following corollary, which we shall soon use:

Let a, b, c, d and also a', b', c', d' be the numbers 1, 2, 3, 4 in any order, a' being not the same as a; then we have, if neither of the two numbers a and a' is 4,

\frac{l_{bcd}\lambda_{b'c'd'}}{l_{a'}\lambda_{a}}=-1 (23)

and if one of the two is 4

\frac{l_{bcd}\lambda_{b'c'd'}}{l_{a'}\lambda_{a}}=1 (24)


§ 25. We shall now suppose (comp. § 24) that in \xi-units the vector \mathrm{B}_{a^{*}} has the value +1, and we shall write \chi_{ab} for the value that must then be given to \mathrm{B}_{b^{*}}. If the \xi-components of the vectors \mathrm{A^{I}} etc. are denoted by \Xi_{1}^{I},\dots\Xi_{4}^{I} etc., we find from (21)

\chi_{ab}=\left(\Xi_{a}^{I}\Xi_{b}^{II}-\Xi_{a}^{II}\Xi_{b}^{I}\right)+\left(\Xi_{a}^{III}\Xi_{b}^{IV}-\Xi_{a}^{IV}\Xi_{b}^{III}\right) (25)

This formula involves that

\chi_{ba}=-\chi_{ab} (26)

It may be remarked that \chi_{ba} is the value that must be given to the vector \mathrm{B}_{a^{*}} if \mathrm{B}_{b^{*}} is taken to be 1.

The quantities \chi_{ab} may be said to represent the rotations \left[\mathrm{B}_{a^{*}}\cdot\mathrm{B}_{b^{*}}\right].

At the end of our calculations we shall introduce instead of \chi_{ab} the quantities \psi_{ab} defined by

\psi_{ab}=\chi_{a'b'}\ (a\neq b),\ \psi_{aa}=0 (27)

In the first of these equations a, b, a', b' are supposed to be the numbers 1, 2, 3, 4, in an order obtained from 1, 2, 3, 4 by an even number of permutations.
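The rule (27) can be written out mechanically. The sketch below is an illustration only, with hypothetical numerical values for the six independent \chi_{ab}; it builds the \psi_{ab} by running through the even permutations of 1, 2, 3, 4, and the function names are not taken from the paper.

```python
from itertools import permutations

def is_even(perm):
    """True if the permutation has an even number of inversions."""
    inversions = sum(1 for i in range(len(perm)) for j in range(i + 1, len(perm))
                     if perm[i] > perm[j])
    return inversions % 2 == 0

def antisymmetric(values):
    """Build chi_ab (indices 1..4) from chi_12, chi_13, chi_14, chi_23, chi_24, chi_34."""
    pairs = [(1, 2), (1, 3), (1, 4), (2, 3), (2, 4), (3, 4)]
    chi = {(a, a): 0 for a in range(1, 5)}
    for (a, b), v in zip(pairs, values):
        chi[(a, b)] = v
        chi[(b, a)] = -v          # relation (26)
    return chi

def psi_from_chi(chi):
    """psi_ab = chi_a'b' where (a, b, a', b') is an even permutation of 1, 2, 3, 4."""
    psi = {(a, a): 0 for a in range(1, 5)}
    for a, b, a2, b2 in permutations((1, 2, 3, 4)):
        if is_even((a, b, a2, b2)):
            psi[(a, b)] = chi[(a2, b2)]
    return psi

chi = antisymmetric([1, 2, 3, 4, 5, 6])   # hypothetical values
psi = psi_from_chi(chi)
print(psi[(3, 4)], psi[(1, 3)], psi[(2, 3)], psi[(4, 3)])   # prints: 1 -5 3 -1
```

For every pair a, b with a different from b exactly one of the two possible orders of the remaining numbers gives an even permutation, so each \psi_{ab} is assigned once and the antisymmetry of \chi carries over to \psi.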

§ 26. We have now to calculate the left hand side of equation (10) for the case that \sigma is the surface of an element \left(dx_{1},\dots dx_{4}\right). For this purpose we shall each time take together two opposite sides, calculating for each pair the contributions due to the different terms on the right hand side of (22), or as we may say to the different rotations \chi_{ab}. It is convenient now to denote by a, b, c the numbers 1, 2, 3 either in this order or in any other derived from it by a cyclic permutation, while the x-components of the vector we are calculating and which stands on the left hand side of (10) will be represented by X_{1},\dots X_{4}.

a. Let us first consider that one of the sides \left(dx_{a},dx_{b},dx_{c}\right) which faces towards the side of the positive x_4. The vector \mathrm{N} drawn outward has the direction 4^{*} and in \xi-units the magnitude \frac{1}{\lambda_{4}}. As the direction c corresponds to a^{*},b^{*},4^{*}, the rotation \chi_{ab} gives with \mathrm{N} a vector product represented by a vector in the direction c. The magnitude of this vector is in \xi-units

\frac{1}{\lambda_{4}}\chi_{ab}

and in natural units

\frac{\lambda_{ab4}}{\lambda_{4}}\chi_{ab}

This must be multiplied by l_{abc}dx_{a}dx_{b}dx_{c}, the magnitude of the side under consideration in natural units, and finally by \tfrac{1}{l_{c}} to express the vector product in x-units. Because of (24) we may write for the result

\chi_{ab}dx_{a}dx_{b}dx_{c}=\psi_{c4}dx_{a}dx_{b}dx_{c}

The opposite side gives a similar result with the opposite sign (\mathrm{N} having for that side the direction -4^{*}), so that together the sides contribute the term

\frac{\partial\psi_{c4}}{\partial x_{4}}dW

to the component X_{c}. For shortness sake we have put here

dx_{1}dx_{2}dx_{3}dx_{4}=dW

Finally we may take c = 1, 2, 3.

b. Secondly we consider a side \left(dx_{a},dx_{b},dx_{4}\right) facing towards the positive x_c. The vector \mathrm{N} has now the direction -c^{*}. We consider the vector products of this vector with the rotations \chi_{b4}, \chi_{4a} and \chi_{ba}, which vector products have the directions a, b and 4. A calculation exactly similar to the one we performed just now gives the contributions to X_{a},X_{b},X_{4}. For these we thus find the products of dx_{a}dx_{b}dx_{4} by

\begin{array}{c}
\frac{l_{ab4}\lambda_{bc4}}{l_{a}\lambda_{c}}\chi_{b4}=\chi_{4b}=\psi_{ac},\\
\\
\frac{l_{ab4}\lambda_{ac4}}{l_{b}\lambda_{c}}\chi_{4a}=\chi_{a4}=\psi_{bc},\\
\\
\frac{l_{ab4}\lambda_{abc}}{l_{4}\lambda_{c}}\chi_{ba}=\chi_{ba}=\psi_{4c}.
\end{array}

Taking also into consideration the opposite side \left(dx_{a},dx_{b},dx_{4}\right) we find for X_{a},X_{b},X_{4} the contributions

\frac{\partial\psi_{ac}}{\partial x_{c}}dW,\ \frac{\partial\psi_{bc}}{\partial x_{c}}dW,\ \frac{\partial\psi_{4c}}{\partial x_{c}}dW.

This may be applied to each of the three pairs of sides not yet mentioned under a; we have only to take for c successively 1, 2, 3.

Summing up what has been said in this § we may say: the components of the vector on the left hand side of (10) are

X_{a}=\sum(b)\frac{\partial\psi_{ab}}{\partial x_{b}}dW
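The bookkeeping of this § (taking together each pair of opposite sides and adding the results) can be imitated for a simple test field. The sketch below is an illustration under assumptions, not the calculation of the paper: it takes a hypothetical antisymmetric field \psi_{ab} that depends linearly on the coordinates, evaluates it at the centres of opposite sides of an element, and compares the sum with \sum(b)\frac{\partial\psi_{ab}}{\partial x_{b}}dW. Indices run from 0 to 3 in the code instead of 1 to 4.

```python
from fractions import Fraction as Fr

# Hypothetical linear field: for a < b,
# psi_ab(x) = CONST[a, b] + sum(k) SLOPE[a, b][k] * x_k.
SLOPE = {
    (0, 1): [1, 2, 0, -1],
    (0, 2): [0, 3, 1, 2],
    (0, 3): [2, 0, -1, 4],
    (1, 2): [1, 1, 1, 1],
    (1, 3): [0, -2, 5, 3],
    (2, 3): [3, 0, 2, -2],
}
CONST = {key: i + 1 for i, key in enumerate(sorted(SLOPE))}

def psi(a, b, x):
    """Antisymmetric test field psi_ab at the point x."""
    if a == b:
        return Fr(0)
    if a > b:
        return -psi(b, a, x)
    return CONST[(a, b)] + sum(Fr(s) * xk for s, xk in zip(SLOPE[(a, b)], x))

def dpsi(a, b, k):
    """Exact derivative of psi_ab with respect to x_k."""
    if a == b:
        return Fr(0)
    if a > b:
        return -dpsi(b, a, k)
    return Fr(SLOPE[(a, b)][k])

def component_from_sides(a, corner, edges):
    """Sum over the four pairs of opposite sides of the element."""
    total = Fr(0)
    for b in range(4):
        side = Fr(1)
        for j in range(4):
            if j != b:
                side *= edges[j]              # magnitude of the side in x-units
        lo = [corner[j] + (edges[j] / 2 if j != b else 0) for j in range(4)]
        hi = list(lo)
        hi[b] += edges[b]                     # centre of the opposite side
        total += (psi(a, b, hi) - psi(a, b, lo)) * side
    return total

corner = [Fr(1), Fr(-2), Fr(0), Fr(3)]
edges = [Fr(1, 10), Fr(1, 20), Fr(1, 5), Fr(1, 8)]
dW = edges[0] * edges[1] * edges[2] * edges[3]
for a in range(4):
    expected = sum(dpsi(a, b, b) for b in range(4)) * dW
    print(component_from_sides(a, corner, edges) == expected)
```

For a linear field the agreement is exact; for a general field it would hold only up to terms of higher order in the edges, which is all that the argument of the text requires.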


§ 27. For the components of the vector occurring on the right hand side of (10) we may write

i\mathrm{q}_{a}d\Omega

if \mathrm{q}_{a} is the component of the vector \mathrm{q} in the direction x_{a} expressed in x-units, while d\Omega represents the magnitude of the element \left(dx_{1},\dots dx_{4}\right) in natural units. This magnitude is

-i\sqrt{-g}dW

so that by putting

\sqrt{-g}\mathrm{q}_{a}=w_{a} (28)

we find for equation (10)

\sum(b)\frac{\partial\psi_{ab}}{\partial x_{b}}=w_{a} (29)

The four relations contained in this equation have the same form as those expressed by formula (25) in my paper of last year[14]. We shall now show that the two sets of equations correspond in all respects. For this purpose it will be shown that the transformation formulae formerly deduced for w_{a} and \psi_{ac} follow from the way in which these quantities have been now defined. The notations from the former paper will again be used and we shall suppose the transformation determinant p to be positive.

§ 28. Between the differentials of the original coordinates x_{a} and the new coordinates x'_{a} which we are going to introduce we have the relations

dx'_{a}=\sum(b)\pi_{ba}dx_{b} (30)

and formulae of the same form (comp. § 10) may be written down for the components of a vector expressed in x-measure. As the quantities \mathrm{q}_{a} constitute a vector and as

\sqrt{-g'}=p\sqrt{-g}

we have according to (28)[15]

\frac{1}{\sqrt{-g'}}w'_{a}=\frac{1}{\sqrt{-g}}\sum(b)\pi_{ba}w_{b}

or

w'_{a}=p\sum(b)\pi_{ba}w_{b}

Further we have for the infinitely small quantities \xi_{a}[16] defined by (19)

\xi'_{a}=\sum(b)p_{ba}\xi_{b}

and in agreement with this for the components of a vector expressed in \xi-units

\Xi'_{a}=\sum(b)p_{ba}\Xi_{b}

so that we find from (25)[17]

\chi'_{ab}=\sum(cd)p_{ca}p_{db}\chi_{cd}

Interchanging here c and d, we obtain

\chi'_{ab}=\sum(cd)p_{da}p_{cb}\chi_{dc}=-\sum(cd)p_{da}p_{cb}\chi_{cd}

and

\chi'_{ab}=\frac{1}{2}\sum(cd)\left(p_{ca}p_{db}-p_{da}p_{cb}\right)\chi_{cd} (31)

The quantity between brackets on the right hand side is a second order minor of the determinant p and as is well known this minor is related to a similar minor of the determinant of the coefficients \pi_{ab}. If a'b' corresponds to ab in the way mentioned in § 25, and c'd' in the same way to cd, we have

p_{ca}p_{db}-p_{da}p_{cb}=p\left(\pi_{c'a'}\pi_{d'b'}-\pi_{d'a'}\pi_{c'b'}\right)

so that (31) becomes

\chi'_{ab}=\frac{1}{2}p\sum(cd)\left(\pi_{c'a'}\pi_{d'b'}-\pi_{d'a'}\pi_{c'b'}\right)\chi_{cd}

According to (27) this becomes

\psi'_{a'b'}=\frac{1}{2}p\sum(cd)\left(\pi_{c'a'}\pi_{d'b'}-\pi_{d'a'}\pi_{c'b'}\right)\psi_{c'd'}

for which we may write

\psi'_{ab}=\frac{1}{2}p\sum(cd)\left(\pi_{ca}\pi_{db}-\pi_{da}\pi_{cb}\right)\psi_{cd}

Interchanging c and d in the second of the two parts into which the sum on the right hand side can be decomposed, and taking into consideration that

\psi_{dc}=-\psi_{cd}

as is evident from (26) and (27), we find[18]

\psi'_{ab}=p\sum(cd)\pi_{ca}\pi_{db}\psi_{cd}
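The chain of formulae in this § can be tested on numbers. The sketch below is an illustration and not part of the original derivation. It assumes that the coefficients are related by p_{ba}=\partial x_{b}/\partial x'_{a} and \pi_{ba}=\partial x'_{a}/\partial x_{b}, so that the matrix of the \pi_{ba} is the transposed inverse of the matrix of the p_{ba}; this reading is consistent with (30) and with the transformation of the \xi_{a}, but the notation of the former paper is not reproduced here. The matrix `P` and the values of \chi are hypothetical, and indices run from 0 to 3.

```python
from fractions import Fraction as Fr
from itertools import permutations

def det(m):
    """Determinant by Laplace expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))

def inverse(m):
    """Exact inverse from the adjugate: inv[i][j] = cofactor(j, i) / det."""
    n, d = len(m), det(m)
    def minor(i, j):
        return [row[:j] + row[j + 1:] for k, row in enumerate(m) if k != i]
    return [[Fr((-1) ** (i + j) * det(minor(j, i)), d) for j in range(n)]
            for i in range(n)]

def is_even(perm):
    return sum(1 for i in range(4) for j in range(i + 1, 4) if perm[i] > perm[j]) % 2 == 0

def dual(chi):
    """psi_ab = chi_a'b' with (a, b, a', b') an even permutation, as in (27)."""
    psi = [[Fr(0)] * 4 for _ in range(4)]
    for a, b, a2, b2 in permutations(range(4)):
        if is_even((a, b, a2, b2)):
            psi[a][b] = chi[a2][b2]
    return psi

def transform_chi(chi, P):
    """chi'_ab = sum(cd) p_ca p_db chi_cd, with P[c][a] = p_ca."""
    return [[sum(P[c][a] * P[d][b] * chi[c][d] for c in range(4) for d in range(4))
             for b in range(4)] for a in range(4)]

def transform_psi(psi, P):
    """psi'_ab = p sum(cd) pi_ca pi_db psi_cd (assumption: pi = transposed inverse of P)."""
    p, inv = det(P), inverse(P)
    PI = [[inv[a][c] for a in range(4)] for c in range(4)]   # PI[c][a] = pi_ca
    return [[p * sum(PI[c][a] * PI[d][b] * psi[c][d] for c in range(4) for d in range(4))
             for b in range(4)] for a in range(4)]

def antisymmetric(values):
    chi = [[0] * 4 for _ in range(4)]
    pairs = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3)]
    for (a, b), v in zip(pairs, values):
        chi[a][b], chi[b][a] = v, -v
    return chi

P = [[2, 1, 0, 0], [0, 1, 1, 0], [0, 0, 1, 3], [-1, 0, 0, 1]]   # hypothetical, det > 0
CHI = antisymmetric([1, 2, 3, 4, 5, 6])                          # hypothetical values
print(dual(transform_chi(CHI, P)) == transform_psi(dual(CHI), P))
```

The comparison transforms \chi first and then passes to \psi, and alternatively passes to \psi first and then applies the final formula of this §; the two routes should give the same sixteen numbers.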


§ 29. Finally it can be proved that if equation (10) holds for one system of coordinates x_{1},\dots x_{4}, it will also be true for every other system x'_{1},\dots x'_{4}, so that

\int\left\{ \left[\mathrm{R}_{e}\cdot\mathrm{N}\right]+\left[\mathrm{R}_{h}\cdot\mathrm{N}\right]\right\} _{x'}d\sigma=i\int\{\mathrm{q}\}_{x'}d\Omega (32)

To show this we shall first assume that the extension \Omega, which is understood to be the same in the two cases, is the element \left(dx_{1},\dots dx_{4}\right).

For the four equations taken together in (10) we may then write

\int u_{1}d\sigma=v_{1}d\Omega,\dots\int u_{4}d\sigma=v_{4}d\Omega (33)

and in the same way for the four equations (32)

\int u'_{1}d\sigma=v'_{1}d\Omega,\dots\int u'_{4}d\sigma=v'_{4}d\Omega (34)

We have now to deduce these last equations from (33). In doing so we must keep in mind that u_{1},\dots u_{4} are the x-components and u'_{1},\dots u'_{4} the x'-components of one definite vector and that the same may be said of v_{1},\dots v_{4} and v'_{1},\dots v'_{4}.

Hence, at a definite point (comp. (30))

v'_{a}=\sum(b)\pi_{ba}v_{b} (35)

We shall particularly denote by \pi_{ba} the values of these quantities belonging to the corner P from which the edges dx_{1},\dots dx_{4} issue in positive directions. To the right hand sides of the equations (34) we may apply transformation (35) with these values of \pi_{ba}, d\Omega being infinitely small of the fourth order and it being allowed to confine ourselves to quantities of this order.

On the left hand sides of (34), however, we must take into consideration, the surface being of the third order, that the values of \pi_{ba} change from point to point. Let \mathrm{x}_{1},\dots\mathrm{x}_{4} be the changes which x_{1},\dots x_{4} undergo when we pass from P to any other point of the surface. Then we must write for the value of the coefficient at this last point

\pi_{ba}+\sum(c)\frac{\partial\pi_{ba}}{\partial x_{c}}\mathrm{x}_{c}

We thus have

\int u'_{a}d\sigma=\sum(b)\pi_{ba}\int u_{b}d\sigma+\sum(b)\int u_{b}\sum(c)\frac{\partial\pi{}_{ba}}{\partial x_{c}}\mathrm{x}_{c}d\sigma

It will be shown presently that the last term vanishes. This being proved, it is clear that the relations (34) follow from (33); indeed, multiplying equations (33) by \pi_{1a},\dots\pi_{4a} respectively and adding them we find

\int u'_{a}d\sigma=v'_{a}d\Omega


§ 30. The proof for

\sum(b)\int u_{b}\sum(c)\frac{\partial\pi{}_{ba}}{\partial x_{c}}\mathrm{x}_{c}d\sigma=0 (36)

rests on the relations

\frac{\partial\pi{}_{ba}}{\partial x_{e}}=\frac{\partial\pi{}_{ea}}{\partial x_{b}} (37)

which follow from

\pi{}_{ba}=\frac{\partial x'_{a}}{\partial x_{b}},\ \pi{}_{ea}=\frac{\partial x'_{a}}{\partial x_{e}}.

The integral which occurs in (36) differs from

\int u_{b}d\sigma (38)

by the infinitely small factor under the sign of integration

\sum(c)\frac{\partial\pi{}_{ba}}{\partial x_{c}}\mathrm{x}_{c}

Now we have calculated in § 26 integrals like (38) by taking together each time two opposite sides, one of which \Sigma_{1} passes through P while the second \Sigma_{2} is obtained from the first by a shift in the direction of one of the coordinates e. g. of x_{e} over the distance dx_{e}. We had then to keep in mind that for the two sides the values of u_{b}, which have opposite signs, are a little different; and it was precisely this difference that was of importance. In the calculation of the integral

\int u_{b}\sum(c)\frac{\partial\pi{}_{ba}}{\partial x_{c}}\mathrm{x}_{c}d\sigma (39)

however it may be neglected. Hence, when we express the components u_{b} in terms of the quantities \psi_{ab}, we may give to these latter the values which they have at the point P.

Let us consider two sides situated at the ends of the edges dx_{e} and whose magnitude we may therefore express in x-units dx_{j}dx_{k}dx_{l} if j, k, l are the numbers which are left of 1, 2, 3, 4 when the number e is omitted. For the part contributed to (38) by the side \Sigma_{2} we found in § 26

\psi{}_{be}dx_{j}dx_{k}dx_{l}

We now find for the part of (39) due to the two sides

\psi{}_{be}\sum(c)\frac{\partial\pi{}_{ba}}{\partial x_{c}}\left[\int\limits _{2}\mathrm{x}_{c}d\sigma-\int\limits _{1}\mathrm{x}_{c}d\sigma\right]

where the first integral relates to \Sigma_{2} and the second to \Sigma_{1}. It is clear that but one value of c, viz. e, has to be considered. As everywhere in \Sigma_{1}: \mathrm{x}_{e}=0 and everywhere in \Sigma_{2}: \mathrm{x}_{e}=dx_{e}, it is further evident that the above expression becomes

\psi{}_{be}\frac{\partial\pi{}_{ba}}{\partial x_{e}}dW

This is one part contributed to the expression (36). A second part, the origin of which will be immediately understood, is found by interchanging b and e. With a view to (37) and because of

\psi{}_{eb}=-\psi{}_{be}

we have for each term of (36) another by which it is cancelled. This is what had to be proved.


§ 31. Now that we have shown that equation (32) holds for each element \left(dx_{1},\dots dx_{4}\right) we may conclude by the considerations of § 21 that this is equally true for any arbitrarily chosen magnitude and shape of the extension \Omega. In particular the equation may be applied to an element \left(dx'_{1},\dots dx'_{4}\right) and by considerations exactly similar to those presented in § 26 we see that in the new coordinates as well as in the original ones we have equations of the form (29).

Whatever be our choice of the coordinates the part of the principal function indicated in § 14 can therefore be derived for a given current vector \mathrm{q}.

In a sequel to this paper some conclusions that may be drawn from Hamilton's principle will be considered.

III.

(Communicated in the meeting of April 1916.)[19]


§ 32. In the two preceding papers[20] we have tried so far as possible to present the fundamental principles of the new gravitation theory in a simple form.

We shall now show how Einstein's differential equations for the gravitation field can be derived from Hamilton's principle. In this connexion we shall also have to consider the energy, the stresses, momenta and energy-currents in that field.

We shall again introduce the quantities g_{ab} formerly used and we shall also use the "inverse" system of quantities for which we shall now write g^{ab}. It is found useful to introduce besides these the quantities

\mathfrak{g}^{ab}=\sqrt{-g}g^{ab}

Differential coefficients of all these variables with respect to the coordinates will be represented by the indices belonging to these latter, e.g.

g_{ab,p}=\frac{\partial g_{ab}}{\partial x_{p}},\ g_{ab,pq}=\frac{\partial^{2}g_{ab}}{\partial x_{q}\partial x_{p}}

We shall use Christoffel's symbols

\left[\begin{array}{c}
ab\\
c
\end{array}\right]=\frac{1}{2}\left(g_{ac,b}+g_{bc,a}-g_{ab,c}\right)

and Riemann's symbol

\begin{array}{l}
(ik,lm)=\frac{1}{2}\left(g_{im,lk}+g_{kl,im}-g_{il,km}-g_{km,il}\right)+\\
\\
\qquad+\sum(ab)g^{ab}\left\{ \left[\begin{array}{c}
im\\
a
\end{array}\right]\left[\begin{array}{c}
kl\\
b
\end{array}\right]-\left[\begin{array}{c}
il\\
a
\end{array}\right]\left[\begin{array}{c}
km\\
b
\end{array}\right]\right\} 
\end{array}

Further we put

G_{im}=\sum(kl)g^{kl}(ik,lm) (40)
G=\sum(im)g^{im}G_{im} (41)
This latter quantity is a measure for the curvature of the field-figure. The principal function of the gravitation field is

\frac{1}{2\varkappa}\int QdS

where

Q=\sqrt{-g}G

In the integral dS, the element of the field-figure, is expressed in x-units. The integration has to be extended over the domain within a certain closed surface \sigma; \varkappa is a positive constant.


§ 33. When we pass from the system of coordinates x_{1},\dots x_{4} to another, the value of G proves to remain unaltered; it is a scalar quantity. This may be verified by first proving that the quantities (ik, lm) form a covariant tensor of the fourth order[21]. Next, g^{kl} being a contravariant tensor of the second order[22], we can deduce from (40) that \left(G_{im}\right) is a covariant tensor of the same order[23]. According to (41) G is then a scalar. The same is true[24] for Q dS.

We remark that g_{ba}=g_{ab}[25] and g_{ab,fe}=g_{ab,ef}. We shall suppose Q to be written in such a way that its form is not altered by interchanging g_{ba} and g_{ab} or g_{ab,fe} and g_{ab,ef}. If originally this condition is not fulfilled it is easy to pass to a "symmetrical" form of this kind.

It is clear that Q may also be expressed in the quantities g^{ab} and their first and second derivatives and in the same way in the \mathfrak{g}^{ab} and first and second derivatives of these quantities.

If the necessary substitutions are executed with due care, these new forms of Q will also be symmetrical.


§ 34. We shall first express the quantity Q in the g_{ab}'s and their derivatives and we shall determine the variation it undergoes by arbitrarily chosen variations \delta g_{ab}, these latter being continuous functions of the coordinates. We have evidently

\delta Q=\sum(ab)\frac{\partial Q}{\partial g_{ab}}\delta g_{ab}+\sum(abe)\frac{\partial Q}{\partial g_{ab,e}}\delta g_{ab,e}+\sum(abef)\frac{\partial Q}{\partial g_{ab,ef}}\delta g_{ab,ef}

By means of the equations

\delta g_{ab,ef}=\frac{\partial}{\partial x_{f}}\delta g_{ab,e} and \delta g_{ab,e}=\frac{\partial}{\partial x_{e}}\delta g_{ab}

this may be decomposed into two parts

\delta Q=\delta_{1}Q+\delta_{2}Q (42)

namely

\delta_{1}Q=\sum(ab)\left\{ \frac{\partial Q}{\partial g_{ab}}-\sum(e)\frac{\partial}{\partial x_{e}}\frac{\partial Q}{\partial g_{ab,e}}+\sum(ef)\frac{\partial^{2}}{\partial x_{e}\partial x_{f}}\frac{\partial Q}{\partial g_{ab,ef}}\right\} \delta g_{ab} (43)
\begin{array}{c}
\delta_{2}Q=\sum(abe)\frac{\partial}{\partial x_{e}}\left(\frac{\partial Q}{\partial g_{ab,e}}\delta g_{ab}\right)+\sum(abef)\frac{\partial}{\partial x_{f}}\left(\frac{\partial Q}{\partial g_{ab,ef}}\delta g_{ab,e}\right)-\\
\\
-\sum(abef)\frac{\partial}{\partial x_{e}}\left\{ \frac{\partial}{\partial x_{f}}\left(\frac{\partial Q}{\partial g_{ab,ef}}\right)\delta g_{ab}\right\} 
\end{array} (44)

The last equation shows that

\int\delta_{2}QdS=0 (45)

if the variations \delta g_{ab} and their first derivatives vanish at the boundary of the domain of integration.
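
The reason why (45) holds can be made explicit by collecting (44) into a single divergence. The following is only a sketch added for the reader; the symbol W_{e} is a label introduced here and does not occur in the paper. In the second term of (44) the summation indices e and f have been interchanged.

```latex
% W_{e} is a hypothetical label used only in this sketch
\begin{array}{l}
\delta_{2}Q=\sum(e)\frac{\partial W_{e}}{\partial x_{e}}\\
\\
W_{e}=\sum(ab)\frac{\partial Q}{\partial g_{ab,e}}\delta g_{ab}+\sum(abf)\frac{\partial Q}{\partial g_{ab,fe}}\delta g_{ab,f}-\sum(abf)\frac{\partial}{\partial x_{f}}\left(\frac{\partial Q}{\partial g_{ab,ef}}\right)\delta g_{ab}
\end{array}
```

The integral of a divergence over the domain reduces to an integral over its boundary, and every term of W_{e} contains either \delta g_{ab} or a first derivative \delta g_{ab,f} as a factor. This is why both the variations and their first derivatives are required to vanish at the boundary.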


§ 35. Equations of the same form may also be found if Q is expressed in one of the two other ways mentioned in § 33. If e.g. we work with the quantities \mathfrak{g}^{ab} we shall find

(\delta Q)=\left(\delta_{1}Q\right)+\left(\delta_{2}Q\right)

where \left(\delta_{1}Q\right) and \left(\delta_{2}Q\right) are directly found from (43) and (44) by replacing g_{ab}, g_{ab,e}, g_{ab,ef}, \delta g_{ab} and \delta g_{ab,e} etc. by \mathfrak{g}^{ab}, \mathfrak{g}^{ab,e} etc. If the variations chosen in the two cases correspond to each other we shall have of course

(\delta Q)=\delta Q

Moreover we can show that the equalities

\left(\delta_{1}Q\right)=\delta_{1}Q,\ \left(\delta_{2}Q\right)=\delta_{2}Q

exist separately.[26]

The decomposition of \delta Q into two parts is therefore the same, whether we use g_{ab},g^{ab} or \mathfrak{g}^{ab}.

It is further of importance that when the system of coordinates is changed, not only \delta QdS is an invariant, but that this is also the case with \delta_{1}QdS and \delta_{2}QdS separately.[27]

We have therefore

\frac{\delta_{1}Q'}{\sqrt{-g'}}=\frac{\delta_{1}Q}{\sqrt{-g}} (46)


§ 36. For the calculation of \delta_{1}Q we shall suppose Q to be expressed in the quantities \mathfrak{g}^{ab} and their derivatives. Therefore (comp. (43))

\delta_{1}Q=\sum(ab)M_{ab}\delta\mathfrak{g}^{ab} (47)

if we put

M_{ab}=\frac{\partial Q}{\partial\mathfrak{g}^{ab}}-\sum(e)\frac{\partial}{\partial x_{e}}\frac{\partial Q}{\partial\mathfrak{g}^{ab,e}}+\sum(ef)\frac{\partial^{2}}{\partial x_{e}\partial x_{f}}\frac{\partial Q}{\partial\mathfrak{g}^{ab,ef}}

Now we can show that the quantities M_{ab} are exactly the quantities G_{ab} defined by (40). To this effect we may use the following considerations.

We know that \left(\tfrac{1}{\sqrt{-g}}\mathfrak{g}^{ab}\right) is a contravariant tensor of the second order. From this we can deduce that \left(\frac{1}{\sqrt{-g}}\delta\mathfrak{g}^{ab}\right) is also such a tensor.

Writing for it \epsilon^{ab} we find according to (46) and (47) that

\sum(ab)M_{ab}\epsilon^{ab}

is a scalar for every choice of \left(\epsilon^{ab}\right).

This involves that \left(M_{ab}\right) is a covariant tensor of the second order and as the same is true for \left(G_{ab}\right) we must prove the equation

M_{ab}=G_{ab}

only for one special choice of coordinates.


§ 37. Now this choice can be made in such a way that at the point P of the field-figure g_{11}=g_{22}=g_{33}=-1, g_{44}=+1, g_{ab}=0 for a\neq b and that moreover all first derivatives g_{ab,e} vanish. If then the values g_{ab} at a point Q near P are developed in series of ascending powers of the differences of coordinates x_{a}(Q)-x_{a}(P) the terms directly following the constant ones will be of the second order. It is with these terms that we are concerned in the calculation both of M_{ab} and of G_{ab} for the point P. As in the results the coefficients of these terms occur to the first power only, it is sufficient to show that each of the above mentioned terms separately contributes the same value to M_{ab} and to G_{ab}.

From these considerations we may conclude that

\delta_{1}Q=\sum(ab)G_{ab}\delta\mathfrak{g}^{ab} (48)

Expressions containing instead of \delta\mathfrak{g}^{ab} either the variations \delta g^{ab} or \delta g_{ab} might be derived from this by using the relations between the different variations. Of these we shall only mention the formula

\delta g^{ab}=\frac{1}{\sqrt{-g}}\delta\mathfrak{g}^{ab}-\frac{g^{ab}}{2\sqrt{-g}}\sum(cd)g_{cd}\delta\mathfrak{g}^{cd} (49)
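
Formula (49) can be checked in a few lines. What follows is a sketch of that check, not part of the original text. The abbreviation s for \sqrt{-g} is introduced here only for brevity, and the steps use \sum(cd)g_{cd}g^{cd}=4 together with the relation \partial\log\sqrt{-g}/\partial g^{cd}=-\tfrac{1}{2}g_{cd}.

```latex
% s = \sqrt{-g} is shorthand used only in this sketch
\begin{array}{l}
\delta\mathfrak{g}^{ab}=s\,\delta g^{ab}+g^{ab}\,\delta s,\qquad\delta s=-\frac{1}{2}s\sum(cd)g_{cd}\,\delta g^{cd}\\
\\
\sum(cd)g_{cd}\,\delta\mathfrak{g}^{cd}=s\sum(cd)g_{cd}\,\delta g^{cd}+4\,\delta s=-2\,\delta s+4\,\delta s=2\,\delta s\\
\\
\delta g^{ab}=\frac{1}{s}\left(\delta\mathfrak{g}^{ab}-g^{ab}\,\delta s\right)=\frac{1}{\sqrt{-g}}\delta\mathfrak{g}^{ab}-\frac{g^{ab}}{2\sqrt{-g}}\sum(cd)g_{cd}\,\delta\mathfrak{g}^{cd}
\end{array}
```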


§ 38. In connexion with what precedes we here insert a consideration the purpose of which will be evident later on. Let the infinitely small quantity \xi be an arbitrarily chosen continuous function of the coordinates and let the variations \delta g_{ab} be defined by the condition that at some point P the quantities g_{ab} have after the change the values which existed before the change at the point Q, to which P is shifted when x_{h} is diminished by \xi, while the three other coordinates are left constant. Then we have

\delta g_{ab}=-g_{ab,h}\xi

and similar formulae for the variations \delta\mathfrak{g}^{ab}.

If for \delta_{1}Q and \delta_{2}Q the expressions (48) and (44) are taken, the equation

\delta Q-\delta_{2}Q=\delta_{1}Q (50)

is an identity for every choice of the variations.

It will likewise be so in the special case considered and we shall also come to an identity if in (50) the terms with the derivatives of \xi are omitted while those with \xi itself are preserved.

When this is done \delta Q reduces to

-\frac{\partial Q}{\partial x_{h}}\xi

and, taking into consideration (44) and (48), we find after division by \xi

\begin{array}{c}
-\frac{\partial Q}{\partial x_{h}}+\sum(abe)\frac{\partial}{\partial x_{e}}\left(\frac{\partial Q}{\partial g_{ab,e}}g_{ab,h}\right)+\sum(abef)\frac{\partial}{\partial x_{e}}\left(\frac{\partial Q}{\partial g_{ab,fe}}g_{ab,fh}\right)-\\
\\
-\sum(abef)\frac{\partial}{\partial x_{e}}\left\{ \frac{\partial}{\partial x_{f}}\left(\frac{\partial Q}{\partial g_{ab,fe}}\right)g_{ab,h}\right\} =-\sum(ab)G_{ab}\mathfrak{g}^{ab,h}
\end{array} (51)

In the second term of (44) we have interchanged here the indices e and f.

If for shortness' sake we put, for e\ne h

\mathfrak{s}_{h}^{e}=\sum(ab)\frac{\partial Q}{\partial g_{ab,e}}g_{ab,h}+\sum(abf)\frac{\partial Q}{\partial g_{ab,fe}}g_{ab,fh}-\sum(abf)\frac{\partial}{\partial x_{f}}\left(\frac{\partial Q}{\partial g_{ab,fe}}\right)g_{ab,h} (52)

and for e=h

\mathfrak{s}_{h}^{h}=-Q+\sum(ab)\frac{\partial Q}{\partial g_{ab,h}}g_{ab,h}+\sum(abf)\frac{\partial Q}{\partial g_{ab,fh}}g_{ab,fh}-\sum(abf)\frac{\partial}{\partial x_{f}}\left(\frac{\partial Q}{\partial g_{ab,hf}}\right)g_{ab,h} (53)

we may write

\sum(e)\frac{\partial\mathfrak{s}_{h}^{e}}{\partial x_{e}}=-\sum(ab)G_{ab}\mathfrak{g}^{ab,h} (54)

The set of quantities \mathfrak{s}_{h}^{e} will be called the complex \mathfrak{s} and the set of the four quantities which stand on the left hand side of (54) in the cases h=1,2,3,4, the divergency of the complex.[28] It will be denoted by div\mathfrak{s} and each of the four quantities separately by div_{h}\mathfrak{s}.

The equation therefore becomes

div_{h}\mathfrak{s}=-\sum(ab)G_{ab}\mathfrak{g}^{ab,h} (55)
If we take other coordinates the right hand side of this equation is transformed according to a formula which can be found easily. Hence we can also write down the transformation formula for the left hand side. It is as follows
div'_{h}\mathfrak{s}'=p\sum(m)p_{mh}div_{m}\mathfrak{s}-Q\sum(a)p_{ah}\frac{\partial p}{\partial x_{a}}+2p\sum(abc)p_{ah,c}\mathfrak{g}^{bc}G_{ab} (56)


§ 39. We shall now consider a second complex \mathfrak{s}_{0}, the components of which are defined by

\mathfrak{s}_{0h}^{e}=-G\sum(a)\mathfrak{g}^{ae}g_{ah}+2\sum(a)\mathfrak{g}^{ae}G_{ah} (57)

Taking also the divergency of this complex we find that the difference

div'_{h}\mathfrak{s}'_{0}-p\sum(m)p_{mh}div_{m}\mathfrak{s}_{0}

has just the value which we can deduce from (56) for the corresponding difference

div'_{h}\mathfrak{s}'-p\sum(m)p_{mh}div_{m}\mathfrak{s}

It is thus seen that

div'_{h}\mathfrak{s}'-div'_{h}\mathfrak{s}'_{0}=p\sum(m)p_{mh}\left(div_{m}\mathfrak{s}-div_{m}\mathfrak{s}_{0}\right)

and that we have therefore

div\mathfrak{s}=div\mathfrak{s}_{0} (58)

for all systems of coordinates as soon as this is the case for one system.

Now a direct calculation starting from (52), (53) and (57) teaches us that the terms with the highest derivatives of the quantities g_{ab}, (viz. those of the third order) are the same in div_{h}\mathfrak{s} and div_{h}\mathfrak{s}_{0}. Further it is evident that in the system of coordinates introduced in § 37 these terms with the third derivatives are the only ones. This proves the general validity of equation (58). It is especially to be noticed that if \mathfrak{s} and \mathfrak{s}_{0} are determined by (52), (53) and (57) and if the function defined in § 32 is taken for G, the relation is an identity.


§ 40. We shall now derive the differential equations for the gravitation field, first for the case of an electromagnetic system.[29] For the part of the principal function belonging to it we write

\int\mathrm{L}dS

where \mathrm{L} is defined by (35) (1915). From \mathrm{L} we can derive the stresses, the momenta, the energy-current and the energy of the electromagnetic system; for this purpose we must use the equations (45) and (46) (1915) or in Einstein's notation, which we shall follow here,[30]

\mathfrak{T}_{c}^{c}=-\mathrm{L}+\sum\limits _{a\ne c}(a)\psi_{ac}^{*}\psi_{a'c'} (59)

and for b\ne c

\mathfrak{T}_{c}^{b}=\sum\limits _{a\ne c}(a)\psi_{ab}^{*}\psi_{a'c'} (60)

The set of quantities \mathfrak{T}_{c}^{b} might be called the stress-energy-complex (comp. § 38). As for a change of the system of coordinates the transformation formulae for \mathfrak{T} are similar to those by which tensors are defined, we can also speak of the stress-energy-tensor. We have namely

\frac{1}{\sqrt{-g'}}\mathfrak{T}_{c}^{'b}=\frac{1}{\sqrt{-g}}\sum(kl)p_{kc}\pi_{lb}\mathfrak{T}_{k}^{l}


§ 41. The equations for the gravitation field are now obtained (comp. §§ 13 and 14, 1915) from the condition that

\delta_{\psi}\int\mathrm{L}dS+\frac{1}{2\varkappa}\delta\int QdS=0 (61)

for all variations \delta g_{ab} which vanish at the boundary of the field of integration together with their first derivatives. The index \psi in the first term indicates that in the variation of \mathrm{L} the quantities \psi_{ab} must be kept constant.

If we suppose \mathrm{L} to be expressed in the quantities g^{ab} and if (42), (45) and (48) are taken into consideration, we find from (61) that at each point of the field-figure

\sum(ab)\left(\frac{\partial\mathrm{L}}{\partial g^{ab}}\right)_{\psi}\delta g^{ab}+\frac{1}{2\varkappa}\sum(ab)G_{ab}\delta\mathfrak{g}^{ab}=0 (62)

If now in the first term we put

\left(\frac{\partial\mathrm{L}}{\partial g^{ab}}\right)_{\psi}=\frac{1}{2}\sqrt{-g}T_{ab} (63)

and if for \delta g^{ab} the value (49) is substituted, this term becomes

\frac{1}{2}\sum(ab)T_{ab}\delta\mathfrak{g}^{ab}-\frac{1}{4}\sum(abcd)g^{ab}g_{cd}T_{ab}\delta\mathfrak{g}^{cd}

or if in the latter summation a, b is interchanged with c, d and if the quantity

T=\sum(cd)g^{cd}T_{cd} (64)

is introduced,

\frac{1}{2}\sum(ab)\left(T_{ab}-\frac{1}{2}g_{ab}T\right)\delta\mathfrak{g}^{ab}

Finally, putting equal to zero the coefficient of each \delta\mathfrak{g}^{ab} we find from (62) the differential equation required

G_{ab}=-\varkappa\left(T_{ab}-\frac{1}{2}g_{ab}T\right) (65)

This is of the same form as Einstein's field equations, but to see that the formulae really correspond to each other it remains to show that the quantities T_{ab} and \mathfrak{T}_{c}^{b} defined by (63), (59) and (60) are connected by Einstein's formulae

\mathfrak{T}_{c}^{b}=\sqrt{-g}\sum(a)g^{ab}T_{ac} (66)

We must have therefore

2\sum(a)g^{ac}\left(\frac{\partial\mathrm{L}}{\partial g^{ac}}\right)_{\psi}=-\mathrm{L}+\sum\limits _{a\ne c}(a)\psi_{ac}^{*}\psi_{a'c'} (67)

and for b\ne c

2\sum(a)g^{ab}\left(\frac{\partial\mathrm{L}}{\partial g^{ac}}\right)_{\psi}=\sum\limits _{a\ne c}(a)\psi_{ab}^{*}\psi_{a'c'} (68)


§ 42. This can be tested in the following way. The function \mathrm{L} (comp. § 9, 1915) is a homogeneous quadratic function of the \psi_{ab}'s and when differentiated with respect to these variables it gives the quantities \bar{\psi}_{ab}. It may therefore also be regarded as a homogeneous quadratic function of the \bar{\psi}_{ab}. From (35), (29) and (32)[31], 1915 we find therefore

L=\frac{1}{8}\sqrt{-g}\sum(pqrs)\left(g^{pr}g^{qs}-g^{qr}g^{ps}\right)\bar{\psi}_{pq}\bar{\psi}_{rs} (69)

Now we can also differentiate with respect to the g^{ab}'s, while not the \psi_{ab}'s but the quantities \bar{\psi}_{ab} are kept constant, and we have e.g.

\left(\frac{\partial\mathrm{L}}{\partial g^{ac}}\right)_{\psi}=-\left(\frac{\partial\mathrm{L}}{\partial g^{ac}}\right)_{\bar{\psi}} (70)

According to (69) one part of the latter differential coefficient is obtained by differentiating the factor \sqrt{-g} only and the other part by keeping this factor constant.

For the calculation of the first of these parts we can use the relation

\frac{\partial\log\left(\sqrt{-g}\right)}{\partial g^{ac}}=-\frac{1}{2}g_{ac}

and for the second part we find

\frac{1}{2}\sqrt{-g}\sum(pq)g^{pq}\bar{\psi}_{ap}\bar{\psi}_{cq}
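
The relation for \log\left(\sqrt{-g}\right) used for the first part can be recovered as follows. This is a sketch added for the reader; it assumes that g^{ac} and g^{ca} are treated as independent variables in the differentiation and uses the standard rule for the variation of a determinant.

```latex
\begin{array}{l}
\log\left|\det\left(g^{ab}\right)\right|=-\log\left(-g\right)\\
\\
\delta\log\left|\det\left(g^{ab}\right)\right|=\sum(ab)g_{ab}\,\delta g^{ab}\\
\\
\delta\log\left(\sqrt{-g}\right)=\frac{1}{2}\,\delta\log\left(-g\right)=-\frac{1}{2}\sum(ab)g_{ab}\,\delta g^{ab}
\end{array}
```

The first line holds because the g_{ab} and the g^{ab} are inverse systems, so that their determinants are reciprocal.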

If (32) 1915 is used (67) and (68) finally become

\begin{array}{c}
\sum(q)\psi_{cq}\bar{\psi}_{cq}+\sum\limits _{a\ne c}(a)\psi_{ac}^{*}\psi_{a'c'}=2\mathrm{L}\\
\\
\sum(q)\bar{\psi}_{cq}\psi_{bq}+\sum\limits _{a\ne c}(a)\psi_{ab}^{*}\psi_{a'c'}=0
\end{array}

These equations are really fulfilled. This is evident from \psi_{aa}=0, \bar{\psi}_{aa}=0, \psi_{ba}=-\psi_{ab} and \bar{\psi}_{ba}=-\bar{\psi}_{ab}; besides, the meaning of \psi_{ab}^{*} (§ 11, 1915) and equation (35) 1915 must be taken into consideration.


§ 43. In nearly the same way we can treat the gravitation field of a system of incoherent material points; here the quantities w_{a} and u_{a} (§§ 4 and 5, 1915) play a similar part as \psi_{ab} and \bar{\psi}_{ab} in what precedes. To consider a more general case we can suppose "molecular forces" to act between the material points (which we assume to be equal to each other); in such a way that in ordinary mechanics we should ascribe to the system a potential energy depending on the density only. Conforming to this we shall add to the Lagrangian function \mathrm{L} (§ 4, 1915) a term which is some function of the density of the matter at the point P of the field-figure, such as that density is when by a transformation the matter at that point has been brought to rest. This can also be expressed as follows. Let d\sigma be an infinitely small three-dimensional extension expressed in natural units, which at the point P is perpendicular to the world-line passing through that point, and \bar{\varrho}d\sigma the number of points where d\sigma intersects world-lines. The contribution of an element of the field-figure to the principal function will then be found by multiplying the magnitude of that element expressed in natural units by a function of \bar{\varrho}. Further calculation teaches us that the term to be added to \mathrm{L} must have the form

\sqrt{-g}\varphi\left(\frac{P}{\sqrt{-g}}\right) (71)
where P is given by (15) 1915. As the Lagrangian function defined by (11) 1915 equally falls under this form and also the sum of this function and the new term, the expression (71) may be regarded as the total function \mathrm{L}. The function \varphi may be left indeterminate. If now with this function the calculations of §§ 5 and 6, 1915 are repeated, we find the components of the stress-energy-tensor of the matter.

The equations for the gravitation field again take the form (65). T_{ab} is defined by an equation of the form (63), where on the left hand side we must differentiate while the w_{a}'s are kept constant. Relation (66) can again be verified without difficulty.

We shall not, however, dwell upon this, as the following considerations are more general and apply e.g. also to systems of material points that are anisotropic as regards the configuration and the molecular actions.


§ 44. At any point P of the field-figure the Lagrangian function \mathrm{L} will evidently be determined by the course and the mutual situation of the world-lines of the material points in the neighbourhood of P. This leads to the assumption that for constant g_{ab}'s the variation \delta\mathrm{L} is a homogeneous linear function of the virtual displacements \delta x_{a} of the material points and of the differential coefficients

\frac{\partial\delta x_{a}}{\partial x_{b}}

these last quantities evidently determining the deformation of an infinitesimal part of the figure formed by the world-lines[32].

The calculation becomes most simple if we put

\mathrm{L}=\sqrt{-g}H (72)

and for constant g_{ab}'s

\delta H=\sum(a)U_{a}\delta x_{a}+\sum(ab)V_{a}^{b}\frac{\partial\delta x_{a}}{\partial x_{b}} (73)

Considerations corresponding exactly to those mentioned in §§ 4 — 6, 1915, now lead to the equations of motion and to the following expressions for the components of the stress-energy-tensor

\mathfrak{T}_{c}^{c}=-\mathrm{L}-\sqrt{-g}V_{c}^{c} (74)

and for b\ne c

\mathfrak{T}_{c}^{b}=-\sqrt{-g}V_{c}^{b} (75)
The differential equations again take the form (65) if the quantities T_{ab} are defined by

\left(\frac{\partial\mathrm{L}}{\partial g^{ab}}\right)_{x}=\frac{1}{2}\sqrt{-g}T_{ab}

in the differentiation on the left hand side the coordinates of the material points are kept constant. To show that T_{ab} and \mathfrak{T}_{c}^{b} satisfy equation (66) we must now show that

-\mathrm{L}-\sqrt{-g}V_{c}^{c}=2\sum(a)g^{ac}\left(\frac{\partial\mathrm{L}}{\partial g^{ac}}\right)_{x}

and for b\ne c

-\sqrt{-g}V_{c}^{b}=2\sum(a)g^{ab}\left(\frac{\partial\mathrm{L}}{\partial g^{ac}}\right)_{x}

If here the value (72) is substituted for \mathrm{L} and if (70) is taken into account, these equations say that for all values of b and c we must have

2\sum(a)g^{ab}\left(\frac{\partial H}{\partial g^{ac}}\right)_{x}+V_{c}^{b}=0 (76)
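
The step from the two preceding conditions to (76) can be written out as follows. This is a sketch under the assumption that \partial\log\left(\sqrt{-g}\right)/\partial g^{ac}=-\tfrac{1}{2}g_{ac}, as used in § 42; the Kronecker symbol \delta_{c}^{b} (equal to 1 for b=c and to 0 otherwise) is introduced here for brevity.

```latex
\begin{array}{l}
\left(\frac{\partial\mathrm{L}}{\partial g^{ac}}\right)_{x}=\sqrt{-g}\left(\frac{\partial H}{\partial g^{ac}}\right)_{x}-\frac{1}{2}\sqrt{-g}\,g_{ac}H\\
\\
2\sum(a)g^{ab}\left(\frac{\partial\mathrm{L}}{\partial g^{ac}}\right)_{x}=2\sqrt{-g}\sum(a)g^{ab}\left(\frac{\partial H}{\partial g^{ac}}\right)_{x}-\delta_{c}^{b}\,\mathrm{L}
\end{array}
```

For b=c the term -\mathrm{L} cancels against the same term on the other side of the condition; for b\ne c it is absent. In both cases division by \sqrt{-g} leaves (76).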

Now this relation immediately follows from a condition, to which \mathrm{L} must be subjected at any rate, viz. that \mathrm{L}dS is a scalar quantity. This involves that in a definite case we must find for H always the same value whatever be the choice of coordinates.


§ 45. Let us suppose that instead of only one coordinate x_{c} a new one x'_{c} has been introduced, which differs infinitely little from x_{c}, with the restriction that if

x'_{c}=x_{c}+\xi_{c}

the term \xi_{c} depends on the coordinate x_b only and is zero at the point in question of the field-figure. The quantities g^{ab} then take other values and in the new system of coordinates the world-lines of the material points will have a slightly changed course.

By each of these circumstances separately H would change, but all together must leave it unaltered. As to the first change we remark that, according to the transformation formula for g^{ab}, the variation \delta g^{ab} vanishes when the two indices are different from c, while

\delta g^{cc}=2g^{cb}\frac{\partial\xi_{c}}{\partial x_{b}}

and for a\ne c

\delta g^{ac}=\delta g^{ca}=g^{ab}\frac{\partial\xi_{c}}{\partial x_{b}}

The change of H due to these variations is

2\frac{\partial\xi_{c}}{\partial x_{b}}\sum(a)g^{ab}\left(\frac{\partial H}{\partial g^{ac}}\right)_{x}

Further, in the new system of coordinates the figure formed by the world-lines differs from that figure in the old system by the variation \delta x_{c}=\xi_{c} which is a function of x_{b} only. Therefore according to (73) the second variation of H is

V_{c}^{b}\frac{\partial\xi_{c}}{\partial x_{b}}

By putting equal to zero the sum of this expression and the preceding one we obtain (76).
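
The variations of g^{ab} used in this section can be obtained from the transformation law of a contravariant tensor of the second order. The sketch below writes that law with explicit partial derivatives instead of the paper's transformation coefficients, and keeps only terms of the first order in \xi_{c}; the symbol \delta_{cl} denotes 1 for c=l and 0 otherwise.

```latex
\begin{array}{l}
g'^{mn}=\sum(kl)\frac{\partial x'_{m}}{\partial x_{k}}\frac{\partial x'_{n}}{\partial x_{l}}g^{kl},\qquad\frac{\partial x'_{c}}{\partial x_{l}}=\delta_{cl}+\delta_{lb}\frac{\partial\xi_{c}}{\partial x_{b}}\\
\\
a\ne c:\quad g'^{ac}=g^{ac}+g^{ab}\frac{\partial\xi_{c}}{\partial x_{b}}\\
\\
g'^{cc}=g^{cc}+2g^{cb}\frac{\partial\xi_{c}}{\partial x_{b}}
\end{array}
```

If H is written symmetrically in g^{ac} and g^{ca}, the terms with \delta g^{ac} and \delta g^{ca} contribute equally, which accounts for the factor 2 in the change of H.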


§ 46. We have thus deduced for some cases the equations of the gravitation field from the variation theorem. Probably this can also be done for thermodynamic systems, if the Lagrangian function is properly chosen in connexion with the thermodynamic functions, entropy and free energy. But as soon as we are concerned with irreversible phenomena, when e.g. the energy-current consists in a conduction of heat, the variation principle cannot be applied. We shall then be obliged to take Einstein's field-equations as our point of departure, unless, considering the motions of the individual atoms or molecules, we succeed in treating these by means of the generalized principle of Hamilton.


§ 47. Finally we shall consider the stresses, the energy etc. which belong to the gravitation field itself. The results will be the same for all the systems treated above, but we shall confine ourselves to the case of §§ 44 and 45. We suppose certain external forces K_{a} to act on the material points, though we shall see that strictly speaking this is not allowed.

For any displacements \delta x_{a} of the matter and variations of the gravitation field we first have the equation which summarizes what we found above

\begin{array}{l}
\delta\mathrm{L}+\frac{1}{2\varkappa}\delta Q+\sum(a)K_{a}\delta x_{a}=\sqrt{-g}\sum(a)U_{a}\delta x_{a}+\\
\\
\qquad+\sum(ab)\frac{\partial}{\partial x_{b}}\left(\sqrt{-g}V_{a}^{b}\delta x_{a}\right)-\sum(ab)\frac{\partial}{\partial x_{b}}\left(\sqrt{-g}V_{a}^{b}\right)\delta x_{a}+\\
\\
\qquad\qquad+\sum(ab)\left(\frac{\partial\mathrm{L}}{\partial g^{ab}}\right)_{x}\delta g^{ab}+\frac{1}{2\varkappa}\delta_{1}Q+\frac{1}{2\varkappa}\delta_{2}Q+\sum(a)K_{a}\delta x_{a}.
\end{array}

In virtue of the equations of motion of the matter, the terms with \delta x_{a} cancel each other on the right hand side and similarly, on account of the equations of the gravitation field, the terms with \delta g^{ab} and \delta_{1}Q. Thus we can write[33]

\sum(a)K_{a}\delta x_{a}=-\delta\mathrm{L}+\sum(ae)\frac{\partial}{\partial x_{e}}\left(\sqrt{-g}V_{a}^{e}\delta x_{a}\right)-\frac{1}{2\varkappa}\left(\delta Q-\delta_{2}Q\right) (77)

Let us now suppose that only the coordinate x_{h} undergoes an infinitely small change, which has the same value at all points of the field-figure. Let at the same time the system of values g_{ab} be shifted everywhere in the direction of x_{h} over the distance \delta x_{h}. The left hand side of the equation then becomes K_{h}\delta x_{h} and we have on the right hand side

\delta\mathrm{L}=-\frac{\partial\mathrm{L}}{\partial x_{h}}\delta x_{h},\ \delta Q=-\frac{\partial Q}{\partial x_{h}}\delta x_{h}

After dividing the equation by \delta x_{h} we may thus, according to (74) and (75), write for the first two terms on the right hand side

-\sum(e)\frac{\partial\mathfrak{T}_{h}^{e}}{\partial x_{e}}=-div_{h}\mathfrak{T}

By the same division we obtain from \delta Q-\delta_{2}Q the expression occurring on the left hand side of (51), which we have represented by

\sum(e)\frac{\partial\mathfrak{s}_{h}^{e}}{\partial x_{e}}=div_{h}\mathfrak{s}

where the complex \mathfrak{s} is defined by (52) and (53). If therefore we introduce a new complex \mathfrak{t} which differs from \mathfrak{s} only by the factor \tfrac{1}{2\varkappa}, so that

\mathfrak{t}_{h}^{e}=\frac{1}{2\varkappa}\mathfrak{s}_{h}^{e} (78)

we find

K_{h}=-div_{h}\mathfrak{T}-div_{h}\mathfrak{t} (79)

The form of this equation leads us to consider \mathfrak{t} as the stress-energy-complex of the gravitation field, just as \mathfrak{T} is the stress-energy-tensor for the matter. We need not further explain that for the case K_{h}=0 the four equations contained in (79) express the conservation of momentum and of energy for the total system, matter and gravitation field taken together.


§ 48. To learn something about the nature of the stress-energy-complex \mathfrak{t} we shall consider the stationary gravitation field caused by a quantity of matter without motion and distributed symmetrically around a point O. In this problem it is convenient to introduce for the three space coordinates x_{1},x_{2},x_{3}, (x_{4} will represent the time) "polar" coordinates. By x_{3} we shall therefore denote a quantity r which is a measure for the "distance" to the centre. As to x_{1} and x_{2}, we shall put x_{1}=\cos\vartheta, x_{2}=\varphi, after first having introduced polar coordinates \vartheta, \varphi (in such a way that the rectangular coordinates are r\cos\vartheta, r\sin\vartheta\cos\varphi, r\sin\vartheta\sin\varphi). It can be proved that, because of the symmetry about the centre, g_{ab}=0 for a\ne b, while we may put for the quantities g_{aa}

g_{11}=-\frac{u}{1-x_{1}^{2}},\ g_{22}=-u\left(1-x_{1}^{2}\right),\ g_{33}=-v,\ g_{44}=w (80)

where u, v, w are certain functions of r. Differentiations of these functions will be represented by accents. We now find that of the complex \mathfrak{t} only the components \mathfrak{t}_{1}^{1}, \mathfrak{t}_{3}^{3} and \mathfrak{t}_{4}^{4} are different from zero. The expressions found for them may be further simplified by properly choosing r. If the distance to the centre is measured by the time the light requires to be propagated from O to the point in question, we have w = v. One then finds

\left.\begin{array}{l}
\mathfrak{t}_{1}^{1}=\frac{1}{2\varkappa}\left(-\frac{u'^{2}}{2u}+2u''-\frac{uv'^{2}}{v^{2}}+\frac{uv''}{v}\right),\\
\\
\mathfrak{t}_{3}^{3}=\frac{1}{2\varkappa}\left(-2v+\frac{u'^{2}}{2u}+\frac{u'v'}{v}\right),\\
\\
\mathfrak{t}_{4}^{4}=\frac{1}{2\varkappa}\left(-2v-\frac{u'^{2}}{2u}+2u''+\frac{uv''}{v}\right),
\end{array}\right\} (81)


§ 49. We must assume that in the gravitation fields really existing the quantities g_{ab} have values differing very little from those which belong to a field without gravitation. In this latter we should have

u=r^{2},\ v=w=1,

and thus we put now

u=r^{2}(1+\mu),\ v=w=1+\nu

where the quantities \mu and \nu which depend on r are infinitely small, say of the first order, and their derivatives too. Neglecting quantities of the second order we find from (81)

\begin{array}{l}
\mathfrak{t}_{1}^{1}=\frac{1}{2\varkappa}\left(2+2\mu+6r\mu'+2r^{2}\mu''+r^{2}\nu''\right),\\
\\
\mathfrak{t}_{3}^{3}=\frac{1}{\varkappa}\left(\mu-\nu+r\mu'+r\nu'\right),\\
\\
\mathfrak{t}_{4}^{4}=\frac{1}{2\varkappa}\left(2\mu-2\nu+6r\mu'+2r^{2}\mu''+r^{2}\nu''\right),
\end{array}
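
The passage from (81) to these first-order expressions can be checked numerically. The following is a minimal sketch, not the author's procedure: it assumes \varkappa=1, the arbitrary test profiles \mu=\epsilon r^{2} and \nu=\epsilon r^{3}, and it takes the last term of \mathfrak{t}_{3}^{3} in (81) as u'v'/v, the form that reproduces the term r\nu' of the linearized expression. All function names are hypothetical.

```python
# Sketch: compare the exact expressions (81) with their first-order forms.
# Assumptions (not from the paper): kappa = 1, mu = eps*r**2, nu = eps*r**3,
# and the last term of t_3^3 is taken as u'v'/v.

KAPPA = 1.0


def exact(r, eps):
    """Components (t11, t33, t44) from (81) with w = v."""
    # u = r^2 (1 + mu) and its first and second derivatives
    u, du, ddu = r**2 + eps * r**4, 2 * r + 4 * eps * r**3, 2 + 12 * eps * r**2
    # v = 1 + nu and its first and second derivatives
    v, dv, ddv = 1 + eps * r**3, 3 * eps * r**2, 6 * eps * r
    c = 1.0 / (2.0 * KAPPA)
    t11 = c * (-du**2 / (2 * u) + 2 * ddu - u * dv**2 / v**2 + u * ddv / v)
    t33 = c * (-2 * v + du**2 / (2 * u) + du * dv / v)
    t44 = c * (-2 * v - du**2 / (2 * u) + 2 * ddu + u * ddv / v)
    return t11, t33, t44


def linear(r, eps):
    """The same components with quantities of the second order neglected."""
    mu, dmu, ddmu = eps * r**2, 2 * eps * r, 2 * eps
    nu, dnu, ddnu = eps * r**3, 3 * eps * r**2, 6 * eps * r
    c = 1.0 / (2.0 * KAPPA)
    t11 = c * (2 + 2 * mu + 6 * r * dmu + 2 * r**2 * ddmu + r**2 * ddnu)
    t33 = (1.0 / KAPPA) * (mu - nu + r * dmu + r * dnu)
    t44 = c * (2 * mu - 2 * nu + 6 * r * dmu + 2 * r**2 * ddmu + r**2 * ddnu)
    return t11, t33, t44


def max_gap(r, eps):
    """Largest absolute difference between exact and linearized components."""
    return max(abs(a - b) for a, b in zip(exact(r, eps), linear(r, eps)))


if __name__ == "__main__":
    for eps in (1e-2, 1e-3, 1e-4):
        print(eps, max_gap(1.0, eps))
```

If the linearization is right, the gap should shrink roughly a hundredfold each time \epsilon is reduced tenfold, since only terms of the second order have been dropped.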

For our degree of approximation we may suppose that of the quantities T_{ab} only T_{44} differs from 0. If we put

T_{44}=\varrho (82)

a quantity which depends on r and which we shall assume to be zero outside a certain sphere, we find from the field equations

\begin{array}{c}
\mu=\varkappa\left\{ -\frac{2}{r}\int\limits _{0}^{r}\frac{dr}{r}\int\limits _{0}^{r}r^{2}\varrho dr-\frac{1}{r}\int\limits _{0}^{r}r^{2}\varrho dr+\int\limits _{\infty}^{r}r\varrho dr\right\} ,\\
\\
\nu=\varkappa\left\{ -\frac{1}{r}\int\limits _{0}^{r}r^{2}\varrho dr+\int\limits _{\infty}^{r}r\varrho dr\right\} 
\end{array}

We thus obtain

\mathfrak{t}_{1}^{1}=\frac{1}{\varkappa}+\int\limits _{\infty}^{r}r\varrho dr-\frac{1}{r}\int\limits _{0}^{r}r^{2}\varrho dr-\frac{1}{2}r^{2}\varrho, (83)
\mathfrak{t}_{3}^{3}=0,\ \mathfrak{t}_{4}^{4}=-\frac{1}{2}r^{2}\varrho (84)
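
The values (83) and (84) can be verified by substituting the expressions for \mu and \nu into the first-order forms of \mathfrak{t}_{1}^{1}, \mathfrak{t}_{3}^{3} and \mathfrak{t}_{4}^{4}. The sketch below is added for the reader; the abbreviations M, N and A are introduced here and are not the paper's notation.

```latex
% M, N, A are hypothetical abbreviations used only in this sketch
\begin{array}{l}
M=\int\limits _{0}^{r}r^{2}\varrho dr,\quad N=\int\limits _{\infty}^{r}r\varrho dr,\quad A=\int\limits _{0}^{r}\frac{M}{r}dr\\
\\
\mu=\varkappa\left(-\frac{2A}{r}-\frac{M}{r}+N\right),\quad\mu'=\varkappa\left(\frac{2A}{r^{2}}-\frac{M}{r^{2}}\right),\quad\mu''=\varkappa\left(-\frac{4A}{r^{3}}+\frac{4M}{r^{3}}-\varrho\right)\\
\\
\nu=\varkappa\left(-\frac{M}{r}+N\right),\quad\nu'=\varkappa\frac{M}{r^{2}},\quad\nu''=\varkappa\left(-\frac{2M}{r^{3}}+\varrho\right)\\
\\
\mu-\nu+r\mu'+r\nu'=0\\
\\
2\mu-2\nu+6r\mu'+2r^{2}\mu''+r^{2}\nu''=-\varkappa r^{2}\varrho\\
\\
2\mu+6r\mu'+2r^{2}\mu''+r^{2}\nu''=\varkappa\left(2N-\frac{2M}{r}-r^{2}\varrho\right)
\end{array}
```

In the last three lines the terms with A cancel, which is why the double integral occurring in \mu does not appear in the result.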


§ 50. If first we leave aside the first term of \mathfrak{t}_{1}^{1}, which would also exist if no attracting matter were present, it is remarkable that the gravitation constant \varkappa does not occur in the stress \mathfrak{t}_{1}^{1} nor in the energy \mathfrak{t}_{4}^{4}; the same would have been found if we had used other coordinates. This constitutes an important difference between Einstein's theory and other theories in which attracting or repulsing forces are reduced to "field actions". The pulsating spheres of Bjerknes e.g. are subjected to forces which, for a given motion, are proportional to the density of the fluid in which they are imbedded; and the changes of pressure and the energy in that fluid are likewise proportional to this density. In this case we shall therefore ascribe to the stress-energy-complex values proportional to the intensity of the actions which we want to explain. In Einstein's theory such a proportionality does not exist. The value of \mathfrak{t}_{4}^{4} is of the same order of magnitude as \mathfrak{T}_{4}^{4} in the matter. To our degree of approximation we find namely from (82) \mathfrak{T}_{4}^{4}=r^{2}\varrho.


§ 51. If we had not worked with polar coordinates but with rectangular coordinates we should have had to put for the field without gravitation g_{11}=g_{22}=g_{33}=-1, g_{44}=1, g_{ab}=0 for a\ne b. Then we should have found zero for all the components of the complex. In the system of coordinates used above we found for the field without gravitation \mathfrak{t}_{1}^{1}=\tfrac{1}{\varkappa}; this is due to the complex \mathfrak{t} being no tensor. If it were, the quantities \mathfrak{t}_{a}^{b} would be zero in every system of coordinates if they had that value in one system.

It is also remarkable that in real cases the first term in (83) can be much larger than the following ones. If we consider e.g. a point P outside the attracting sphere, we can prove that the ratio of the first term to the third is of the same order as the ratio of the square of the velocity of light to the square of the velocity with which a material point can describe a circular orbit passing through P.

The following must also be noticed. In the system of polar coordinates used above there will exist in the field without gravitation the stress \mathfrak{t}_{1}^{1}=\tfrac{1}{\varkappa}. If a stress of this magnitude were produced by means of actions which give rise to a stress-energy-tensor, the passage to rectangular coordinates would give us a stress which becomes infinite at the point O. In those coordinates we should namely have

\mathfrak{t}_{1}^{'1}=\frac{\sin^{2}\vartheta}{r^{2}}\cdot\frac{1}{\varkappa}


§ 52. Evidently it would be more satisfactory if we could ascribe a stress-energy-tensor to the gravitation field. Now this can really be done. Indeed, the quantities \mathfrak{s}_{0h}^{e} determined by (57) form a tensor and according to (58), (79) may be replaced by

K_{h}=-div_{h}\mathfrak{T}-div_{h}\mathfrak{t}_{0} (85)

if \mathfrak{t}_{0} is defined by a relation similar to (78), viz.

\mathfrak{t}_{0h}^{e}=\frac{1}{2\varkappa}\mathfrak{s}_{0h}^{e} (86)

Equation (85) shows that, just as well as \mathfrak{t}_{h}^{e}, we may consider the quantities \mathfrak{t}_{0h}^{e} as the stresses etc. in the gravitation field. This way of interpretation is very simple. With a view to (41) we can namely derive from the equations for the gravitation field (65)

G=\varkappa T

and

T_{ab}=-\frac{1}{\varkappa}\left(G_{ab}-\frac{1}{2}g_{ab}G\right)

Further we find from (66)

\mathfrak{T}_{h}^{e}=\frac{1}{2\varkappa}G\sum(a)\mathfrak{g}^{ae}g_{ah}-\frac{1}{\varkappa}\sum(a)\mathfrak{g}^{ae}G_{ah}

and from (57) and (86)

\mathfrak{t}_{0h}^{e}=-\mathfrak{T}_{h}^{e} (87)

At every point of the field-figure the components of the stress-energy-tensor of the gravitation field would therefore be equal to the corresponding quantities for the matter or the electro-magnetic system with the opposite sign. It is obvious that by this the condition of the conservation of momentum and energy for the whole system would be immediately fulfilled. It was in fact this circumstance that made me think of the tensor \mathfrak{t}_{0}=-\mathfrak{T}. The way in which \mathfrak{s}_{0} was introduced in §§ 38 and 39 has only been chosen in order to lay stress on (58) being an identity, so that equation (85) is but another form of (79).

At first sight the relations (87) and the conception to which they have led, may look somewhat startling. According to it we should have to imagine that behind the directly observable world with its stresses, energy etc. there is hidden the gravitation field with stresses, energy etc. that are everywhere equal and opposite to the former; evidently this is in agreement with the interchange of momentum and energy which accompanies the action of gravitation. On the way of a light-beam e.g. there would be everywhere in the gravitation field an energy current equal and opposite to the one existing in the beam. If we remember that this hidden energy-current can be fully described mathematically by the quantities g_{ab} and that only the interchange just mentioned makes it perceptible to us, this mode of viewing the phenomena does not seem unacceptable. At all events we are forcibly led to it if we want to preserve the advantage of a stress-energy-tensor also for the gravitation field. It can namely be shown that a tensor which is transformed in the same way as the tensor \mathfrak{t}_{0} defined by (57) and (86) and which in every system of coordinates has the same divergency as the latter, must coincide with \mathfrak{t}_{0}.

Finally we may remark that (78), (86), (58), (87) give

div\ \mathfrak{t}=div\ \mathfrak{t}_{0}=-div\ \mathfrak{T}

so that we have, both from (79) and from (85), K_{h}=0.

The question is this, that, so long as the gravitation field is considered as given, we may introduce "external" forces, but that in the equations for the gravitation field itself we must also take into consideration the stress-energy-tensor of the system by which those forces are exerted.

IV.

(Communicated in the meeting of October 28, 1916).


§ 53. The expressions for the stress-energy-components of the gravitation field found in the preceding paper call for some further remarks. If by \delta_{h}^{e} we denote a quantity having the value 1 for e = h and being 0 for e\ne h, those expressions can be written in the form (comp. equations (52) and (78))

\mathfrak{t}_{h}^{e}=\frac{1}{2\varkappa}\left\{ -\delta_{h}^{e}Q+\sum(ab)\frac{\partial Q}{\partial g_{ab,e}}g_{ab,h}+\sum(abf)\frac{\partial Q}{\partial g_{ab,fe}}g_{ab,fh}-\sum(abf)\frac{\partial}{\partial x_{f}}\left(\frac{\partial Q}{\partial g_{ab,ef}}\right)g_{ab,h}\right\} (88)

They contain the first and second derivatives of the quantities g_{ab}. Einstein on the contrary has given values for the stress-energy-components which contain the first derivatives only and which therefore are in many respects much more fit for application.

It will now be shown how we can also find formulae without second derivatives, if we start from (88).


§ 54. For this purpose we shall consider the complex \mathfrak{u} defined by

\mathfrak{u}_{h}^{e}=\frac{1}{2\varkappa}\left\{ \delta_{h}^{e}Q-\sum(abf)\frac{\partial}{\partial x_{h}}\left(\frac{\partial Q}{\partial g_{ab,fe}}g_{ab,f}\right)\right\} (89)

and we shall seek its divergency.

We have

(div\ \mathfrak{u})_{h}=\sum(e)\frac{\partial\mathfrak{u}_{h}^{e}}{\partial x_{e}}=\frac{1}{2\varkappa}\left\{ \frac{\partial Q}{\partial x_{h}}-\sum(abfe)\frac{\partial^{2}}{\partial x_{e}\partial x_{h}}\left(\frac{\partial Q}{\partial g_{ab,fe}}g_{ab,f}\right)\right\}

or

(div\ \mathfrak{u})_{h}=\frac{1}{2\varkappa}\frac{\partial R}{\partial x_{h}} (90)

if we put

R=Q-\sum(abfe)\frac{\partial}{\partial x_{e}}\left(\frac{\partial Q}{\partial g_{ab,fe}}g_{ab,f}\right) (91)

Now Q=\sqrt{-g}G can be divided into two parts, the first of which Q_{1} contains differential coefficients of the quantities g_{ab} of the first order only, while the second Q_{2} is a homogeneous linear function of the second derivatives of those quantities. This latter involves that, if we replace (91) by

R=Q_{1}+Q_{2}-\sum(abfe)\left(\frac{\partial Q}{\partial g_{ab,fe}}g_{ab,fe}\right)-\sum(abfe)\frac{\partial}{\partial x_{e}}\left(\frac{\partial Q}{\partial g_{ab,fe}}\right)g_{ab,f}

the second and the third term annul each other. Thus

R=Q_{1}-\sum(abfe)\frac{\partial}{\partial x_{e}}\left(\frac{\partial Q}{\partial g_{ab,fe}}\right)g_{ab,f} (92)
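
The cancellation used in passing to (92) rests on Euler's theorem for homogeneous functions. Since Q_{1} does not contain the second derivatives g_{ab,fe}, and Q_{2} is homogeneous and linear in them, we have

\sum(abfe)\frac{\partial Q}{\partial g_{ab,fe}}g_{ab,fe}=\sum(abfe)\frac{\partial Q_{2}}{\partial g_{ab,fe}}g_{ab,fe}=Q_{2}

which is exactly the quantity subtracted from Q_{2} in the expanded expression for R.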

If now we define a complex \mathfrak{v} by the equation

\mathfrak{v}_{h}^{e}=-\frac{1}{2\varkappa}\delta_{h}^{e}R (93)

we have

(div\ \mathfrak{v})_{h}=-\frac{1}{2\varkappa}\frac{\partial R}{\partial x_{h}} (94)

If finally we put

\mathfrak{t}'=\mathfrak{t}+\mathfrak{u}+\mathfrak{v}

we infer from (90) and (94)

div\ \mathfrak{t}'=div\ \mathfrak{t} (95)

and from (88), (89), (93) and (92)

\begin{array}{c}
\mathfrak{t}_{h}^{'h}=\frac{1}{2\varkappa}\left\{ -Q_{1}+\sum(ab)\frac{\partial Q}{\partial g_{ab,h}}g_{ab,h}-\sum(abf)\frac{\partial}{\partial x_{h}}\left(\frac{\partial Q}{\partial g_{ab,fh}}\right)g_{ab,f}-\right.\\
\\
\left.\sum(abf)\frac{\partial}{\partial x_{f}}\left(\frac{\partial Q}{\partial g_{ab,hf}}\right)g_{ab,h}+\sum(abfe)\frac{\partial}{\partial x_{e}}\left(\frac{\partial Q}{\partial g_{ab,fe}}\right)g_{ab,f}\right\} 
\end{array} (96)

and for e\ne h

\begin{array}{c}
\mathfrak{t}_{h}^{'e}=\frac{1}{2\varkappa}\left\{ \sum(ab)\frac{\partial Q}{\partial g_{ab,e}}g_{ab,h}-\sum(abf)\frac{\partial}{\partial x_{h}}\left(\frac{\partial Q}{\partial g_{ab,fe}}\right)g_{ab,f}-\right.\\
\\
\left.\sum(abf)\frac{\partial}{\partial x_{f}}\left(\frac{\partial Q}{\partial g_{ab,ef}}\right)g_{ab,h}\right\} 
\end{array} (97)

Formula (95) shows that the quantities \mathfrak{t}_{h}^{'e} can be taken just as well as the expressions (88) for the stress-energy-components and we see from (96) and (97) that these new expressions contain only the first derivatives of the coefficients g_{ab}; they are homogeneous quadratic functions of these differential coefficients.

This becomes clear when we remember that Q_{1} is a function of this kind and that only Q_{1} contributes something to the second term of (96) and the first of (97); further that the derivatives of Q occurring in the following terms contain only the quantities g_{ab} and not their derivatives.


§ 55. Einstein's stress-energy-components have a form widely different from that of the above mentioned ones. They are

\mathfrak{t}_{(E)h}^{e}=\frac{1}{2\varkappa}\delta_{h}^{e}\sum(abcf)g^{ab}\Gamma_{ac}^{f}\Gamma_{bf}^{c}-\frac{1}{\varkappa}\sum(abc)g^{ab}\Gamma_{ac}^{e}\Gamma_{bh}^{c}

where for the sake of simplicity it has been assumed that \sqrt{-g}=1. Further we have

\Gamma_{ab}^{c}=-\left\{ \begin{array}{c}
ab\\
c
\end{array}\right\} =-\sum(e)g^{ce}\left[\begin{array}{c}
ab\\
e
\end{array}\right]

If now our formulae (96) and (97) are likewise simplified by the assumption \sqrt{-g}=1 (so that Q becomes equal to G), we may expect that \mathfrak{t}' will become identical with \mathfrak{t}_{(E)}. This is really so in the case g_{ab}=0 for a\ne b; by which it seems very probable that the agreement will exist in general.

In the preceding paper it was shown already that the stress-energy-components \mathfrak{t}_{h}^{e} do not form a "tensor", but what was called a "complex". The same may be said of the quantities \mathfrak{t}_{h}^{'e} defined by (96) and (97) and of the expressions given by Einstein. If we want a stress-energy-tensor, there are only left the quantities \mathfrak{t}_{0h}^{e} defined by (86) and (57), the values of which are always equal and opposite to the corresponding stress-energy-components \mathfrak{T}_{h}^{e} for the matter or the electromagnetic field.

It must be noticed that the four equations

\sum(e)\frac{\partial}{\partial x_{e}}\left(\mathfrak{T}_{h}^{e}+\mathfrak{T}_{(g)h}^{e}\right)=0

always express the same relations, whether we choose \mathfrak{t}_{0h}^{e},\ \mathfrak{t}_{h}^{e},\ \mathfrak{t}_{h}^{'e} or \mathfrak{t}_{(E)h}^{e} as stress-energy-components \mathfrak{T}_{(g)h}^{e} of the gravitation field. If however in a definite case we want to use the equations in order to calculate how the momentum and the energy of the matter and the electromagnetic field change by the gravitational actions, it is best to use \mathfrak{t}_{h}^{'e} or \mathfrak{t}_{(E)h}^{e}, just because these quantities are homogeneous quadratic functions of the derivatives g_{ab,c}.

Experience namely teaches us that the gravitation fields occurring in nature may be regarded as feeble, in this sense that the values of the g_{ab}'s are little different from those which might be assumed if no gravitation field existed. For these latter values, which will be called the "normal" ones, we may write in orthogonal coordinates

g_{11}=g_{22}=g_{33}=-1,\ g_{44}=c^{2},\ g_{ab}=0,\quad\textrm{for}\quad a\ne b (98)

In a first approximation, which most times will be sufficient, the deviations of the values of the g_{ab}'s from these normal ones may be taken proportional to the gravitation constant \varkappa. This factor also appears in the differential coefficients g_{ab,c}; hence, according to the character of the functions \mathfrak{t}_{h}^{'e} mentioned above (and on account of the factor \tfrac{1}{\varkappa} in (96) and (97)) these functions become proportional to \varkappa, so that in a feeble gravitation field they have low values.


§ 56. Because of the complicated form of equations (96) and (97), we shall confine ourselves to the calculation for some cases of \mathfrak{t}_{4}^{'4}, i.e. of the energy per unit of volume. This calculation is considerably simplified if we consider stationary fields only. Then all differential coefficients with respect to x_{4} vanish, so that we have according to (96)

\mathfrak{t}_{4}^{'4}=\frac{1}{2\varkappa}\left\{ -Q_{1}+\sum(abfe)\frac{\partial}{\partial x_{e}}\left(\frac{\partial Q}{\partial g_{ab,fe}}\right)g_{ab,f}\right\} (99)

We shall work out the calculation, first for a field without gravitation and secondly for the case of an attracting spherical body in which the matter is distributed symmetrically round the centre.

If there is no gravitation field we may take for the quantities g_{ab} the "normal" values. For the case of orthogonal coordinates these are given by (98). When we want to use the polar coordinates introduced in § 48 we have the corresponding formulae

\left.\begin{array}{c}
g_{11}=-\frac{r^{2}}{1-x_{1}^{2}},\ g_{22}=-r^{2}\left(1-x_{1}^{2}\right),\ g_{33}=-1,\ g_{44}=c^{2},\\
\\
g_{ab}=0,\quad\textrm{for}\quad a\ne b
\end{array}\right\} (100)

If, using polar coordinates, we have to do with an attracting sphere and if we take its centre as origin, we may put

g_{11}=-\frac{u}{1-x_{1}^{2}},\ g_{22}=-\left(1-x_{1}^{2}\right)u,\ g_{33}=-v,\ g_{44}=w, (101)

where u, v, w are functions of r. The g_{ab}'s which belong to an orthogonal system of coordinates may be expressed in the same functions.

These g_{ab}'s are

\begin{array}{l}
g_{11}=-\frac{u}{r^{2}}+\frac{x_{1}^{2}}{r^{2}}\left(\frac{u}{r^{2}}-v\right),\ etc.\\
\\
g_{12}=\frac{x_{1}x_{2}}{r^{2}}\left(\frac{u}{r^{2}}-v\right),\ etc.\\
\\
g_{14}=g_{24}=g_{34}=0,\ g_{44}=w.
\end{array}

The "etc." means that for g_{22}, g_{33} we have similar expressions as for g_{11} and for g_{23}, g_{31} similar ones as for g_{12}.


§ 57. In order to deduce the differential equations determining u, v, w we may arbitrarily use rectangular or polar coordinates; the latter however are here to be preferred. If differentiations with respect to r are indicated by accents, we have according to (40) and (101)

\begin{array}{l}
G_{11}=\frac{1}{1-x_{1}^{2}}\left(-1+\frac{u''}{2v}-\frac{u'v'}{4v^{2}}+\frac{u'w'}{4vw}\right),\\
\\
G_{22}=\left(1-x_{1}^{2}\right)\left(-1+\frac{u''}{2v}-\frac{u'v'}{4v^{2}}+\frac{u'w'}{4vw}\right),\\
\\
G_{33}=\frac{u''}{u}-\frac{u'^{2}}{2u^{2}}-\frac{u'v'}{2uv}-\frac{v'w'}{4vw}+\frac{w''}{2w}-\frac{w'^{2}}{4w^{2}},\\
\\
G_{44}=-\frac{u'w'}{2uv}+\frac{v'w'}{4v^{2}}-\frac{w''}{2v}+\frac{w'^{2}}{4vw},
\end{array}

G_{ab}=0 for a\ne b

So we have found the left-hand sides of the field equations (65). Before considering these equations more closely we shall introduce the simplification that the g_{ab}'s are very little different from the normal values (100). For these latter we have

u=r^{2},\ v=1,\ w=c^{2} (102)

and therefore we now put

u=r^{2}(1+\lambda),\ v=1+\mu,\ w=c^{2}(1+\nu) (103)

The quantities \lambda,\mu,\nu, which depend on r, will be regarded as infinitely small of the first order and in the field equations we shall neglect quantities of second and higher orders.

Then we may write for G_{11} etc.

\begin{array}{l}
G_{11}=\frac{1}{1-x_{1}^{2}}\left(\lambda+2r\lambda'+\frac{1}{2}r^{2}\lambda''-\mu-\frac{1}{2}r\mu'+\frac{1}{2}r\nu'\right)\\
\\
G_{22}=\left(1-x_{1}^{2}\right)\left(\lambda+2r\lambda'+\frac{1}{2}r^{2}\lambda''-\mu-\frac{1}{2}r\mu'+\frac{1}{2}r\nu'\right)\\
\\
G_{33}=\frac{2}{r}\lambda'+\lambda''-\frac{1}{r}\mu'+\frac{1}{2}\nu'',\\
\\
G_{44}=-c^{2}\left(\frac{1}{r}\nu'+\frac{1}{2}\nu''\right)
\end{array}
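
The passage from the exact expressions for G_{11}, G_{33}, G_{44} to the linearized ones can be checked numerically. The following Python sketch is only an illustration: the trial functions chosen for \lambda, \mu, \nu, the sample values of r and c, and the helper name `linearization_error` are assumptions made here and do not occur in the paper. It evaluates the three quantities once from the exact formulae and once from the linearized ones; the difference should shrink like the square of the perturbation.

```python
def linearization_error(eps, r=1.3, c=2.0):
    """Largest gap between exact and linearized G-components for a trial field."""
    # Trial perturbations (arbitrary smooth choices, first order in eps).
    lam, dlam, ddlam = eps * r, eps, 0.0
    mu, dmu = eps * r**2, 2 * eps * r
    nu, dnu, ddnu = eps * r**3, 3 * eps * r**2, 6 * eps * r

    # u, v, w from (103) and their derivatives with respect to r.
    u = r**2 * (1 + lam)
    du = 2 * r * (1 + lam) + r**2 * dlam
    ddu = 2 * (1 + lam) + 4 * r * dlam + r**2 * ddlam
    v, dv = 1 + mu, dmu
    w, dw, ddw = c**2 * (1 + nu), c**2 * dnu, c**2 * ddnu

    # Exact expressions: common bracket of G_11 and G_22, then G_33 and G_44.
    b_exact = -1 + ddu / (2 * v) - du * dv / (4 * v**2) + du * dw / (4 * v * w)
    g33_exact = (ddu / u - du**2 / (2 * u**2) - du * dv / (2 * u * v)
                 - dv * dw / (4 * v * w) + ddw / (2 * w) - dw**2 / (4 * w**2))
    g44_exact = (-du * dw / (2 * u * v) + dv * dw / (4 * v**2)
                 - ddw / (2 * v) + dw**2 / (4 * v * w))

    # Linearized expressions (terms of first order only).
    b_lin = lam + 2 * r * dlam + 0.5 * r**2 * ddlam - mu - 0.5 * r * dmu + 0.5 * r * dnu
    g33_lin = 2 * dlam / r + ddlam - dmu / r + 0.5 * ddnu
    g44_lin = -c**2 * (dnu / r + 0.5 * ddnu)

    return max(abs(b_exact - b_lin), abs(g33_exact - g33_lin), abs(g44_exact - g44_lin))
```

Reducing `eps` by a factor 10 should reduce the returned value by a factor of about 100, as befits a remainder of the second order.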

On the right-hand sides of the field equations (65) we may take for g_{ab} the normal value; moreover we shall take for T_{ab} and T the values which hold for a system of incoherent material points. We may do so if we assume no other internal stresses but those caused by the mutual attractions; these stresses may be neglected in the present approximation.

As we supposed the attracting matter to be at rest we have according to (10), (16) and (15) (1915) w_{1}=w_{2}=w_{3}=0, w_{4}=\varrho, u_{1}=u_{2}=u_{3}=0, u_{4}=c^{2}\varrho, P=c\varrho.

In the notations we are now using we have further, according to (23) (1915),

\mathfrak{T}_{h}^{e}=\frac{u_{h}w_{e}}{P}

so that of the stress-energy-components of the matter only one is different from zero, namely

\mathfrak{T}_{4}^{4}=c\varrho

Further (66) involves that, also of the quantities T_{ab}, only one, namely T_{44}, is not equal to zero. As we may put \sqrt{-g}=cr^{2} we have namely

T_{44}=\frac{c^{2}}{r^{2}}\varrho,\ T=\frac{1}{r^{2}}\varrho
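
The value of \sqrt{-g} used here follows directly from the normal values (100), the determinant of a diagonal system g_{ab} being the product of its diagonal elements:

g=\left(-\frac{r^{2}}{1-x_{1}^{2}}\right)\left(-r^{2}\left(1-x_{1}^{2}\right)\right)(-1)\, c^{2}=-c^{2}r^{4},\quad\sqrt{-g}=cr^{2}

If we assume that (66) expresses \mathfrak{T}_{h}^{e} as \sqrt{-g} times the mixed components T_{h}^{e} (the equation itself is not restated here), this gives T_{4}^{4}=c\varrho/(cr^{2})=\varrho/r^{2}, hence T=\varrho/r^{2} and T_{44}=g_{44}T_{4}^{4}=c^{2}\varrho/r^{2}.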

Finally we are led to the three differential equations

\lambda+2r\lambda'+\frac{1}{2}r^{2}\lambda''-\mu-\frac{1}{2}r\mu'+\frac{1}{2}r\nu'=-\frac{1}{2}\varkappa\varrho (104)
2r\lambda'+r^{2}\lambda''-r\mu'+\frac{1}{2}r^{2}\nu''=-\frac{1}{2}\varkappa\varrho (105)
r\nu'+\frac{1}{2}r^{2}\nu''=\frac{1}{2}\varkappa\varrho (106)

It may be remarked that \varrho dx_{1}dx_{2}dx_{3} represents the "mass" present in the element of volume dx_{1}dx_{2}dx_{3}. Because of the meaning of x_{1},x_{2},x_{3} (§ 48) the mass in the shell between spheres with radii r and r + dr is found when \varrho dx_{1}dx_{2}dx_{3} is integrated with respect to x_{1} between the limits -1 and +1 and with respect to x_{2} between 0 and 2\pi. As \varrho depends on r only, this latter mass becomes 4\pi\varrho dr, so that \varrho is connected with the "density" in the ordinary sense of the word, which will be called \overline{\varrho}, by the equation

\varrho=r^{2}\overline{\varrho}

The differential equations also hold outside the sphere if \varrho is put equal to zero. We can first imagine \varrho to change gradually near the surface and then treat the abrupt change as a limiting case.

In all the preceding considerations we have tacitly supposed the second derivatives of the quantities g_{ab} to have everywhere finite values. Therefore \nu and \nu' will be continuous at the surface, even in the case of an abrupt change.


§ 58. Equation (106) gives

\nu'=\frac{\varkappa}{r^{2}}\int\limits _{0}^{r}\varrho\ dr (107)

where the integration constant is determined by the consideration that for r = 0 all the quantities g_{ab} and their derivatives must be finite, so that for r = 0 the product r^{2}\nu' must be zero. As it is natural to suppose that at an infinite distance \nu vanishes, we find further

\nu=\varkappa\int\limits _{\infty}^{r}\frac{dr}{r^{2}}\int\limits _{0}^{r}\varrho dr (108)
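
The step from (106) to (107) may be made explicit. The left-hand side of (106) is half the derivative of r^{2}\nu',

r\nu'+\frac{1}{2}r^{2}\nu''=\frac{1}{2}\frac{d}{dr}\left(r^{2}\nu'\right)

so that (106) is equivalent to

\frac{d}{dr}\left(r^{2}\nu'\right)=\varkappa\varrho

Integrating from 0 to r with r^{2}\nu'=0 at r = 0 gives (107), and a second integration, starting from infinity where \nu vanishes, gives (108).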

The quantities \lambda and \mu on the contrary are not completely determined by the differential equations. If namely equations (105) and (106) are added to (104) after having been multiplied by -\tfrac{1}{2} and +\tfrac{1}{2} respectively, we find

\lambda+r\lambda'-\mu+r\nu'=0 (109)

and it is clear that (104) and (105) are satisfied as soon as this is the case with this condition (109) and with (106). So we have only to attend to (108) and (109). The indefiniteness remaining in \lambda and \mu is inevitable on account of the covariancy of the field equations. It does not give rise to any difficulties.
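
The combination leading to (109) is easily verified by bookkeeping of coefficients. The Python sketch below is an illustration only; the names `BASIS`, `EQ104`, `EQ105`, `EQ106` and `combine` are introduced here for the purpose. Each equation is stored as its coefficients with respect to \lambda, r\lambda', r^{2}\lambda'', \mu, r\mu', r\nu', r^{2}\nu'', the last entry being the multiple of \varkappa\varrho on the right-hand side, with (104) read as \lambda+2r\lambda'+\ldots and the last term on the left of (105) as \tfrac{1}{2}r^{2}\nu''.

```python
from fractions import Fraction as F

# Order of the coefficients; "rhs" is the multiple of kappa*varrho on the right.
BASIS = ("lam", "r*lam'", "r^2*lam''", "mu", "r*mu'", "r*nu'", "r^2*nu''", "rhs")

EQ104 = dict(zip(BASIS, (F(1), F(2), F(1, 2), F(-1), F(-1, 2), F(1, 2), F(0), F(-1, 2))))
EQ105 = dict(zip(BASIS, (F(0), F(2), F(1), F(0), F(-1), F(0), F(1, 2), F(-1, 2))))
EQ106 = dict(zip(BASIS, (F(0), F(0), F(0), F(0), F(0), F(1), F(1, 2), F(1, 2))))


def combine(weighted_equations):
    """Weighted sum of equations given as coefficient dictionaries."""
    return {k: sum(wt * eq[k] for wt, eq in weighted_equations) for k in BASIS}


# (104) - 1/2 (105) + 1/2 (106), as prescribed in the text.
EQ109 = combine([(F(1), EQ104), (F(-1, 2), EQ105), (F(1, 2), EQ106)])
```

The result has the coefficients 1, 1, -1, 1 for \lambda, r\lambda', \mu, r\nu' and zero everywhere else, the right-hand side included, which is (109). Conversely, multiplying the derivative of (109) by r and subtracting twice (106) reproduces (105).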

Equation (107) teaches us that near the centre

\nu'=\frac{1}{3}\varkappa\overline{\varrho_{0}}r

if \overline{\varrho_{0}} is the density at the centre, whereas from (108) we find a finite value for \nu itself. This confirms what has been said above about the values at the centre. We shall assume that at that point \lambda,\mu and their derivatives have likewise finite values. Moreover we suppose (and this agrees with (109)) that \lambda,\mu,\lambda' and \mu' are continuous at the surface of the sphere.

If a is the radius of the sphere we find from (108) for an external point

\nu=-\frac{\varkappa}{r}\int\limits _{0}^{a}\varrho dr
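
For a sphere of uniform density \overline{\varrho} the integrals (107) and (108) can be worked out completely: outside the sphere \nu=-\tfrac{1}{3}\varkappa\overline{\varrho}a^{3}/r, and inside \nu=-\tfrac{1}{6}\varkappa\overline{\varrho}\left(3a^{2}-r^{2}\right), which is finite at the centre. The Python sketch below checks these values by numerical quadrature; the function names and the sample numbers are assumptions chosen for illustration.

```python
def nu_prime(r, a, kappa, rho_bar):
    """nu' from (107) for uniform ordinary density rho_bar inside radius a."""
    # Integral of varrho = r^2 * rho_bar from 0 to r (constant beyond r = a).
    enclosed = rho_bar * min(r, a) ** 3 / 3.0
    return kappa * enclosed / r**2


def nu(r, a, kappa, rho_bar, n=20000):
    """nu from (108): minus the integral of nu' from r to infinity (midpoint rule)."""
    total = 0.0
    if r < a:
        # Part of the path lying inside the sphere.
        h = (a - r) / n
        total += sum(nu_prime(r + (i + 0.5) * h, a, kappa, rho_bar) for i in range(n)) * h
    # Tail from max(r, a) to infinity, with s = 1/r' so that dr' = ds / s^2.
    s_max = 1.0 / max(r, a)
    hs = s_max / n
    for i in range(n):
        s = (i + 0.5) * hs
        total += nu_prime(1.0 / s, a, kappa, rho_bar) / s**2 * hs
    return -total
```

With a = 2, \varkappa = 0.5 and \overline{\varrho} = 3 (arbitrary numbers) the closed forms give \nu = -0.8 at r = 5 and \nu = -3 at the centre.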

Without contradicting (109) we may assume that at a great distance from the centre \lambda and \mu are likewise proportional to \tfrac{1}{r}, so that \lambda' and \mu' decrease proportionally to \tfrac{1}{r^{2}}.


§ 59. We can now continue the calculation of \mathfrak{t}_{4}^{'4} (§ 56). Substituting (101) in (99) and using polar coordinates we find

\mathfrak{t}_{4}^{'4}=-\frac{1}{2\varkappa}u\sqrt{\frac{w}{v}}\left(\frac{1}{2}\frac{u'^{2}}{u^{2}}+\frac{u'w'}{uw}\right)

whence by substituting (102) we derive for a field without gravitation

\mathfrak{t}_{4}^{'4}=-\frac{c}{\varkappa}

This equation shows that, working with polar coordinates, we should have to ascribe a certain negative value of the energy to a field without gravitation, in such a way (comp. § 57) that the energy in the shell between the spheres described round the origin with radii r and r + dr becomes

-\frac{4\pi c}{\varkappa}dr

The density of the energy in the ordinary sense of the word would be inversely proportional to r^{2}, so that it would become infinite at the centre.
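
The value -\tfrac{c}{\varkappa} can be checked by direct substitution. For the normal values (102) we have u'=2r and w'=0, so that

\mathfrak{t}_{4}^{'4}=-\frac{1}{2\varkappa}r^{2}\sqrt{\frac{c^{2}}{1}}\left(\frac{1}{2}\frac{4r^{2}}{r^{4}}+0\right)=-\frac{c}{\varkappa}

Integration with respect to x_{1} from -1 to +1 and with respect to x_{2} from 0 to 2\pi multiplies this by 4\pi, which gives the energy of the shell; dividing by the volume 4\pi r^{2}dr of the shell we find the ordinary density -\tfrac{c}{\varkappa r^{2}}.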

It is hardly necessary to remark that, using rectangular coordinates we find a value zero for the same case of a field without gravitation. The normal values of g_{ab} are then constants and their derivatives vanish.


§ 60. Using rectangular coordinates we shall now indicate the form of \mathfrak{t}_{4}^{'4} for the field of a spherical body, with the approximation specified in § 57. Thus we put

\left.\begin{array}{l}
g_{11}=-(1+\lambda)+\frac{x_{1}^{2}}{r^{2}}(\lambda-\mu),\ etc.\\
\\
g_{12}=\frac{x_{1}x_{2}}{r^{2}}(\lambda-\mu),\ etc.\\
\\
g_{14}=g_{24}=g_{34}=0,\ g_{44}=c^{2}(1+\nu)
\end{array}\right\} (110)

By (109) and (110) we find[34]

\mathfrak{t}_{4}^{'4}=\frac{c}{2\varkappa}\left\{ \nu'^{2}+\frac{1}{r}(\lambda-\mu)\left[\frac{1}{r}(\lambda-\mu)+2(\lambda'-\mu')\right]\right\} (111)

Thus we see (comp. § 58) that at a distance from the attracting sphere \mathfrak{t}_{4}^{'4} decreases proportionally to \tfrac{1}{r^{4}}. Further it is to be noticed that on account of the indefiniteness pointed out in § 58, there remains some uncertainty as to the distribution of the energy over the space, but that nevertheless the total energy of the gravitation field

E=4\pi\int\limits _{0}^{\infty}\mathfrak{t}_{4}^{'4}r^{2}dr

has a definite value.

Indeed, by the integration the last term of (111) vanishes. After multiplication by r^{2} this term becomes namely

(\lambda-\mu)^{2}+2r(\lambda-\mu)(\lambda'-\mu')=\frac{d}{dr}\left[r(\lambda-\mu)^{2}\right]

The integral of this expression is 0 because (comp. §§ 57 and 58) r(\lambda-\mu)^{2} is continuous at the surface of the sphere and vanishes both for r = 0 and for r=\infty.

We have thus

E=\frac{\pi c}{\varkappa}\int\limits _{0}^{\infty}\nu'^{2}r^{2}dr (112)

where the value (107) can be substituted for \nu'. If e.g. the density \overline{\varrho} is everywhere the same all over the sphere, we have at an internal point

\nu'=\frac{1}{3}\varkappa\overline{\varrho}r

and at an external point

\nu'=\frac{1}{3}\varkappa\overline{\varrho}\frac{a^{3}}{r^{2}}

From this we find

E=\frac{2}{15}\pi c\varkappa\overline{\varrho}^{2}a^{5}
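
The last step, from (112) to the closed expression for a homogeneous sphere, can be verified by numerical integration. The sketch below is a minimal illustration in Python; the function names and the midpoint rule with the substitution s = 1/r for the outer region are choices made here, not part of the paper.

```python
import math


def nu_prime_uniform(r, a, kappa, rho_bar):
    """nu' for a sphere of radius a and uniform density rho_bar."""
    if r <= a:
        return kappa * rho_bar * r / 3.0
    return kappa * rho_bar * a**3 / (3.0 * r**2)


def total_energy(a, kappa, rho_bar, c, n=20000):
    """E from (112), integrated by the midpoint rule."""
    # Interior, 0 < r < a.
    h = a / n
    inner = 0.0
    for i in range(n):
        r = (i + 0.5) * h
        inner += nu_prime_uniform(r, a, kappa, rho_bar) ** 2 * r**2 * h
    # Exterior, a < r < infinity, mapped to 0 < s < 1/a by s = 1/r, dr = ds / s^2.
    hs = (1.0 / a) / n
    outer = 0.0
    for i in range(n):
        s = (i + 0.5) * hs
        r = 1.0 / s
        outer += nu_prime_uniform(r, a, kappa, rho_bar) ** 2 * r**2 * hs / s**2
    return math.pi * c / kappa * (inner + outer)


def closed_form(a, kappa, rho_bar, c):
    """The closed expression (2/15) pi c kappa rho_bar^2 a^5."""
    return 2.0 / 15.0 * math.pi * c * kappa * rho_bar**2 * a**5
```

The interior contributes \tfrac{1}{45}\varkappa^{2}\overline{\varrho}^{2}a^{5} and the exterior \tfrac{1}{9}\varkappa^{2}\overline{\varrho}^{2}a^{5} to the integral, the sum being \tfrac{2}{15}\varkappa^{2}\overline{\varrho}^{2}a^{5}.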


§ 61. The general equation (99) found for \mathfrak{t}_{4}^{'4} can be transformed in a simple way. We have namely

\begin{array}{c}
\sum(abfe)\frac{\partial}{\partial x_{e}}\left(\frac{\partial Q}{\partial g_{ab,fe}}\right)g_{ab,f}=\sum(abfe)\frac{\partial}{\partial x_{e}}\left(\frac{\partial Q}{\partial g_{ab,fe}}g_{ab,f}\right)-\\
\\
\sum(abfe)\frac{\partial Q}{\partial g_{ab,fe}}g_{ab,fe}
\end{array}

and we may write -Q_{2} (§ 54) for the last term. Hence

\mathfrak{t}_{4}^{'4}=\frac{1}{2\varkappa}\left\{ -Q+\sum(abfe)\frac{\partial}{\partial x_{e}}\left(\frac{\partial Q}{\partial g_{ab,fe}}g_{ab,f}\right)\right\} (113)

where we must give the values 1, 2, 3 to e and f.

The gravitation energy lying within a closed surface consists therefore of two parts, the first of which is

E_{1}=-\frac{1}{2\varkappa}\int Q\ dx_{1}dx_{2}dx_{3} (114)

while the second can be represented by surface integrals. If namely q_{1},q_{2},q_{3} are the direction constants of the normal drawn outward

E_{2}=\frac{1}{2\varkappa}\int\sum(abfe)\frac{\partial Q}{\partial g_{ab,fe}}g_{ab,f}q_{e}d\sigma (115)

In the case of the infinitely feeble gravitation field represented by \lambda,\mu,\nu (§ 57) both expressions E_{1} and E_{2} contain quantities of the first order, but it can easily be verified that these cancel each other in the sum, so that, as we knew already, the total energy is of the second order.

From Q=\sqrt{-g}G and the equations of § 32 we find namely

\frac{\partial Q}{\partial g_{ab,fe}}=\frac{1}{2}\sqrt{-g}\left(2g^{ab}g^{fe}-g^{bf}g^{ae}-g^{af}g^{be}\right) (116)

so that we can write

E_{2}=\frac{1}{4\varkappa}\int\sqrt{-g}\sum(abfe)\left(2g^{ab}g^{fe}-g^{bf}g^{ae}-g^{af}g^{be}\right)g_{ab,f}q_{e}d\sigma

The factor g_{ab,f} is of the first order. Thus, if we confine ourselves to that order, we may take for all the other quantities their normal values. Many of these are zero and we find

E_{2}=-\frac{c}{2\varkappa}\sum(ae)\int g^{aa}\left(g_{aa,e}-g_{ae,a}\right)q_{e}d\sigma (117)

Here we must take a = 1, 2, 3, 4; e = 1, 2, 3, while we remark that for a=e the expression between brackets vanishes. For a = 4 the integral becomes \int\frac{\partial\nu}{\partial x_{e}}q_{e}d\sigma, which after summation with respect to e gives

\int\frac{\partial\nu}{\partial n}d\sigma (118)

n representing the normal to the surface. If a and e differ from each other, while neither of them is equal to 4, we can deduce from (110) and (109)

g_{aa,e}-g_{ae,a}=\frac{\partial\nu}{\partial x_{e}}

Each value of e occurring twice, i.e. combined with the two values different from e which a can take, we have in addition to (118)

-2\int\frac{\partial\nu}{\partial n}d\sigma

so that (117) becomes

E_{2}=\frac{c}{2\varkappa}\int\frac{\partial\nu}{\partial n}d\sigma
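
The relation g_{aa,e}-g_{ae,a}=\tfrac{\partial\nu}{\partial x_{e}} used above for the spatial indices follows from (110) as soon as \lambda, \mu, \nu satisfy (109). It can be checked by finite differences, as in the Python sketch below. The trial functions for \lambda and \nu, the step size and the function names are assumptions made for this illustration; \mu is then fixed by (109). The indices 0, 1, 2 of the code stand for 1, 2, 3 of the text.

```python
import math

EPS = 1e-3  # size of the trial perturbation (arbitrary)


def lam(r): return EPS * math.exp(-r)
def dlam(r): return -EPS * math.exp(-r)
def nu(r): return -EPS / r
def dnu(r): return EPS / r**2


def mu(r):
    # mu fixed by condition (109): lam + r*lam' - mu + r*nu' = 0.
    return lam(r) + r * dlam(r) + r * dnu(r)


def g(a, b, x):
    """Spatial component g_ab of (110) at the point x."""
    r = math.sqrt(sum(xi * xi for xi in x))
    value = x[a] * x[b] / r**2 * (lam(r) - mu(r))
    if a == b:
        value -= 1 + lam(r)
    return value


def d(f, x, e, h=1e-5):
    """Central difference of f with respect to x[e]."""
    xp, xm = list(x), list(x)
    xp[e] += h
    xm[e] -= h
    return (f(xp) - f(xm)) / (2 * h)


def mismatch(x, a, e):
    """(g_aa,e - g_ae,a) minus d(nu)/dx_e; zero up to discretization error."""
    r = math.sqrt(sum(xi * xi for xi in x))
    lhs = d(lambda y: g(a, a, y), x, e) - d(lambda y: g(a, e, y), x, a)
    return lhs - dnu(r) * x[e] / r
```

For any point not at the origin and any pair a, e of different spatial indices `mismatch` should be negligible compared with \tfrac{\partial\nu}{\partial x_{e}} itself.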

As now outside the sphere

\nu=-\frac{\varkappa}{r}\int\limits _{0}^{a}\varrho\ dr

we have for every closed surface that does not surround the sphere E_{2}=0, but for every surface that does

E_{2}=2\pi c\int\limits _{0}^{a}\varrho\ dr (119)

As to E_{1} we remark that substituting (65) in (41) and taking into consideration (64) we find,

G=\varkappa T,\ Q=\varkappa\sqrt{-g}T (120)

From this we conclude that E_{1} is zero if there is no matter inside the surface \sigma. In order to determine E_{1} in the opposite case, we remember that G is independent of the choice of coordinates. To calculate this quantity we may therefore use the value of T indicated in § 57, which is sufficient to calculate E_{1} as far as the terms of the first order. We have therefore

G=\frac{\varkappa}{r^{2}}\varrho

and if, using further on rectangular coordinates, we take for \sqrt{-g} the normal value c,

Q=\frac{c\varkappa}{r^{2}}\varrho

From this we find by substitution in (114) for the case of the closed surface \sigma surrounding the sphere

E_{1}=-2\pi c\int\limits _{0}^{a}\varrho\ dr

This equation together with (119) shows that in (113) when integrated over the whole space the terms of the first order really cancel each other. In order to calculate those of the second order and thus to derive the result (112) from (113), we should have to determine the quantity T (comp. (120)) accurately to the order \varkappa. The surface integrals in (115) too would have to be considered more closely. We shall not however dwell upon this.
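
The two contributions of the first order may be put side by side. If for the surface \sigma we take a sphere of radius R > a, the external value of \nu gives \tfrac{\partial\nu}{\partial n}=\tfrac{\varkappa}{R^{2}}\int_{0}^{a}\varrho\ dr, so that

E_{2}=\frac{c}{2\varkappa}\cdot4\pi R^{2}\cdot\frac{\varkappa}{R^{2}}\int\limits _{0}^{a}\varrho\ dr=2\pi c\int\limits _{0}^{a}\varrho\ dr

while, the element of volume in rectangular coordinates becoming 4\pi r^{2}dr after integration over the angles,

E_{1}=-\frac{1}{2\varkappa}\int\limits _{0}^{a}\frac{c\varkappa}{r^{2}}\varrho\cdot4\pi r^{2}dr=-2\pi c\int\limits _{0}^{a}\varrho\ dr

The sum E_{1}+E_{2} therefore vanishes to the first order, whatever the radial distribution \varrho(r) may be.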


§ 62. From the expression for \mathfrak{t}_{4}^{'4} given in (113) and the value

E=E_{1}+E_{2}

derived from it, it can be inferred that, though \mathfrak{t}' is no tensor, we may yet change a good deal in the system of coordinates in which the phenomena are described, without altering the value of the total energy. Let us suppose e.g. that x_{4} is left unchanged but that, instead of the rectangular coordinates x_{1},x_{2},x_{3} hitherto used, other quantities x_{1}',x_{2}',x_{3}' are introduced, which are continuous functions of x_{1},x_{2},x_{3}, with the restriction that x'_{1}=x_{1},x'_{2}=x_{2},x'_{3}=x_{3} outside a certain closed surface surrounding the attracting matter at a sufficient distance. If we use these new coordinates, we shall have to introduce other quantities g'_{ab} instead of g_{ab}. As, however, outside the closed surface the quantities g_{ab} and their derivatives do not change, the value of E_{2} will approach the same limit as when we used the coordinates x_{1},x_{2},x_{3}, if the surface \sigma for which it is calculated expands indefinitely. The value which we find for E_{1} after the transformation of coordinates will also be the same as before. Indeed, if d\tau is an element of volume expressed in x_{1},x_{2},x_{3}-units and d\tau' the same element expressed in x_{1}',x_{2}',x_{3}'-units, while Q' represents the new value of Q, we have

Qd\tau=Q'd\tau'

It is clear that the total energy will also remain unchanged if x_{1}',x_{2}',x_{3}' differ from x_{1},x_{2},x_{3} at all points, provided only that these differences decrease so rapidly with increasing distance from the attracting body, that they have no influence on the limit of the expression (115).

The result which we have now found admits of another interpretation. In the mode of description which we first followed (using x_{1},x_{2},x_{3}), \varrho[35] and g_{ab} are certain functions of x_{1},x_{2},x_{3}; in the new one \varrho', g'_{ab} are certain other functions of x_{1}',x_{2}',x_{3}'. If now, without leaving the system of coordinates x_{1},x_{2},x_{3}, we ascribe to the density and to the gravitation potentials values which depend on x_{1},x_{2},x_{3}, in the same way as \varrho', g'_{ab} depended on x_{1}',x_{2}',x_{3}' just now, we shall obtain a new system (consisting of the attracting body and the gravitation field) which is different from the original system because other functions of the coordinates occur in it, but which nevertheless no observation will be able to discern from it, the indefiniteness which is a necessary consequence of the covariancy of the field equations, again presenting itself.

What has been said shows that the total gravitation energy in this new system will have the same value as in the original one, as has been found already in § 60 with the restrictions then introduced.


§ 63. If \mathfrak{t}' were a tensor, we should have for all substitutions the transformation formulae given at the end of § 40. In reality this is not the case now, but from (96) and (97) we can still deduce that those formulae hold for linear substitutions. They may likewise be applied to the stress-energy-components of the matter or of an electromagnetic system. Hence, if \mathfrak{T}_{a}^{b} represents the total stress-energy-components, i.e. quantities in which the corresponding components for the gravitation field, the matter and the electromagnetic field are taken together, we have for any linear transformation

\frac{1}{\sqrt{-g'}}\mathfrak{T}_{c}^{'b}=\frac{1}{\sqrt{-g}}\sum(kl)p_{kc}\pi_{lb}\mathfrak{T}_{k}^{l} (121)

We shall apply this to the case of a relativity transformation, which can be represented by the equations

x'_{1}=ax_{1}+bcx_{4},\ x'_{2}=x_{2},\ x'_{3}=x_{3},\ x'_{4}=ax_{4}+\frac{b}{c}x_{1} (122)

with the relation

a^{2}-b^{2}=1 (123)

In doing so we shall assume that the system, when described in the rectangular coordinates x_{1},x_{2},x_{3} and with respect to the time x_{4}, is in a stationary state and at rest.

Then we derive from (97)[36]

\mathfrak{t}_{1}^{'4}=\mathfrak{t}_{2}^{'4}=\mathfrak{t}_{3}^{'4}=0;\ \mathfrak{t}_{4}^{'1}=\mathfrak{t}_{4}^{'2}=\mathfrak{t}_{4}^{'3}=0

which means that in the system \left(x_{1},x_{2},x_{3},x_{4}\right) there are neither momenta nor energy currents in the gravitation field.

We may assume the same for the matter, so that we have for the total stress-energy-components in the system \left(x_{1},x_{2},x_{3},x_{4}\right)

\mathfrak{T}_{1}^{4}=\mathfrak{T}_{2}^{4}=\mathfrak{T}_{3}^{4}=0;\ \mathfrak{T}_{4}^{1}=\mathfrak{T}_{4}^{2}=\mathfrak{T}_{4}^{3}=0

Let us now consider especially the components \mathfrak{T}_{1}^{'4},\mathfrak{T}_{4}^{'1} and \mathfrak{T}_{4}^{'4} in the system \left(x'_{1},x'_{2},x'_{3},x'_{4}\right). For these we find from (121) and (122)

\mathfrak{T}_{1}^{'4}=\frac{ab}{c}\mathfrak{T}_{1}^{1}-\frac{ab}{c}\mathfrak{T}_{4}^{4},\ \mathfrak{T}_{4}^{'1}=-abc\ \mathfrak{T}_{1}^{1}+abc\ \mathfrak{T}_{4}^{4} (124)
\mathfrak{T}_{4}^{'4}=-b^{2}\mathfrak{T}_{1}^{1}+a^{2}\mathfrak{T}_{4}^{4} (125)

It is thus seen in the first place that between the momentum in the direction of x_{1}\left(-\mathfrak{T}_{1}^{'4}\right) and the energy-current in that direction \left(\mathfrak{T}_{4}^{'1}\right) there exists the relation

\mathfrak{T}_{4}^{'1}=-c^{2}\mathfrak{T}_{1}^{'4}

well known from the theory of relativity.
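
The coefficients in (124) and (125) can be checked term by term. The following is only a sketch of the intermediate steps, which the text does not write out. It assumes the convention p_{kc}=\partial x_{k}/\partial x'_{c} and \pi_{lb}=\partial x'_{b}/\partial x_{l}, as read off from the formulae of note 16, and it uses the fact that the determinant of the substitution (122) is a^{2}-b^{2}=1, so that \sqrt{-g'}=\sqrt{-g}.

```latex
% Inverse of (122), using a^2 - b^2 = 1 from (123)
x_{1}=ax'_{1}-bc\,x'_{4},\qquad x_{4}=ax'_{4}-\frac{b}{c}x'_{1}

% Coefficients with indices 1 and 4 (assumed convention, see the lead-in)
p_{11}=a,\quad p_{14}=-bc,\quad p_{41}=-\frac{b}{c},\quad p_{44}=a
\pi_{11}=a,\quad \pi_{41}=bc,\quad \pi_{14}=\frac{b}{c},\quad \pi_{44}=a

% Since \mathfrak{T}_{1}^{4}=\mathfrak{T}_{4}^{1}=0, only k=l=1 and k=l=4 contribute in (121)
\mathfrak{T}_{1}^{'4}=p_{11}\pi_{14}\mathfrak{T}_{1}^{1}+p_{41}\pi_{44}\mathfrak{T}_{4}^{4}
  =\frac{ab}{c}\mathfrak{T}_{1}^{1}-\frac{ab}{c}\mathfrak{T}_{4}^{4}
\mathfrak{T}_{4}^{'1}=p_{14}\pi_{11}\mathfrak{T}_{1}^{1}+p_{44}\pi_{41}\mathfrak{T}_{4}^{4}
  =-abc\,\mathfrak{T}_{1}^{1}+abc\,\mathfrak{T}_{4}^{4}
\mathfrak{T}_{4}^{'4}=p_{14}\pi_{14}\mathfrak{T}_{1}^{1}+p_{44}\pi_{44}\mathfrak{T}_{4}^{4}
  =-b^{2}\mathfrak{T}_{1}^{1}+a^{2}\mathfrak{T}_{4}^{4}

% Comparing the first two lines gives the relation between energy-current and momentum
\mathfrak{T}_{4}^{'1}=-c^{2}\,\mathfrak{T}_{1}^{'4}
```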

Further we have for the total energy in the system \left(x'_{1},x'_{2},x'_{3},x'_{4}\right)

E'=\int\mathfrak{T}_{4}^{'4}dx'_{1}dx'_{2}dx'_{3}

where the integration has to be performed for a definite value of the time x'_{4}. On account of (122) we may write for this

E'=\frac{1}{a}\int\mathfrak{T}_{4}^{'4}dx_{1}dx_{2}dx_{3}

where we have to keep in view a definite value of the time x_{4}.

If the value (125) is substituted here and if we take into consideration that, the state being stationary in the system \left(x_{1},x_{2},x_{3},x_{4}\right),

\int\mathfrak{T}_{1}^{1}dx_{1}dx_{2}dx_{3}=0

we have

E'=aE

if E is the energy ascribed to the system in the coordinates \left(x_{1},x_{2},x_{3},x_{4}\right). By integration of the first of the expressions (124) we find in the same way for the total momentum in the direction of x_{1}

G'=\frac{b}{c}E
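
The two integrations may be written out as follows. This is a sketch under the assumptions already stated in the text: the state is stationary in the system \left(x_{1},x_{2},x_{3},x_{4}\right), so that the integrals may be taken at a definite value of x_{4}, and the momentum density in the direction of x_{1} is -\mathfrak{T}_{1}^{'4}. The symbol E stands for \int\mathfrak{T}_{4}^{4}dx_{1}dx_{2}dx_{3}.

```latex
% At a definite value of x'_4 the inverse of (122), x_1 = a x'_1 - bc x'_4, gives dx_1 = a dx'_1
dx'_{1}dx'_{2}dx'_{3}=\frac{1}{a}\,dx_{1}dx_{2}dx_{3}

% Energy: substitute (125) and use \int\mathfrak{T}_{1}^{1}dx_{1}dx_{2}dx_{3}=0
E'=\frac{1}{a}\int\left(-b^{2}\mathfrak{T}_{1}^{1}+a^{2}\mathfrak{T}_{4}^{4}\right)dx_{1}dx_{2}dx_{3}
  =\frac{a^{2}}{a}\int\mathfrak{T}_{4}^{4}dx_{1}dx_{2}dx_{3}=aE

% Momentum: integrate -\mathfrak{T}_{1}^{'4} from the first of the expressions (124)
G'=-\int\mathfrak{T}_{1}^{'4}dx'_{1}dx'_{2}dx'_{3}
  =\frac{1}{a}\cdot\frac{ab}{c}\int\left(\mathfrak{T}_{4}^{4}-\mathfrak{T}_{1}^{1}\right)dx_{1}dx_{2}dx_{3}
  =\frac{b}{c}E
```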

§ 64. Equations (122) show that in the coordinates \left(x'_{1},x'_{2},x'_{3},x'_{4}\right) the system has a velocity of translation \tfrac{bc}{a} in the direction of x'_{1}. If this velocity is denoted by v, we have according to (123)

a=\frac{1}{\sqrt{1-\frac{v^{2}}{c^{2}}}}

If therefore we put

M=\frac{E}{c^{2}}

we find

E'=\frac{Mc^{2}}{\sqrt{1-\frac{v^{2}}{c^{2}}}},\ G'=\frac{Mv}{\sqrt{1-\frac{v^{2}}{c^{2}}}} (126)

When the system moves as a whole we may therefore ascribe to it an energy and a momentum which depend on the velocity of translation in the way known from the theory of relativity. The quantity M, to which the energy of the gravitation field also contributes a certain part, may be called the "mass" of the system. From what has been said in § 62 it follows that within certain limits it depends on the way in which the system and the gravitation field are described.
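
The step from (123) to (126) can be made explicit. The lines below are a sketch of the algebra only. The low-velocity expansion in the last line is added here for comparison with ordinary mechanics and is not part of the original text.

```latex
% A point at rest in the unprimed system has x_1 = a x'_1 - bc x'_4 = const, hence
v=\frac{dx'_{1}}{dx'_{4}}=\frac{bc}{a},\qquad b=\frac{av}{c}

% Insert b into (123)
a^{2}-\frac{a^{2}v^{2}}{c^{2}}=1\quad\Longrightarrow\quad a=\frac{1}{\sqrt{1-\frac{v^{2}}{c^{2}}}}

% With E = M c^2 the results E' = aE and G' = (b/c)E become (126)
E'=aMc^{2}=\frac{Mc^{2}}{\sqrt{1-\frac{v^{2}}{c^{2}}}},\qquad
G'=\frac{b}{c}Mc^{2}=aMv=\frac{Mv}{\sqrt{1-\frac{v^{2}}{c^{2}}}}

% For v much smaller than c (expansion added for illustration)
E'\approx Mc^{2}+\tfrac{1}{2}Mv^{2},\qquad G'\approx Mv
```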

It must be remarked however that, if for the gravitation field we had chosen the stress-energy-tensor \mathfrak{t}_{0} (§ 52), the total energy of the system even when in motion would be zero. The same would be true of the total momentum and we should have to put M=0.

At first sight it may seem strange that we may arbitrarily ascribe to the moving system the momentum determined by (126) or a momentum 0; one might be inclined to think that, when a definite system of coordinates has been chosen, the momentum must have a definite value, which might be determined by an experiment in which the system is brought to rest by "external" forces. We must remember however (comp. § 52) that in the theory of gravitation we may introduce no "external" forces without considering also the material system S' in which they originate. This system S' together with the system S with which we were originally concerned, will form an entity, in which there is a gravitation field, part of which is due to S' (and a part also to the simultaneous existence of S and S'). There is no doubt that we may apply the above considerations to the total system (S, S') without being led into contradiction with any observation.


  1. A. Einstein, Zur allgemeinen Relativitätstheorie, Berliner Sitzungsberichte 1915, pp. 778, 799; Die Feldgleichungen der Gravitation, ibid. 1915, p. 844.
  2. D. Hilbert, Die Grundlagen der Physik I, Göttinger Nachrichten, Math.-phys. Klasse, Nov. 1915.
  3. It will be known that in the theory of relativity Minkowski was the first who used this geometric representation in an extension of four dimensions. The name "world-line" has been borrowed from him.
  4. For the sake of simplicity we shall imagine the two motions not to be disturbed by this coincidence, so that e.g. two material points penetrate each other or pass each other at an extremely small distance without any mutual influence.
  5. In a correspondence I had with him.
  6. In other terms, that the data procured by astronomical observations can be extended arbitrarily and unboundedly.
  7. A "surface" determined by one equation between the coordinates is a three-dimensional extension. It will cause no confusion if sometimes we apply the name of "plane" to certain two-dimensional extensions, if we speak e.g. of the "plane" determined by two line-elements.
  8. This corresponds to the negative value which (1) gives for ds^{2} .
  9. For a radius-vector on the asymptotic cone we may take either of these values; this makes no difference, as the numerical value of a line-element in the direction of such a radius-vector becomes 0 in both cases.
  10. This agrees with the value of the Lagrangian function, which is to be found e.g. in my paper on "Hamilton's principle in Einstein's theory of gravitation." These Proceedings 19 (1916). p. 751.
  11. If, according to circumstances, different signs are given to \mathrm{R} , the angle whose sine occurs in the formula for the area of a parallelogram must be understood to be positive in one case and negative in the other.
  12. From § 10 it follows that if the length of a vector \mathrm{A} that is represented by a line (§ 17) coincides with a radius-vector of the conjugate indicatrix, it is always represented by an imaginary number. We may however obtain a vector which in natural units is represented by a real number e.g. by 1 (§ 13) if we multiply the vector \mathrm{A} by an imaginary factor, which means that its components and also those of a vector product in which it occurs are multiplied by that factor.
  13. In the above considerations difficulties might arise if the vector \mathrm{N} lay on the asymptotic cone of the indicatrix; our definition of a vector of the value 1 would then fail (comp. note 2, p. 1345). With a view to this we can choose the form of the extension \Omega (§ 13) in such a way that this case does not occur, a restriction leading to a boundary with sharp edges.
  14. Zittingsverslag Akad. Amsterdam, 23 (1915), p. 1073; translated in Proceedings Amsterdam, 19 (1916), p. 751. Further on this last paper will be cited by l. c.
  15. Comp. § 7, l. c.
  16. For the infinitesimal quantities x_a occurring in (19) we have namely (comp. (30))

    x'_{a}=\sum(b)\pi_{ba}x_{b}

    and taking into consideration (19) and (20), i.e.

    \xi_{a}=\sum(b)g_{ab}x_{b},\ x_{a}=\sum(b)\gamma_{ba}\xi_{b}

    and formula (7) l. c., we may write (comp. note 2, p. 758, l. c.)

    \begin{array}{l}
\xi'_{a}=\sum(b)g'_{ab}x'_{b}=\sum(bcde)p_{ca}p_{db}\pi_{eb}g_{cd}x_{e}=\\
\\
\qquad=\sum(cd)p_{ca}g_{cd}x_{d}=\sum(cdf)p_{ca}g_{cd}\gamma_{fd}\xi_{f}=\sum(c)p_{ca}\xi_{c}
\end{array}

  17. Put \Xi_{a}^{I}\Xi_{b}^{II}=\vartheta_{ab}. Then we have

    \vartheta'_{ab}=\Xi_{a}^{'I}\Xi_{b}^{'II}=\sum(cd)p_{ca}p_{db}\Xi_{c}^{I}\Xi_{d}^{II}=\sum(cd)p_{ca}p_{db}\vartheta_{cd}

    and similar formulae for the other three parts of (25).

  18. Comp. (28) l. c.
  19. Published September 1916, a revision having been found desirable.
  20. See Proceedings Vol. XIX, p. 1341 and 1354.
  21. Namely:

    g'^{lk}=\sum(ab)\pi_{ak}\pi_{bl}g^{ab}

    The symbol \left(g^{kl}\right) denotes the complex of all the quantities g^{kl}.

  22. Namely:

    G'_{im}=\sum(ab)p_{ai}p_{bm}G_{ab}

  23. On account of the relation

    \sqrt{-g'}dS'=\sqrt{-g}dS

  24. Similarly:

    g^{ba}=g^{ab},\ \mathfrak{g}^{ba}=\mathfrak{g}^{ab}

  25. This means that the transformation formulae for these quantities have the form

    (ik,lm)'=\sum(abce)p_{ai}p_{bk}p_{cl}p_{em}(ab,ce)

    See for the notations used here and for some others to be used later on my communication in Zittingsverslag Akad. Amsterdam 23 (1915), p. 1073 (translated in Proceedings Amsterdam 19 (1916), p. 751). In referring to the equations and the articles of this paper I shall add the indication 1915.

  26. Suppose that at the boundary of the domain of integration \delta g_{ab}=0 and \delta g_{ab,e}=0. Then we have also \delta\mathfrak{g}^{ab}=0 and \delta\mathfrak{g}^{ab,e}=0, so that

    \int\left(\delta_{2}Q\right)dS=0,\ \int\delta_{2}QdS=0

    and from

    \int(\delta Q)dS=\int\delta Q\ dS

    we infer

    \int(\delta_{1}Q)dS=\int\delta_{1}Q\ dS

    As this must hold for every choice of the variations \delta g_{ab} (by which choice the variations \delta\mathfrak{g}^{ab} are determined too) we must have at each point of the field-figure

    (\delta_{1}Q)=\delta_{1}Q

  27. This may be made clear by a reasoning similar to that used in the preceding note. We again suppose \delta g_{ab} and \delta g_{ab,e} to be zero at the boundary of the domain of integration. Then \delta g'_{ab} and \delta g'_{ab,e} vanish too at the boundary, so that

    \int\delta_{2}Q'\ dS'=0,\ \int\delta_{2}Q\ dS=0

    From

    \int\delta Q'\ dS'=\int\delta Q\ dS

    we may therefore conclude that

    \int\delta_{1}Q'\ dS'=\int\delta_{1}Q\ dS

    As this must hold for arbitrarily chosen variations \delta g_{ab} we have the equation

    \delta_{1}Q'\ dS'=\delta_{1}Q\ dS

  28. Einstein uses the word "divergency" in a somewhat different sense. It seemed desirable however to have a name for the left hand side of (54) and it was difficult to find a better one.
  29. This has also been done by de Donder, Zittingsverslag Akad. Amsterdam, 35 (1916), p. 153.
  30. The notations \psi_{ab},\overline{\psi_{ab}} and \psi_{ab}^{*} (see (27), (29) and § 11, 1915), will however be preserved though they do not correspond to those of Einstein. As to formulae (59) and (60) it is to be understood that if p and q are two of the numbers 1, 2, 3, 4, p' and q' denote the other two in such a way that the order p\ q\ p'\ q' is obtained from 1 2 3 4 by an even number of permutations of two ciphers.
    If x_{1},x_{2},x_{3},x_{4} are replaced by x, y, z, t and if for the stresses the usual notations X_{x},X_{y}, etc., are used (so that e.g. for a surface element d\sigma perpendicular to the axis of x, X_{x} is the first component of the force per unit of surface which the part of the system situated on the positive side of d\sigma exerts on the opposite part) then \mathfrak{T}_{1}^{1}=X_{x},\mathfrak{T}_{1}^{2}=X_{y}, etc. Further -\mathfrak{T}_{1}^{4},-\mathfrak{T}_{2}^{4},-\mathfrak{T}_{3}^{4} are the components of the momentum per unit of volume and \mathfrak{T}_{4}^{1},\mathfrak{T}_{4}^{2},\mathfrak{T}_{4}^{3} the components of the energy-current. Finally \mathfrak{T}_{4}^{4} is the energy per unit of volume.
  31. The quantities \gamma_{ab} in that equation are the same as those which are now denoted by g^{ab}.
  32. In the cases considered in § 43, \delta\mathrm{L} can indeed be represented in this way.
  33. To make the notation agree with that of § 38 b has been replaced by e.
  34. Of the laborious calculation it may be remarked here only that it is convenient to write the values (110) in the form

    g_{11}=-1+\alpha+\frac{\partial^{2}\beta}{\partial x_{1}^{2}},\ etc.

    g_{12}=\frac{\partial^{2}\beta}{\partial x_{1}\partial x_{2}},\ etc.

    where \alpha and \beta are infinitesimal functions of r. We then find

    \begin{array}{l}
\mathfrak{t}_{4}^{'4}=\frac{c}{2\varkappa}\left\{ -\frac{1}{2}\sum(a)\left(\frac{\partial\alpha}{\partial x_{a}}\right)^{2}+\sum(a)\frac{\partial\nu}{\partial x_{a}}\frac{\partial\alpha}{\partial x_{a}}+\right.\\
\\
\qquad\left.+\frac{1}{4}\sum(aik)\left[\frac{\partial^{3}\beta}{\partial x_{a}\partial x_{i}^{2}}\frac{\partial^{3}\beta}{\partial x_{a}\partial x_{k}^{2}}-\left(\frac{\partial^{3}\beta}{\partial x_{a}\partial x_{i}\partial x_{k}}\right)^{2}\right]\right\} \\
\\
\qquad\qquad(a,i,k=1,2,3)
\end{array}


    which reduces to (111) if the relations between \alpha,\beta and \lambda,\mu, viz.

    \alpha+\frac{1}{r}\beta'=-\lambda,\ -\frac{1}{r}\beta'+\beta''=\lambda-\mu

    and the equality \alpha'=\nu' involved in (109) are taken into consideration.

  35. By \varrho we mean here what was denoted by \bar{\varrho} in § 56.
  36. We have g_{14}=g_{24}=g_{34}=0, while all the other quantities g_{ab} are independent of x_{4}. Thus we can say that the quantities g_{ab} and g_{ab,c} are equal to zero when among their indices the number 4 occurs an odd number of times. The same may be said of g^{ab}, g^{ab,c}, \tfrac{\partial Q}{\partial g_{ab,cd}} (according to (116)), \tfrac{\partial}{\partial x_{k}}\left(\tfrac{\partial Q}{\partial g_{ab,cd}}\right) and also of products of two or more of such quantities. As in the last two terms of (97) the indices a, b and f occur twice, these terms will vanish when only one of the indices e and h has the value 4. As to the first term of (97) we remark that, according to the formulae of § 32, each of the indices a, b and e occurs only once in the differential coefficient of Q with respect to g_{ab,e}, while other indices are repeated. As to the number of times which e, h and the other indices occur we can therefore say the same of the first term of (97) as of the other terms. The first term also is therefore zero, if no more than one of the two indices e and h has the value 4.
    That \mathfrak{t}_{4}^{'e} vanishes for e\ne4 is seen immediately.
This work is in the public domain in the United States because it was published before January 1, 1923.

The author died in 1928, so this work is also in the public domain in countries and areas where the copyright term is the author's life plus 80 years or less. This work may also be in the public domain in countries and areas with longer native copyright terms that apply the rule of the shorter term to foreign works.