The Conformal Transformations of a Space of Four Dimensions and their Applications to Geometrical Optics
THE CONFORMAL TRANSFORMATIONS OF A SPACE OF FOUR DIMENSIONS
AND THEIR APPLICATIONS TO GEOMETRICAL OPTICS
By H. Bateman.
[Received October 9th, 1908. — Read November 12th, 1908.]
§ 2. The Conformal Transformations of a Space of Four Dimensions. § 4. Applications to Geometrical Optics. § 5. Applications of the preceding results to a Symmetrical Optical Instrument. 
1. The method of inversion which was first applied to problems in electrostatics by Lord Kelvin,^{[1]} and which forms the basis of his theory of electric images, has also been applied with success in other branches of mathematical physics, as, for instance, in hydrodynamics. In geometrical optics, however, the method has been seldom used, probably because the necessary developments are not to be found in books on geometrical optics. The object of this paper is to show that the method can be of real value in both geometrical and physical optics. It is found that the transformation which is really needed is an inversion in a space of four dimensions, the transition to threedimensional space being made by replacing the fourth coordinate by ict, where t is the time and c the velocity of light.
The first part of the paper is devoted to the general conformal transformation of a space of four dimensions. Shortly after Lord Kelvin's discovery of the method of transforming electrostatical problems by means of inversion,^{[2]} Liouville^{[3]} obtained the most general transformation that can be used for threedimensional problems in this way.
The group of transformations of this kind is known as the group of conformal transformations of space,^{[4]} it preserves the angles between two surfaces and changes a sphere into either a sphere or a plane.^{[5]}
The property, however, upon which the applications to electrostatical problems depends is that the transformations enable us to pass from one solution of the equation
. 
to another.^{[6]}
Now the group of conformal transformations in a space of four dimensions possesses the analogous property in connection with the two differential equations

which are fundamental in the wave theory of light.
This has been known for some time,^{[7]} but the analysis given in § 2 will be useful in indicating the procedure to be adopted to obtain the relation connecting the two solutions for any transformation of the group.
In § 3 a particular solution of the first of the above equations is expressed in terms of Riemann's general hypergeometric function, and new light is thrown upon the theory of the transformations of the hypergeometric equation into itself.
In § 4 the applications to geometrical optics are considered. When applied to a symmetrical optical instrument, the transformation reduces to a homographic transformation of the points on the axis.
2. The Conformal Transformations of a Space of Four Dimensions.
The study of the conformal transformations of a space of four dimensions is simplified by the introduction of the six homogeneous coordinates^{[8]}:
(1) 
connected by the identical relation
(2) 
A function F(x, y, z, w) can be expressed by means of them as a homogeneous function of arbitrary degree. For instance, we may write

In the first representation V is a homogeneous function of degree zero, and in the second U is a homogeneous function of degree 1. The coordinate n may be introduced into the representations by means of the identical relation (2), the homogeneity of the expression being thereby unaltered.
Conversely, any homogeneous function of the six variables (l, m, n, λ, μ, ν) can be expressed as a function of (x, y, z, w). We shall now consider under what circumstances such a function can satisfy one of the differential equations—
But, since V is a homogeneous function of degree zero,
and ν = 1, therefore
If, instead of (l, m, n, λ, μ, ν), we use the usual hexaspherical coordinates defined by the relations
;

the equation takes the more symmetrical form
This relation shows that a homogeneous function of degree zero, which is a solution of
i.e., of
when expressed in terms of x, y, z, w.
Next, let U be a homogeneous function of degree 1 in (l, m, n, λ, μ, ν), then we can show in a similar way that
Hence, if U is a solution of
i.e., of
it is a solution of
when expressed in terms of (x, y, z, w).
When () are interpreted as the coordinates of a point in a space of six dimensions, the expressions
remain unaltered in form after any change of rectangular axes in which the origin remains the same. Any change of this kind corresponds to a transformation in the (x, y, z, w) space, enabling us to pass from one solution of the equation
to another, and a similar remark applies to the equation
To illustrate the method of formation of the transformation, we may consider the effect of simply interchanging n and ν. The functions V and U of formulæ (8) and (4) then transform into
and
respectively, r² being written in place of x² + y² + z² + w².
Accordingly, from a solution V = F(x, y, z, w) of the equation
we may derive a second solution
and from the solution U = f(x, y, z, w) of
we may derive another solution
Putting w = ict, where c is the velocity of light and t is the time, the equations take the well known form^{[9]}

and the transformation may be written
where now
The study of this transformation will be taken up later.
A second transformation of some interest is obtained by interchanging m with n and μ with ν. This changes
into
that is
f(x, y, z, w)
into
Putting w = ict and changing the sign of z, the formulæ for the transformation are
where, now,
If V = F(x, y, z, t) is a solution of
the function F(X, Y, Z, T) is also a solution, and, if U = f(x, y, z, t) is a solution of
the function
is also a solution.
In following up the connection between different solutions, it is convenient to use polar coordinates. Putting

we obtain the relations
There is a similar transformation for Laplace's equation.^{[10]}
If
a solution f(x, y, z) corresponds to a second solution
Putting
the formulæ of transformation become
The transition from one solution of Laplace's equation to another is now easily effected.
The effects of combining the different transformations belonging to a group of conformal transformations is most easily studied by interpreting the transformation as a change of axes in a space in which the coordinates are the spherical coordinates .^{[11]} It is important to notice that the angle between two manifolds in this space is equal to the angle between the corresponding manifolds in the space to which the conformal transformations are applied. In the case of a space of four dimensions, we have, in fact,

from which the result easily follows.
A change in the sign of corresponds to an inversion, a change in the sign of coupled with a change in the sign of corresponds to the other transformation we have mentioned. It is evident that each of these transformations is of period 2. In general, a reflexion in a linear manifold in the a space corresponds to an inversion with regard to the corresponding circle, sphere, or hypersphere, in the space of four dimensions. A displacement of period n in the a space may be obtained by taking successive reflexions in two plane fivefolds which cut at an angle π/n.^{[12]} It evidently corresponds to a periodic conformal transformation made up of inversions with regard to two hyperspheres which cut at an angle π/n.
3. The Relation between Riemann's General Hypergeometric Function and the Group of Conformal Transformations of a Space of Four Dimensions.
We shall now endeavour to satisfy the differential equation
by means of a function of the form
(1) 
. 
if the relation
is satisfied, for it will then be a homogeneous function of degree 1 in (l, m, n, λ, μ, ν).
Let us put
where a, b and c are arbitrary constants. The relation
is then satisfied, and P becomes a function of z alone. We may thus write
the particular functional form in terms of ξ, η, ζ being chosen to facilitate the calculations. H is clearly a homogeneous function of degree zero in ξ, η, ζ and therefore in l, m, n, λ, μ, ν.
On differentiating equation (1), we obtain

The differential equation
will thus be satisfied, if


Hence
Also, since
we have
this relation being obtained in the same way as the one above.
The last expression may be written
i.e.,
or
Now
and
; 
consequently,
This is Papperitz's form^{[13]} of the differential equation satisfied by Riemann's general hypergeometric function^{[14]}
; 
hence we have the result that
is a homogeneous function of (l, m, n, λ, μ, ν) of degree 1, satisfying the equation
When expressed in terms of x, y, z and w, it will thus be a solution of the equation
The various transformations^{[15]} of the general hypergeometric function are easily obtained from this result. If we write U in the form
we see that
is a multiple of
Again, if we write
we have
so that
This shows that P is the same function of the quantities a', b', c', z' as it is of a, b, c, z; that is
Hence the general hypergeometric function is unaltered if the quantities a, b, c, z are replaced by quantities a', b', c', z' which are derived from them by the same homographic substitution.
4. Applications to Geometrical Optics.
Let us consider a series of waves of light traversing a homogeneous or heterogeneous medium, and let
be the reduced path from a standard orthotomic surface or wave front to the point (x, y, z). Let us suppose, moreover, that V is expressed only in terms of the coordinates (x, y, z), and the constants of the standard wave front. Then V satisfies the differential equation^{[16]}
and is, in fact, the characteristic function introduced by Hamilton. If it is expressed as a function of x, y, z, and the coordinates of the initial point , it is the Eikonal according to the nomenclature of Bruns.^{[17]}
Since V is proportional to the time this differential equation may be replaced by
where C is the velocity of radiation at the point (x, y, z).
Now suppose that the surfaces t = const, are obtained by solving an equation
for t; then, since
, 
the function F must satisfy the differential equation
Confining ourselves to the case in which C is constant, we may use the results of § 2 to obtain new solutions of this differential equation.
Let
be the formulæ giving a transformation which enables us to pass from one solution of the above equation to another; then
when expressed in terms of x, y, z, t, is a second solution of the equation, and if the equation
be solved for t, the surfaces t = const, will form a system of parallel wave surfaces and t considered as a function of (x, y, z) will be the characteristic function for them.
The transformation
is of special importance because it makes the standard wave surface t = 0 in the original system correspond to a standard wave surface t = 0 in the new system; also, since the equations of the surfaces are
and
respectively, it is clear that one is the inverse of the other with regard to a unit sphere whose centre is at the origin. Our theorem tells us that if the surfaces parallel to the first are given by
the surfaces parallel to the second are given by
Applying this result to the family of right circular cones parallel to a given one, we may obtain the family of Dupin's cyclides parallel to a given one.
To obtain a geometrical interpretation of the transformation we describe a sphere of radius ct round the point (x, y, z) as centre. The inverse sphere is then of radius cT and its centre is at the point (XYZ).
A more general result is that the sphere
corresponds in the transformation to the sphere
, 
the centres of the two spheres being corresponding points at the times respectively.
To show that the laws of reflection and refraction remain unchanged in the transformation, we take the surface at which the light is incident as the standard one from which the time is measured. Let be a point on this surface; then we must associate with this point the time .
Consider a ray of light travelling from in a direction (l, m, n) with velocity c. At time t the wave has reached a point (x, y, z) on the ray, where
The corresponding point (X, Y, Z) derived from this by the transformation also travels along a straight line, for
, 
where
The corresponding ray thus passes through the inverse point on the inverse surface at which it may be supposed to be incident. Its direction cosines (L, M, N) are connected with those of the former ray by means of the equations
These relations establish a correspondence between the sheafs of rays through the points , respectively. This correspondence is such that the angle between two rays (l, m, n), (l', m', n') is equal to the angle between the two corresponding rays (L, M, N), (L', M', N'), for we have identically
Since the transformation enables us to derive the surfaces which are parallel to one surface from the surfaces which are parallel to the inverse surface, it is natural to expect that the above relation between the direction cosines will make the normals to the two surfaces correspond.
Now
hence, if are the direction cosines of a tangent to the first surface, the direction cosines of the tangent for the corresponding displacement on the inverse surface are given by
The tangents to the two surfaces can thus be paired with one another in the correspondence.
Now, if (l, m, n) are the direction cosines of the normal to one surface, those of a tangent,
It follows, then, that for the corresponding ray (L, M, N)
This ray, being perpendicular to all the tangents to the inverse surface, is the normal at to this surface.
Since the angles between lines remain unaltered by the transformation, it follows that if (l, m, n), (l', m', n'), (p, q, r) are the direction cosines of an incident ray, the refracted ray and the normal at a point on a surface f, the corresponding quantities (L, M, N), (L', M', N'), (P, Q, R) are the direction cosines of an incident ray, refracted ray and the normal at the point of the inverse surface F. This is also easy to verify from the analytical formulæ.
The method of inversion can thus be applied to problems in geometrical optics. A medium of refractive index μ inverts into a medium of refractive index μ, a ray through the origin corresponds to a ray through the origin, and a ray through a fixed point (x, y, z) not on the surface of separation of the two media corresponds to a ray intersecting the line joining (x, y, z) to the origin.
A spherical shell of radiation emitted by an electron whose velocity was suddenly changed at time t, corresponds to a shell of radiation emitted by an electron whose velocity was suddenly changed at time T.
This method of inversion promises to be of great importance in the theory of radiation. It will be noticed that the spheres of radii ct whose centres are at the points
all touch one another at the point which may be regarded as a node of the radiation.
From a study of the nodes in the ether produced by a vibrating atom or molecule, it appears that various systems of nodes may be obtained from one another by inversions which correspond to the same frequencies of vibration. The investigation will be reserved, however, for another paper.
5. Application of the Preceding Results to a Symmetrical Optical Instrument.
Let QR (Fig.) be the incident ray, Q'R the refracted ray, OR the normal to the spherical interface, and let C be the centre of inversion.
We shall suppose that the incident ray makes a small angle with the axis.
Let
and let the velocity of light for the first medium be represented by 1/μ, the times corresponding to the points Q, Q', O, A respectively may then be taken to be μx, μx', μa, 0, respectively, and the corresponding quantities ct are simply the reduced distances (x, x', a, 0).
Let be the points corresponding to Q, Q', O, A in the transformation, their distances from C.
Now the sphere centre Q and radius QA inverts into the sphere centre and radius ; hence, if the radius of inversion be equal to k³, we have for the points in which these spheres meet the axis
We also have the relations

Let c be the distance of the second interface from the first; then we associate the length c with the second interface B, and the formulæ of the transformation become

The last of which may be written
i.e.,
where z and x' are measured, as before, from the first surface. This is exactly the former relation between two corresponding points; consequently the whole course of a ray in one instrument corresponds in the transformation to that of the corresponding ray in the corresponding instrument.^{[18]}
The linear magnification is given by
hence
which is of the same form as before.
If M and be the total linear magnifications for the two instruments, we have
The relation between two corresponding points is evidently a homographic one; hence we have the following theorem:—
If the points on the axis of a symmetrical optical instrument be transformed by means of a homographic transformation, any pair of conjugate points for the instrument are transformed into a pair of points which are conjugate with regard to a second instrument. The centres of curvature of the interfaces and the points in which the interfaces meet the axis correspond in the two instruments, and the refractive indices of corresponding media are the same.
 ↑ In a letter to Liouville dated October 8th, 1845. Liouville's Journal de Mathématiques 1845).
 ↑ The method of inversion had been used in geometry some time before. It apparently originated with Ptolemy. Quetelet used it in 1827 and Bellavatis gave a general statement of it in 1836. In 18434 it was propounded afresh by Ingram and Stubbs (Transactions of the Dublin Philosophical Society, Vol. I., pp. 58, 145, 159 ; Philosophical Magazine, Vol. XXIII., p. 338, Vol. XXV., p. 208).
 ↑ Journal de Mathématiques (1845) ; T. XV. (1850), p. 103.
 ↑ A simple method of obtaining the group of conformal transformations is given in Bianchi's Vorlesungen über Differential Geometrie, Leipzig (1899), p. 487. Another investigation is given in Maxwell's Collected Papers, Vol. II., p. 297, where reference is made to a paper by J. N. Haton de Goupillière, Journal de l'Ecole Polytechnique, T. XXV., p. 188 (1867). See also a paper by Bromwich, Proc. London Math. Soc., Vol. XXXIII., p. 185, and three papers by Tait, Collected Papers, Vol. I., pp. 176, 352, Vol. II., p. 329.
 ↑ The effect of combining the elementary transformations of the group is discussed by Darboux, Une Classe remarquable de courbes et de surfaces algébriques, Paris (1896), pp. 236241. It is shown that any number of successive inversions can be replaced by a single inversion followed by a displacement. It follows from this that any conformal transformation of the group can be replaced by successive inversions with regard to suitably chosen spheres, Cf. Math. Tripos, Part I. (1903).
 ↑ In this connection see a paper by Forsyth, Proceedings of the London Mathematical Society, Vol. XXIX. (1898), p. 165. The transformations which can be applied to the equation
are derived by J. E. Campbell, Messenger of Mathematics, Vol. XXVIII. (1898), p. 97.
 ↑ Liouville's theorem was extended by Lie to a space of n dimensions in 1871. Math. Ann., Vol. V., p. 145; Göttinger Nachrichten, May, 1871. I cannot, however, find any statement with regard to the first of the two equations.
 ↑ These bear the same relation to the hexaspherical coordinates of a point as the ordinary line coordinates of a line bear to the system of coordinates introduced by Klein.
 ↑ It may be mentioned here that Lorentz's fundamental equations of the electron theory, viz.,
may be reduced to a symmetrical form by writing s = ict and putting
The four mutually orthogonal vectors (A, B, C, D) whose components are respectively
(0, r, q, p), (r, 0, p, q), (q, p, 0, r), (p, q, r, 0)
satisfy the equations
where
if
Again, if we put
and introduce four new vectors whose components are respectively
(S, Z, Y, X) (Z, S, X, Y) (Y, X, S, Z) (X, Y, Z, S),
we find
where
Finally, if X, Y, Z, S can be derived from a potential function n so that
we can form four mutually orthogonal vectors θ, Φ, ψ, χ whose components are respectively
(n, r, q, p). (r, n, p, q), (q, p, n, r), (p, q, r, n),
and the equations then take the simple form
 ↑ This transformation was given by the author in a Smith's Prize Essay of 1905; it was deduced from a result given by Brill, Messenger of Mathematics (1891), pp. 135137. If is a solution of the differential equation
another solution is given by
 ↑ Reference should be made to Darboux, Théorie des Surfaces, Tome I., p. 213. Jessop's Treatise on the Line Complex, p. 251. Koenig's La Géometrie réglée, p. 125, and to an article by Borel on the "Transformations of Geometry in Niewenglowsky's Solid Geometry."
 ↑ This is equivalent to a rotation through an angle 2π/n just as successive reflexions in two planes are equivalent to a rotation about their line of intersection.
 ↑ Mathematische Annalen, T. XXV. (1885), p. 213.
 ↑ Abhandlungen d. K. Gesell. d. Wissenschaften zu Göttingen, Band VII. (1857), Gesammelte Werke, p. 63.
 ↑ See Whittaker's Analysis, p. 240. Forsyth's Theory of Linear Differential Equations, Vol. IV., p. 135.
 ↑ See Herman's Optics, p. 253.
 ↑ Cf. Schwarzschild's Untersuchungen zur Geometrischen Optik, Göttingen Abhandlungen (2), 4.
 ↑ We can verify the equation
connecting two conjugate points in the second instrument as follows
and
since Q" and Q' are conjugate points in the first instrument; hence the relation is satisfied.
This work is in the public domain in the United States because it was published before January 1, 1923.
The author died in 1946, so this work is also in the public domain in countries and areas where the copyright term is the author's life plus 70 years or less. This work may also be in the public domain in countries and areas with longer native copyright terms that apply the rule of the shorter term to foreign works.