1911 Encyclopædia Britannica/Differential Equation

←

1911 Encyclopædia Britannica, Volume 8

Differential Equation by Henry Frederick Baker

→

sister projects: Wikipedia article, quotes, course, Wikidata item.

7945701911 Encyclopædia Britannica, Volume 8 — Differential EquationHenry Frederick Baker

DIFFERENTIAL EQUATION, in mathematics, a relation between one or more functions and their differential coefficients. The subject is treated here in two parts: (1) an elementary introduction dealing with the more commonly recognized types of differential equations which can be solved by rule; and (2) the general theory.

Part I.—Elementary Introduction.

Of equations involving only one independent variable, x (known as ordinary differential equations), and one dependent variable, y, and containing only the first differential coefficient dy/dx (and therefore said to be of the first order), the simplest form is that reducible to the type

dy/dx＝ƒ(x)/F(y),

leading to the result ƒF(y)dy − ƒf(x)dx＝A, where A is an arbitrary constant; this result is said to solve the differential equation, the problem of evaluating the integrals belonging to the integral calculus.

Another simple form is

dy/dx + y P＝Q,

where P, Q are functions of x only; this is known as the linear equation, since it contains y and dy/dx only to the first degree. If ƒPdx = u, we clearly have

${\frac {d}{dx}}(ye^{u})=e^{u}\left({\frac {dy}{dx}}+\mathrm {P} y\right)=e^{u}\mathrm {Q} ,$

so that y = e^−u(ƒe^uQdx + A) solves the equation, and is the only possible solution, A being an arbitrary constant. The rule for the solution of the linear equation is thus to multiply the equation by e^u, where u = ƒPdx.

A third simple and important form is that denoted by

y＝px + ƒ(p),

where p is an abbreviation for dy/dx; this is known as Clairaut’s form. By differentiation in regard to x it gives

$p=p+x{\frac {dp}{dx}}+f'(p){\frac {dp}{dx}},$

where

$f'(p)={\frac {d}{dp}}f(p);$

thus, either (i.) dp/dx＝ 0, that is, p is constant on the curve satisfying the differential equation, which curve is thus any one of the straight lines y = cx = ƒ(c), where c is an arbitrary constant, or else, (ii.) x + ƒ′(p) = 0; if this latter hypothesis be taken, and p be eliminated between x + ƒ′(p) = 0 and y = px + ƒ(p), a relation connecting x and y, not containing an arbitrary constant, will be found, which obviously represents the envelope of the straight lines y = cx + ƒ(c).

In general if a differential equation φ(x, y, dy/dx) = 0 be satisfied by any one of the curves F(x, y, c) = 0, where c is an arbitrary constant, it is clear that the envelope of these curves, when existent, must also satisfy the differential equation; for this equation prescribes a relation connecting only the co-ordinates x, y and the differential coefficient dy/dx, and these three quantities are the same at any point of the envelope for the envelope and for the particular curve of the family which there touches the envelope. The relation expressing the equation of the envelope is called a singular solution of the differential equation, meaning an isolated solution, as not being one of a family of curves depending upon an arbitrary parameter.

An extended form of Clairaut’s equation expressed by

y＝xF(p) + ƒ(p)

may be similarly solved by first differentiating in regard to p, when it reduces to a linear equation of which x is the dependent and p the independent variable; from the integral of this linear equation, and the original differential equation, the quantity p is then to be eliminated.

Other types of solvable differential equations of the first order are (1)

M dy/dx＝N,

where M, N are homogeneous polynomials in x and y, of the same order; by putting v = y/x and eliminating y, the equation becomes of the first type considered above, in v and x. An equation (aB≷bA)

(ax+by+c)dy/dx＝Ax+By+C

may be reduced to this rule by first putting x+h, y+k for x and y, and determining h, k so that ah+bk+c = 0, Ah+Bk+C = 0.

(2) An equation in which y does not explicitly occur,

ƒ(x, dy/dx)＝0,

may, theoretically, be reduced to the type dy/dx = F(x); similarly an equation F(y, dy/dx) = 0.

(3) An equation

ƒ(dy/dx, x, y)＝0,

which is an integral polynomial in dy/dx, may, theoretically, be solved for dy/dx, as an algebraic equation; to any root dy/dx = F₁(x, y) corresponds, suppose, a solution φ₁(x, y, c) = 0, where c is an arbitrary constant; the product equation φ₁(x, y, c)φ₂(x, y, c) . . . = 0, consisting of as many factors as there were values of dy/dx, is effectively as general as if we wrote φ₁(x, y, c₁)φ₂(x, y, c₂) . . . = 0; for, to evaluate the first form, we must necessarily consider the factors separately, and nothing is then gained by the multiple notation for the various arbitrary constants. The equation φ₁(x, y, c)φ₂(x, y, c) . . . = 0 is thus the solution of the given differential equation.

In all these cases there is, except for cases of singular solutions, one and only one arbitrary constant in the most general solution of the differential equation; that this must necessarily be so we may take as obvious, the differential equation being supposed to arise by elimination of this constant from the equation expressing its solution and the equation obtainable from this by differentiation in regard to x.

A further type of differential equation of the first order, of the form

dy/dx＝A + By + Cy ²

in which A, B, C are functions of x, will be briefly considered below under differential equations of the second order.

When we pass to ordinary differential equations of the second order, that is, those expressing a relation between x, y, dy/dx and d ²y/dx², the number of types for which the solution can be found by a known procedure is very considerably reduced. Consider the general linear equation

${\frac {d^{2}y}{dx^{2}}}+\mathrm {P} {\frac {dy}{dx}}+\mathrm {Q} =\mathrm {R} ,$

where P, Q, R are functions of x only. There is no method always effective; the main general result for such a linear equation is that if any particular function of x, say y ₁, can be discovered, for which

${\frac {d^{2}y_{1}}{dx^{2}}}+\mathrm {P} {\frac {dy_{1}}{dx}}+\mathrm {Q} y_{1}=0,$

then the substitution y = y ₁η in the original equation, with R on the right side, reduces this to a linear equation of the first order with the dependent variable dη/dx. In fact, if y = y ₁η we have

${\frac {dy}{dx}}=y_{1}{\frac {d\eta }{dx}}+\eta {\frac {dy_{1}}{dx}}$ and ${\frac {d^{2}y}{dx^{2}}}=y_{1}{\frac {d^{2}\eta }{dx^{2}}}+2{\frac {dy_{1}}{dx}}{\frac {d\eta }{dx}}+\eta {\frac {d^{2}y_{1}}{dx^{2}}},$

and thus

${\frac {d^{2}y}{dx^{2}}}+\mathrm {P} {\frac {dy}{dx}}+\mathrm {Q} =y_{1}{\frac {d^{2}\eta }{dx^{2}}}+\left(2{\frac {dy_{1}}{dx}}+\mathrm {P} y_{1}\right){\frac {d\eta }{dx}}+\left({\frac {d^{2}y_{1}}{dx^{2}}}+\mathrm {P} {\frac {dy_{1}}{dx}}+\mathrm {Q} y_{1}\right)\eta ;$

if then

${\frac {d^{2}y_{1}}{dx^{2}}}+\mathrm {P} {\frac {dy_{1}}{dx}}+\mathrm {Q} y_{1}=0,$

and z denote dη/dx, the original differential equation becomes

$y_{1}{\frac {dz}{dx}}+\left(2{\frac {dy_{1}}{dx}}\mathrm {P} y_{1}\right)=\mathrm {R} .$

From this equation z can be found by the rule given above for the linear equation of the first order, and will involve one arbitrary constant; thence y = y ₁ η = y ₁ ∫ zdx + Ay ₁, where A is another arbitrary constant, will be the general solution of the original equation, and, as was to be expected, involves two arbitrary constants.

The case of most frequent occurrence is that in which the coefficients P, Q are constants; we consider this case in some detail. If θ be a root of the quadratic equation θ² + θP + Q = 0, it can be at once seen that a particular integral of the differential equation with zero on the right side is y ₁ = e^θx. Supposing first the roots of the quadratic equation to be different, and φ to be the other root, so that φ + θ = −P, the auxiliary differential equation for z, referred to above, becomes dz/dx + (θ − φ)z = Re^−θx which leads to ze^(θ−φ) = B + ∫ Re^−θxdx, where B is an arbitrary constant, and hence to

$y=Ae^{\theta x}+e^{\theta x}\int \mathrm {B} e^{(\phi -\theta )x}dx+e^{\theta x}\int e^{(\phi -\theta )x}\int Re^{-\phi x}dxdx,$

or say to $y=\mathrm {A} e^{\theta x}+\mathrm {C} e^{\phi x}+\mathrm {U}$ , where A, C are arbitrary constants and U is a function of x, not present at all when R = 0. If the quadratic equation θ² + Pθ + Q = 0 has equal roots, so that 2θ = −P, the auxiliary equation in z becomes ${\frac {dz}{dx}}=Re^{\theta x}$ giving $z=B+\int \mathrm {R} e^{\theta x}dx$ , where B is an arbitrary constant, and hence

$y=(\mathrm {A} +\mathrm {B} x)e^{\theta x}+e^{\theta x}\iint \mathrm {R} e^{-\theta x}dxdx,$

or, say, $y=(\mathrm {A} +\mathrm {B} x)e^{\theta x}+U$ , where A, B are arbitrary constants, and U is a function of x not present at all when R = 0. The portion $\mathrm {A} e^{\theta x}+\mathrm {B} e^{\phi x}$ or $(\mathrm {A} +\mathrm {B} x)e^{\theta x}$ of the solution, which is known as the complementary function, can clearly be written down at once by inspection of the given differential equation. The remaining portion U may, by taking the constants in the complementary function properly, be replaced by any particular solution whatever of the differential equation

${\frac {d^{2}y}{dx^{2}}}+\mathrm {P} {\frac {dy}{dx}}+\mathrm {Q} =\mathrm {R} ;$

for if u be any particular solution, this has a form

$\displaystyle u=A_{0}e^{\theta x}+B_{0}e^{\phi x}+U,$

or a form

$\displaystyle u=(A_{0}+B_{0}x)e^{\theta x}+U;$

thus the general solution can be written

$\displaystyle (A-A_{0})e^{\theta x}+(B-B_{0})e^{\theta x}+u,$ or $\displaystyle \{A-A_{0}+(B-B_{0})x\}e^{\theta x}+u,$

where A − A₀, B − B₀, like A, B, are arbitrary constants.

A similar result holds for a linear differential equation of any order, say

${\frac {d^{n}y}{dx^{n}}}+\mathrm {P} _{1}{\frac {d^{n-1}y}{dx^{n-1}}}+\ \ldots \ +\mathrm {P} _{n}=\mathrm {R} ,$

where P₁, P₂, . . . P_n are constants, and R is a function of x. If we form the algebraic equation θⁿ + P₁θⁿ⁻¹ + . . . + P_n = 0, and all the roots of this equation be different, say they are θ₁, θ₂, . . . θ_n, the general solution of the differential equation is

$\displaystyle y=A_{1}e^{\theta _{1}x}+A_{2}e^{\theta _{2}x}+\ ...\ +A_{n}e^{\theta _{n}x}+u,$

where A₁, A₂, . . . A_n are arbitrary constants, and u is any particular solution whatever; but if there be one root θ₁ repeated r times, the terms A₁ e^{θ₁ x} + ... + A_r e^{θ_r x} must be replaced by (A₁ + A₂x + ... + A_rx^r−1)e^{θ₁ x} where A₁, ... A_n are arbitrary constants; the remaining terms in the complementary function will similarly need alteration of form if there be other repeated roots.

To complete the solution of the differential equation we need some method of determining a particular integral u; we explain a procedure which is effective for this purpose in the cases in which R is a sum of terms of the form e^axφ(x), where φ(x) is an integral polynomial in x; this includes cases in which R contains terms of the form cos bx·φ(x) or sin bx·φ(x). Denote d/dx by D; it is clear that if u be any function of x, D(e^axu) = e^axDu + ae^axu, or say, D(e^axu) = e^ax(D + a)u; hence D²(e^axu), i.e. d ²/dx² (e^axu), being equal to D(e^axv), where v = (D + a)u, is equal to e^ax(D + a)v, that is to e^ax(D + a)²u. In this way we find Dⁿ(e^axu) = e^ax(D + a)ⁿu, where n is any positive integer. Hence if ψ(D) be any polynomial in D with constant coefficients, ψ(D) (e^axu) = e^axψ(D + a)u. Next, denoting ∫ udx by D⁻¹u, and any solution of the differential equation dz/dx + az = u by z = (d + a)⁻¹u, we have D[e^ax(D + a)⁻¹u] = D(e^axz) = e^ax(D + a)z = e^axu, so that we may write D⁻¹(e^axu) = e^ax(D + a)⁻¹u, where the meaning is that one value of the left side is equal to one value of the right side; from this, the expression D^-2(e^axu), which means D⁻¹[D⁻¹(e^axu)], is equal to D⁻¹(e^axz) and hence to e^ax(D + a)⁻¹z, which we write e^ax(D + a)^-2u; proceeding thus we obtain

D^-n(e^axu)＝e^ax(D + a)^-nu,

where n is any positive integer, and the meaning, as before, is that one value of the first expression is equal to one value of the second. More generally, if ψ(D) be any polynomial in D with constant coefficients, and we agree to denote by [1/ψ(D)]u any solution z of the differential equation ψ(D)z = u, we have, if v = [1/ψ(D + a)]u, the identity ψ(D)(e^axv) = e^axψ(D + a)v = e^axu, which we write in the form

${\frac {1}{\psi (\mathrm {D} )}}(e^{ax}u)=e^{ax}{\frac {1}{\psi (\mathrm {D} +a)}}u.$

This gives us the first step in the method we are explaining, namely that a solution of the differential equation ψ(D)y = e^axu + e^bxv + ... where u, v, ... are any functions of x, is any function denoted by the expression

$e^{ax}{\frac {1}{\psi (\mathrm {D} +a)}}u+e^{bx}{\frac {1}{\psi (\mathrm {D} +b)}}v+\ldots .$

It is now to be shown how to obtain one value of ${\frac {1}{\psi (\mathrm {D} +a)}}u$ , when u is a polynomial in x, namely one solution of the differential equation ψ(D + a)z = u. Let the highest power of x entering in u be x^m; if t were a variable quantity, the rational fraction in t, ${\frac {1}{\psi (t+a)}}u$ by first writing it as a sum of partial fractions, or otherwise, could be identically written in the form

K_rt^-r + K_r−1t^-r+1 + ... + K₁t⁻¹ + H + H₁t + ... + H_mt^m + t^m+1φ(t)/ψ(t + a),

where φ(t) is a polynomial in t; this shows that there exists an identity of the form

1＝ψ(t + a)(K_rt^−r + ... + K₁t⁻¹ + H + H₁t + ... + H_mt^m) + φ(t)t^m+1,

and hence an identity

u＝ψ(D + a) [K_rD^−r + ... + K₁D⁻¹ + H + H₁D + ... + H_mD^m] u + φ(D) D^m+1u;

in this, since u contains no power of x higher than x^m, the second term on the right may be omitted. We thus reach the conclusion that a solution of the differential equation ψ(D + a)z = u is given by

z＝(K_rD^−r + ... + K₁D⁻¹ + H + H₁D + ... + H_mD^m)u,

of which the operator on the right is obtained simply by expanding 1/ψ(D + a) in ascending powers of D, as if D were a numerical quantity, the expansion being carried as far as the highest power of D which, operating upon u, does not give zero. In this form every term in z is capable of immediate calculation.

Example.—For the equation

${\frac {d^{4}v}{dx^{4}}}+2{\frac {d^{2}y}{dx^{3}}}+y=x^{3}\cos x$ or $\displaystyle (\mathrm {D} ^{2}+1)^{2}=x^{3}\cos x,$

the roots of the associated algebraic equation (θ² + 1)² = 0 are θ = ±i, each repeated; the complementary function is thus

(A + Bx)e^ix + (C + Dx)e^−ix,

where A, B, C, D are arbitrary constants; this is the same as

(H + Kx) cos x + (M + Nx) sin x,

where H, K, M, N are arbitrary constants. To obtain a particular integral we must find a value of (1 + D²)−²x³ cos x; this is the real part of (1 + D²)−² e^ixx³ and hence of e^ix [1 + (D + i)²]−² x³ or

e^ix [2iD(1 + 1/2iD)]−² x³,

or

−1/4e^ix D−² (1 + iD − 3/4D² − 1/2iD³ + 5/16D⁴ + 3/16iD⁵ ...)x³,

or

−1/4e^ix (1/20x⁵ + 1/4ix⁴ − 3/4x³ − 3/2 ix² + 15/8 x + 9/8 i);

the real part of this is

−1/4 (1/20 x⁵ − 3/4x² + 15/8x) cos x + 1/4 (1/4x⁴ − 3/2x² + 9/8) sin x.

This expression added to the complementary function found above gives the complete integral; and no generality is lost by omitting from the particular integral the terms −15/32x cos x + 9/32 sin x, which are of the types of terms already occurring in the complementary function.

The symbolical method which has been explained has wider applications than that to which we have, for simplicity of explanation, restricted it. For example, if ψ(x) be any function of x, and a₁, a₂, ... a_n be different constants, and [(t + a₁) (t + a₂) ... (t + a_n)]⁻¹ when expressed in partial fractions be written Σc_m(t + a_m)⁻¹, a particular integral of the differential equation (D + a₁)(D + a₂) ... (D + a_n)y = ψ(x) is given by

y＝Σc_m(D + a_m)⁻¹ ψ(x)＝Σc_m (D + a_m)⁻¹ e^−am^xe^am^x ψ(x)＝Σc_me^−am^xD⁻¹ (e^am^xψ(x) )＝Σc_me^−am^x ∫ e^am^xψ(x)dx.

The particular integral is thus expressed as a sum of n integrals. A linear differential equation of which the left side has the form

$x^{n}{\frac {d^{n}y}{dx^{n}}}+\mathrm {P} _{1}x^{n-1}{\frac {d^{n-1}y}{dx^{n-1}}}+\ldots +\mathrm {P} _{n-1}x{\frac {dy}{dx}}+\mathrm {P} _{n}y,$

where P₁, ... P_n are constants, can be reduced to the case considered above. Writing x = e^t we have the identity

$x^{m}{\frac {d^{m}u}{dx^{m}}}=\theta (\theta -1)(\theta -2)\ldots (\theta -m+1)u,$ where θ＝d/dt

When the linear differential equation, which we take to be of the second order, has variable coefficients, though there is no general rule for obtaining a solution in finite terms, there are some results which it is of advantage to have in mind. We have seen that if one solution of the equation obtained by putting the right side zero, say y₁, be known, the equation can be solved. If y₂ be another solution of

${\frac {d^{2}y}{dx^{2}}}+\mathrm {P} {\frac {dy}{dx}}+\mathrm {Q} =0,$

there being no relation of the form my₁ + ny₂ = k, where m, n, k are constants, it is easy to see that

${\frac {d}{dx}}(y_{1}'y_{2}-y_{1}y_{2}')=\mathrm {P} (y_{1}'y_{2}-y_{1}y_{2}'),$

so that we have

y₁′y₂ − y₁y₂′＝A exp. (∫ Pdx),

where A is a suitably chosen constant, and exp. z denotes e^z. In terms of the two solutions y₁, y₂ of the differential equation having zero on the right side, the general solution of the equation with R = φ(x) on the right side can at once be verified to be Ay₁ + By₂ + y₁u − y₂v, where u, v respectively denote the integrals

u＝∫ y₂φ(x) (y₁′y₂ − y₂′y₁)⁻¹dx, v＝∫ y₁φ(x) (y₁′y₂ − y₂′y₁)⁻¹dx.

The equation

${\frac {d^{2}y}{dx^{2}}}+\mathrm {P} {\frac {dy}{dx}}+\mathrm {Q} =0,$

by writing y = v exp. (−1/2 ∫ Pdx), is at once seen to be reduced to d ²v/dx² + Iv = 0, where I = Q − 1/2dP/dx − 1/4P². If η = − 1/v dv/dx, the equation d ²v/dx² + Iv = 0 becomes dη/dx = I + η², a non-linear equation of the first order.

More generally the equation

${\frac {d\eta }{dx}}=\mathrm {A} +\mathrm {B} \eta +\mathrm {C} \eta ^{2}$

where A, B, C are functions of x, is, by the substitution

$\eta =-{\frac {1}{\mathrm {C} y}}{\frac {dy}{dx}}$

reduced to the linear equation

${\frac {d^{2}y}{dx^{2}}}-\left(\mathrm {B} +{\frac {1}{\mathrm {C} }}{\frac {d\mathrm {C} }{dx}}\right){\frac {dy}{dx}}+\mathrm {AC} =0.$

The equation

${\frac {d\eta }{dx}}=\mathrm {A} +\mathrm {B} \eta +\mathrm {C} \eta ^{2}$

known as Riccati’s equation, is transformed into an equation of the same form by a substitution of the form η = (aY + b)/(cY + d), where a, b, c, d are any functions of x, and this fact may be utilized to obtain a solution when A, B, C have special forms; in particular if any particular solution of the equation be known, say η₀, the substitution η = η₀ − 1/Y enables us at once to obtain the general solution; for instance, when

$2\mathrm {B} {=}{\frac {d}{dx}}\log \left({\frac {\mathrm {A} }{\mathrm {C} }}\right),$

a particular solution is η₀ = √(-A/C). This is a case of the remark, often useful in practice, that the linear equation

$\phi (x){\frac {d^{2}y}{dx^{2}}}+{\tfrac {1}{2}}{\frac {d\phi }{dx}}{\frac {dy}{dx}}+\mu y{=}0,$

where μ is a constant, is reducible to a standard form by taking a new independent variable z = ∫ dx[φ(x)]^-½.

We pass to other types of equations of which the solution can be obtained by rule. We may have cases in which there are two dependent variables, x and y, and one independent variable t, the differential coefficients dx/dt, dy/dt being given as functions of x, y and t. Of such equations a simple case is expressed by the pair

${\frac {dx}{dt}}{=}ax+by+c,{\frac {dy}{dt}}{=}a'x+b'y+c',$

wherein the coefficients a, b, c, a′, b′, c′, are constants. To integrate these, form with the constant λ the differential coefficient of z = x + λy, that is dz/dt = (a + λa′)x + (b + λb′)y + c + λc′, the quantity λ being so chosen that b + λb′ = λ(a + λa′), so that we have dz/dt = (a + λa′)z + c + λc′; this last equation is at once integrable in the form z(a + λa′) + c + λc′ = Ae^{(a + λa′)t}, where A is an arbitrary constant. In general, the condition b + λb′ = λ(a + λa′) is satisfied by two different values of λ, say λ₁, λ₂; the solutions corresponding to these give the values of x +λ₁y and x + λ₂y, from which x and y can be found as functions of t, involving two arbitrary constants. If, however, the two roots of the quadratic equation for λ are equal, that is, if (a − b′)² + 4a′b = 0, the method described gives only one equation, expressing x + λy in terms of t; by means of this equation y can be eliminated from dx/dt = ax + by + c, leading to an equation of the form dx/dt = Px + Q + Re^{(a + λa′)t}, where P, Q, R are constants. The integration of this gives x, and thence y can be found.

A similar process is applicable when we have three or more dependent variables whose differential coefficients in regard to the single independent variables are given as linear functions of the dependent variables with constant coefficients.

Another method of solution of the equations

dx/dt = ax + by + c, dy/dt = a′x + b′y + c′,

consists in differentiating the first equation, thereby obtaining

${\frac {d^{2}x}{dt^{2}}}{=}a{\frac {dx}{dt}}+b{\frac {dy}{dx}};$

from the two given equations, by elimination of y, we can express dy/dt as a linear function of x and dx/dt; we can thus form an equation of the shape d²x/dt² = P + Qx + Rdx/dt, where P, Q, R are constants; this can be integrated by methods previously explained, and the integral, involving two arbitrary constants, gives, by the equation dx/dt = ax + by + c, the corresponding value of y. Conversely it should be noticed that any single linear differential equation

${\frac {d^{2}x}{dt^{2}}}{=}u+vx+w{\frac {dx}{dt}},$

where u, v, w are functions of t, by writing y for dx/dt, is equivalent with the two equations dx/dt = y, dy/dt = u + vx + wy. In fact a similar reduction is possible for any system of differential equations with one independent variable.

Equations occur to be integrated of the form

Xdx + Ydy + Zdz = 0,

where X, Y, Z are functions of x, y, z. We consider only the case in which there exists an equation φ(x, y, z) = C whose differential

${\frac {\partial \phi }{\partial x}}dx+{\frac {\partial \phi }{\partial y}}dy+{\frac {\partial \phi }{\partial z}}dz{=}0$

is equivalent with the given differential equation; that is, μ being a proper function of x, y, z, we assume that there exist equations

${\frac {\partial \phi }{\partial x}}{=}\mu \mathrm {X} ,{\frac {\partial \phi }{\partial y}}{=}\mu \mathrm {Y} ,{\frac {\partial \phi }{\partial z}}{=}\mu \mathrm {Z}$

these equations require

${\frac {\partial }{\partial z}}(\mu \mathrm {Y} ){=}{\frac {\partial }{\partial y}}(\mu \mathrm {Z} ),$ &c.,

and hence

$\mathrm {X} \left({\frac {\partial \mathrm {Z} }{\partial y}}-{\frac {\partial \mathrm {Y} }{\partial z}}\right)+\mathrm {Y} \left({\frac {\partial \mathrm {X} }{\partial z}}-{\frac {\partial \mathrm {Z} }{\partial x}}\right)+\mathrm {Z} \left({\frac {\partial \mathrm {Y} }{\partial x}}-{\frac {\partial \mathrm {X} }{\partial y}}\right){=}0;$

conversely it can be proved that this is sufficient in order that μ may exist to render μ(Xdx + Ydy + Zdz) a perfect differential; in particular it may be satisfied in virtue of the three equations such as

${\frac {\partial \mathrm {Z} }{\partial y}}-{\frac {\partial \mathrm {Y} }{\partial z}}{=}0$

in which case we may take μ = 1. Assuming the condition in its general form, take in the given differential equation a plane section of the surface φ = C parallel to the plane z, viz. put z constant, and consider the resulting differential equation in the two variables x, y, namely Xdx + Ydy = 0; let ψ(x, y, z) = constant, be its integral, the constant z entering, as a rule, in ψ because it enters in X and Y. Now differentiate the relation ψ(x, y, z) = ƒ(z), where ƒ is a function to be determined, so obtaining

${\frac {\partial \psi }{\partial x}}dx+{\frac {\partial \psi }{\partial y}}dy+\left({\frac {\partial \psi }{\partial z}}-{\frac {df}{dz}}\right)dz{=}0;$

there exists a function σ of x, y, z such that

${\frac {\partial \psi }{\partial x}}{=}\sigma \mathrm {X} {\frac {\partial \psi }{\partial y}}{=}\sigma \mathrm {Y}$

because ψ = constant, is the integral of Xdx + Ydy = 0; we desire to prove that ƒ can be chosen so that also, in virtue of ψ(x, y, z) = ƒ(z), we have

${\frac {\partial \psi }{\partial z}}-{\frac {df}{dz}}{=}\sigma \mathrm {Z} ,$ namely ${\frac {df}{dz}}{=}{\frac {\partial \psi }{\partial z}}-\sigma \mathrm {Z} ;$

if this can be proved the relation ψ(x, y, z) − ƒ(z) = constant, will be the integral of the given differential equation. To prove this it is enough to show that, in virtue of ψ(x, y, z) = ƒ(z), the function ∂ψ/∂x − σZ can be expressed in terms of z only. Now in consequence of the originally assumed relations,

${\frac {\partial \psi }{\partial x}}{=}\mu \mathrm {X} ,{\frac {\partial \phi }{\partial y}}{=}\mu \mathrm {Y} ,{\frac {\partial \phi }{\partial z}}{=}\mu \mathrm {Z}$

we have

${\frac {\partial \psi }{\partial x}}\left/{\frac {\partial \phi }{\partial x}}\right.{=}{\frac {\sigma }{\mu }}{=}{\frac {\partial \psi }{\partial y}}\left/{\frac {\partial \phi }{\partial y}}\right.,$

and hence

${\frac {\partial \psi }{\partial x}}{\frac {\partial \phi }{\partial y}}-{\frac {\partial \psi }{\partial y}}{\frac {\partial \phi }{\partial x}}{=}0;$

this shows that, as functions of x and y, ψ is a function of φ (see the note at the end of part i. of this article, on Jacobian determinants), so that we may write ψ = F(z, φ), from which

${\frac {\sigma }{\mu }}{=}{\frac {\partial \mathrm {F} }{\partial \phi }};$ then ${\frac {\partial \psi }{\partial z}}{=}{\frac {\partial \mathrm {F} }{\partial z}}+{\frac {\partial \mathrm {F} }{\partial \phi }}{\frac {\partial \phi }{\partial z}}{=}{\frac {\partial \mathrm {F} }{\partial z}}+{\frac {\sigma }{\mu }}\cdot \mu \mathrm {Z} {=}{\frac {\partial \mathrm {F} }{\partial z}}+\sigma \mathrm {Z}$ or ${\frac {\partial \psi }{\partial z}}-\sigma \mathrm {Z} {=}{\frac {\partial \mathrm {F} }{\partial z}};$

in virtue of ψ(x, y, z) = ƒ(z), and ψ = F(z, φ), the function φ can be written in terms of z only, thus ∂F/∂z can be written in terms of z only, and what we required to prove is proved.

Consider lastly a simple type of differential equation containing two independent variables, say x and y, and one dependent variable z, namely the equation

$\mathrm {P} {\frac {\partial z}{\partial x}}+\mathrm {Q} {\frac {\partial z}{\partial y}}{=}\mathrm {R} ,$

where P, Q, R are functions of x, y, z. This is known as Lagrange’s linear partial differential equation of the first order. To integrate this, consider first the ordinary differential equations dx/dz = P/R, dy/dz = Q/R, and suppose that two functions u, v, of x, y, z can be determined, independent of one another, such that the equations u = a, v = b, where a, b are arbitrary constants, lead to these ordinary differential equations, namely such that

$\mathrm {P} {\frac {\partial u}{\partial x}}+\mathrm {Q} {\frac {\partial u}{\partial y}}+\mathrm {R} {\frac {\partial u}{\partial z}}{=}0$ and $\mathrm {P} {\frac {\partial v}{\partial x}}+\mathrm {Q} {\frac {\partial v}{\partial y}}+\mathrm {R} {\frac {\partial v}{\partial z}}{=}0.$

Then if F(x, y, z) = 0 be a relation satisfying the original differential equations, this relation giving rise to

${\frac {\partial \mathrm {F} }{\partial x}}+{\frac {\partial \mathrm {F} }{\partial z}}{\frac {\partial z}{\partial x}}{=}0$ and ${\frac {\partial \mathrm {F} }{\partial y}}+{\frac {\partial \mathrm {F} }{\partial z}}{\frac {\partial z}{\partial y}}{=}0,$ we have $\mathrm {P} {\frac {\partial \mathrm {F} }{\partial x}}+\mathrm {Q} {\frac {\partial \mathrm {F} }{\partial y}}+\mathrm {R} {\frac {\partial \mathrm {F} }{\partial z}}{=}0.$

It follows that the determinant of three rows and columns vanishes whose first row consists of the three quantities ∂F/∂x, ∂F/∂y, ∂F/∂z, whose second row consists of the three quantities ∂u/∂x, ∂u/∂y, ∂u/∂z, whose third row consists similarly of the partial derivatives of v. The vanishing of this so-called Jacobian determinant is known to imply that F is expressible as a function of u and v, unless these are themselves functionally related, which is contrary to hypothesis (see the note below on Jacobian determinants). Conversely, any relation φ(u, v) = 0 can easily be proved, in virtue of the equations satisfied by u and v, to lead to

$\mathrm {P} {\frac {dz}{dx}}+\mathrm {Q} {\frac {dz}{dy}}{=}\mathrm {R} .$

The solution of this partial equation is thus reduced to the solution of the two ordinary differential equations expressed by dx/P = dy/Q = dz/R. In regard to this problem one remark may be made which is often of use in practice: when one equation u = a has been found to satisfy the differential equations, we may utilize this to obtain the second equation v = b; for instance, we may, by means of u = a, eliminate z—when then from the resulting equations in x and y a relation v = b has been found containing x and y and a, the substitution a = u will give a relation involving x, y, z.

Note on Jacobian Determinants.—The fact assumed above that the vanishing of the Jacobian determinant whose elements are the partial derivatives of three functions F, u, v, of three variables x, y, z, involves that there exists a functional relation connecting the three functions F, u, v, may be proved somewhat roughly as follows:—

The corresponding theorem is true for any number of variables. Consider first the case of two functions p, q, of two variables x, y. The function p, not being constant, must contain one of the variables, say x; we can then suppose x expressed in terms of y and the function p; thus the function q can be expressed in terms of y and the function p, say q = Q(p, y). This is clear enough in the simplest cases which arise, when the functions are rational. Hence we have

${\frac {\partial q}{\partial x}}{=}{\frac {\partial \mathrm {Q} }{\partial p}}{\frac {\partial p}{\partial x}}$ and ${\frac {\partial q}{\partial y}}{=}{\frac {\partial \mathrm {Q} }{\partial p}}{\frac {\partial p}{\partial y}}+{\frac {\partial \mathrm {Q} }{\partial y}};$

these give

${\frac {\partial p}{\partial x}}{\frac {\partial q}{\partial y}}-{\frac {\partial p}{\partial y}}{\frac {\partial q}{\partial x}}{=}{\frac {\partial p}{\partial x}}{\frac {\partial \mathrm {Q} }{\partial y}};$

by hypothesis ∂p/∂x is not identically zero; therefore if the Jacobian determinant of p and q in regard to x and y is zero identically, so is ∂Q/∂y, or Q does not contain y, so that q is expressible as a function of p only. Conversely, such an expression can be seen at once to make the Jacobian of p and q vanish identically.

Passing now to the case of three variables, suppose that the Jacobian determinant of the three functions F, u, v in regard to x, y, z is identically zero. We prove that if u, v are not themselves functionally connected, F is expressible as a function of u and v. Suppose first that the minors of the elements of ∂F/∂x, ∂F/∂y, ∂F/∂z in the determinant are all identically zero, namely the three determinants such as

${\frac {\partial u}{\partial y}}{\frac {\partial v}{\partial z}}-{\frac {\partial u}{\partial z}}{\frac {\partial v}{\partial y}};$

then by the case of two variables considered above there exist three functional relations. ψ₁(u, v, x) = 0, ψ₂(u, v, y) = 0, ψ₃(u, v, z) = 0, of which the first, for example, follows from the vanishing of

${\frac {\partial u}{\partial y}}{\frac {\partial v}{\partial z}}-{\frac {\partial u}{\partial z}}{\frac {\partial v}{\partial y}}.$

We cannot assume that x is absent from ψ₁, or y from ψ₂, or z from ψ₃; but conversely we cannot simultaneously have x entering in ψ₁, and y in ψ₂, and z in ψ₃, or else by elimination of u and v from the three equations ψ₁ = 0, ψ₂ = 0, ψ₃ = 0, we should find a necessary relation connecting the three independent quantities x, y, z; which is absurd. Thus when the three minors of ∂F/∂x, ∂F/∂y, ∂F/∂z in the Jacobian determinant are all zero, there exists a functional relation connecting u and v only. Suppose no such relation to exist; we can then suppose, for example, that

${\frac {\partial u}{\partial y}}{\frac {\partial v}{\partial z}}-{\frac {\partial u}{\partial z}}{\frac {\partial v}{\partial y}}$

is not zero. Then from the equations u(x, y, z) = u, v(x, y, z) = v we can express y and z in terms of u, v, and x (the attempt to do this could only fail by leading to a relation connecting u, v and x, and the existence of such a relation would involve that the determinant

${\frac {\partial u}{\partial y}}{\frac {\partial v}{\partial z}}-{\frac {\partial u}{\partial z}}{\frac {\partial v}{\partial y}}$

was zero), and so write F in the form F(x, y, z) = Φ(u, v, x). We then have

${\frac {\partial \mathrm {F} }{\partial x}}{=}{\frac {\partial \Phi }{\partial u}}{\frac {\partial u}{\partial x}}+{\frac {\partial \Phi }{\partial v}}{\frac {\partial v}{\partial x}}+{\frac {\partial \Phi }{\partial x}},{\frac {\partial \mathrm {F} }{\partial y}}{=}{\frac {\partial \Phi }{\partial u}}{\frac {\partial u}{\partial y}}+{\frac {\partial \Phi }{\partial v}}{\frac {\partial v}{\partial y}},{\frac {\partial \mathrm {F} }{\partial z}}{=}{\frac {\partial \Phi }{\partial u}}{\frac {\partial u}{\partial z}}+{\frac {\partial \Phi }{\partial v}}{\frac {\partial v}{\partial z}};$

thereby the Jacobian determinant of F, u, v is reduced to

${\frac {\partial \Phi }{\partial x}}\left({\frac {\partial u}{\partial y}}{\frac {\partial v}{\partial z}}-{\frac {\partial u}{\partial z}}{\frac {\partial v}{\partial y}}\right);$

by hypothesis the second factor of this does not vanish identically; hence ∂Φ/∂x = 0 identically, and Φ does not contain x; so that F is expressible in terms of u, v only; as was to be proved.

Part II.—General Theory.

Differential equations arise in the expression of the relations between quantities by the elimination of details, either unknown or regarded as unessential to the formulation of the relations in question. They give rise, therefore, to the two closely connected problems of determining what arrangement of details is consistent with them, and of developing, apart from these details, the general properties expressed by them. Very roughly, two methods of study can be distinguished, with the names Transformation-theories, Function-theories; the former is concerned with the reduction of the algebraical relations to the fewest and simplest forms, eventually with the hope of obtaining explicit expressions of the dependent variables in terms of the independent variables; the latter is concerned with the determination of the general descriptive relations among the quantities which are involved by the differential equations, with as little use of algebraical calculations as may be possible. Under the former heading we may, with the assumption of a few theorems belonging to the latter, arrange the theory of partial differential equations and Pfaff’s problem, with their geometrical interpretations, as at present developed, and the applications of Lie’s theory of transformation-groups to partial and to ordinary equations; under the latter, the study of linear differential equations in the manner initiated by Riemann, the applications of discontinuous groups, the theory of the singularities of integrals, and the study of potential equations with existence-theorems arising therefrom. In order to be clear we shall enter into some detail in regard to partial differential equations of the first order, both those which are linear in any number of variables and those not linear in two independent variables, and also in regard to the function-theory of linear differential equations of the second order. Space renders impossible anything further than the briefest account of many other matters; in particular, the theories of partial equations of higher than the first order, the function-theory of the singularities of ordinary equations not linear and the applications to differential geometry, are taken account of only in the bibliography. It is believed that on the whole the article will be more useful to the reader than if explanations of method had been further curtailed to include more facts.

When we speak of a function without qualification, it is to be understood that in the immediate neighbourhood of a particular set x₀, y₀, ... of values of the independent variables x, y, ... of the function, at whatever point of the range of values for x, y, ... under consideration x₀, y₀, ... may be chosen, the function can be expressed as a series of positive integral powers of the differences x − x₀, y − y₀, ..., convergent when these are sufficiently small (see Function: Functions of Complex Variables). Without this condition, which we express by saying that the function is developable about x₀, y₀, ..., many results provisionally stated in the transformation theories would be unmeaning or incorrect. If, then, we have a set of k functions, ƒ₁ ... ƒ_k of n independent variables x₁ ... x_n, we say that they are independent when n ≥ k and not every determinant of k rows and columns vanishes of the matrix of k rows and n columns whose r-th row has the constituents dƒ_r/dx₁, ... dƒ_r/dx_n; the justification being in the theorem, which we assume, that if the determinant involving, for instance, the first k columns be not zero for x₁ = xº₁ ... x_n = xº_n, and the functions be developable about this point, then from the equations ƒ₁ = c₁, ... ƒ_k = c_k we can express x₁, ... x_k by convergent power series in the differences x_k+1 − xº_k+1, ... x_n − x_nº, and so regard x₁, ... x_k as functions of the remaining variables. This we often express by saying that the equations ƒ₁ = c₁, ... ƒ_k = c_k can be solved for x₁, ... x_k. The explanation is given as a type of explanation often understood in what follows.

We may conveniently begin by stating the theorem: If each of the n functions φ₁, ... φ_n of the (n + 1) variables x₁, ... x_nt be developable Ordinary equations of the first order. about the values xº₁, ... x_n⁰t⁰, the n differential equations of the form dx₁/dt = φ₁(tx₁, ... x_n) are satisfied by convergent power series

x_r = xº_r + (t − t⁰) A_r1 + (t − t₀)² A_r2 + ...

reducing respectively to xº₁, ... xº_n when t = t⁰; and the only functions satisfying the equations and reducing respectively to xº₁, ... xº_n when t = t⁰, are those determined by continuation of these series. If the result of solving these n equations for xº₁, ... xº_n be written in the form ω₁(x₁, ... x_nt) = xº₁, ... ω_n(x₁, ... x_nt) = xº_n, Single homogeneous partial equation of the first order. it is at once evident that the differential equation

dƒ/dt + φ₁dƒ/dx₁ + ... + φ_ndƒ/dx_n = 0

possesses n integrals, namely, the functions ω₁, ... ω_n, which are developable about the values (xº₁ ... x_n⁰t⁰) and reduce respectively to x₁, ... x_n when t = t⁰. And in fact it has no other integrals so reducing. Thus this equation also possesses a unique integral reducing when t = t⁰ to an arbitrary function ψ(x₁, ... x_n), this integral being. ψ(ω₁, ... ω_n). Conversely the existence of these principal integrals ω₁, ... ω_n of the partial equation establishes the existence of the specified solutions of the ordinary equations dx_i/dt = φ_i. The following sketch of the proof of the existence of these principal integrals for the case n = 2 will show the character of more general investigations. Put x for x − x⁰, &c., and consider the equation a(xyt) dƒ/dx + b(xyt) dƒ/dy = dƒ/dt, wherein the functions a, b are developable about x = 0, y = 0, t = 0; say

a(xyt) = a₀ + ta₁ + t²a₂/2! + ..., b(xyt) = b₀ + tb₁ + t²b₂/2! + ...,

so that

ad/dx + bd/dy = δ₀ + tδ₁ + ½t²δ₂ + ...,

where δ = a_rd/dx + b_rd/dy. In order that

ƒ = p₀ + tp₁ + t²p₂/2! + ...

wherein p₀, p₁ . . . are power series in x, y, should satisfy the equation, it is necessary, as we find by equating like terms, that

p₁ = δ₀p₀, p₂ = δ₀p₁ + δ₁p₀, &c.

and in generalProof of the existence of integrals.

p_s+1 = δ₀p_s + s₁δ₁p_s−1 + s₂δ₂p_s−2 +... + δ_sp₀,

where

s_r = (s!)/(r!) (s − r)!

Now compare with the given equation another equation

A(xyt)dF/dx + B(xyt)dF/dy = dF/dt,

wherein each coefficient in the expansion of either A or B is real and positive, and not less than the absolute value of the corresponding coefficient in the expansion of a or b. In the second equation let us substitute a series

F = P₀ + tP₁ + t²P₂/2! + ...,

wherein the coefficients in P₀ are real and positive, and each not less than the absolute value of the corresponding coefficient in p₀; then putting Δ_r = A_rd/dx + B_rd/dy we obtain necessary equations of the same form as before, namely,

P₁ = Δ₀P₀, P₂ = Δ₀P₁ + Δ₁P₀, ...

and in general P_s+1 = Δ₀P_s, + s₁Δ₁P_s−1 + ... + Δ_sP₀. These give for every coefficient in P_s+1 an integral aggregate with real positive coefficients of the coefficients in P_s, P_s−1, ..., P₀ and the coefficients in A and B; and they are the same aggregates as would be given by the previously obtained equations for the corresponding coefficients in p_s+1 in terms of the coefficients in p_s, p_s−1, ..., p₀ and the coefficients in a and b. Hence as the coefficients in P₀ and also in A, B are real and positive, it follows that the values obtained in succession for the coefficients in P₁, P₂, ... are real and positive; and further, taking account of the fact that the absolute value of a sum of terms is not greater than the sum of the absolute values of the terms, it follows, for each value of s, that every coefficient in p_s+1 is, in absolute value, not greater than the corresponding coefficient in P_s+1. Thus if the series for F be convergent, the series for ƒ will also be; and we are thus reduced to (1), specifying functions A, B with real positive coefficients, each in absolute value not less than the corresponding coefficient in a, b; (2) proving that the equation

AdF/dx + BdF/dy = dF/dt

possesses an integral P₀ + tP₁ + t²P₂/2! + ... in which the coefficients in P₀ are real and positive, and each not less than the absolute value of the corresponding coefficient in p₀. If a, b be developable for x, y both in absolute value less than r and for t less in absolute value than R, and for such values a, b be both less in absolute value than the real positive constant M, it is not difficult to verify that we may take A = B = M[1 − (x + y)/r]⁻¹ (1 − t/R)⁻¹, and obtain

$\mathrm {F} =r-(r-x-y)\left[1-{\frac {4\mathrm {MR} }{r}}\left(1-{\frac {x+y}{r}}\right)^{2}\log \left(1-{\frac {t}{\mathrm {R} }}\right)^{-1}\right]^{\frac {1}{2}},$

and that this solves the problem when x, y, t are sufficiently small for the two cases p₀ = x, p₀ = y. One obvious application of the general theorem is to the proof of the existence of an integral of an ordinary linear differential equation given by the n equations dy/dx = y₁, dy₁/dx = y₂, ...,

dy_n−1/dx = p − p₁y_n−1 − ... − p_ny;

but in fact any simultaneous system of ordinary equations is reducible to a system of the form

dx_i/dt = φ_i(tx₁, ... x_n).

Suppose we have k homogeneous linear partial equations of the first order in n independent variables, the general equation being a_σ1dƒ/dx₁ + ... + a_σndƒ/dx_n = 0, where σ = 1, ... k, and that Simultaneous linear partial equations. we desire to know whether the equations have common solutions, and if so, how many. It is to be understood that the equations are linearly independent, which implies that k ≤ n and not every determinant of k rows and columns is identically zero in the matrix in which the i-th element of the σ-th row is a_σi}(i = 1, ... n, σ = 1, ... k). Denoting the left side of the σ-th equation by Pσƒ, it is clear that every common solution of the two equations P_σƒ = 0, P_ρƒ = 0, is also a solution of the equation P_ρ(p_σƒ), P_σ(p_ρƒ), We immediately find, however, that this is also a linear equation, namely, ΣH_idƒ/dx_i = 0 where H_i = P_ρa_σ − P_σa_ρ, and if it be not already contained among the given equations, or be linearly deducible from them, it may be added to them, as not introducing any additional limitation of the possibility of their having common solutions. Proceeding thus with every pair of the original equations, and then with every pair of the possibly augmented system so obtained, and so on continually, we shall arrive at a system of equations, linearly independent of each other and therefore not more than n in number, such that the combination, in the way described, of every pair of them, leads to an equation which is linearly deducible from them. If the number of this so-called complete system is n, the equations give dƒ/dx₁ = 0 ... dƒ/dx_n = 0, leading to the nugatory result ƒ = a constant. Suppose, then, the number of this system to be r < n; suppose, further, that from the Complete systems of linear partial equations. matrix of the coefficients a determinant of r rows and columns not vanishing identically is that formed by the coefficients of the differential coefficients of ƒ in regard to x₁ ... x_r; also that the coefficients are all developable about the values x₁ = xº₁, ... x_n= xº_n, and that for these values the determinant just spoken of is not zero. Then the main theorem is that the complete system of r equations, and therefore the originally given set of k equations, have in common n − r solutions, say ω_r+1, ... ω_n, which reduce respectively to x_r+1, ... x_n when in them for x₁, ... x_r are respectively put xº₁, ... xº_r; so that also the equations have in common a solution reducing when x₁ = xº₁, ... x_r = xº_r to an arbitrary function ψ(x_r+1, ... x_n) which is developable about xº_r+1, ... xº_n, namely, this common solution is ψ(ω_r+1, ... ω_n). It is seen at once that this result is a generalization of the theorem for r = 1, and its proof is conveniently given by induction from that case. It can be verified without difficulty (1) that if from the r equations of the complete system we form r independent linear aggregates, with coefficients not necessarily constants, the new system is also a complete system; (2) that if in place of the independent variables x₁, ... x_n we introduce any other variables which are independent functions of the former, the new equations also form a complete system. It is convenient, then, from the complete system of r equations to form r new equations by solving separately for dƒ/dx₁, ..., dƒ/dx_r; suppose the general equation of the new system to be

Q_σƒ = dƒ/dx_σ + c_σjr+1dƒ/dx_r+1 + ... + c_σndƒ/dx_n = 0 (σ = 1, ... r).

Then it is easily obvious that the equation Q_ρQ_σƒ − Q_σQ_ρƒ = 0 contains only the differential coefficients of ƒ in regard to x_r+1 ... x_n; as it is at most a linear function of Q₁ƒ, ... Q_rƒ, it must be identically zero. So reduced the system is called a Jacobian system. Of this system Q₁ƒ=0 has n − 1 principal solutions reducing respectively Jacobian systems. to x₂, ... x_n when

x₁ = xº₁,

and its form shows that of these the first r − 1 are exactly x₂ ... x_r. Let these n − 1 functions together with x₁ be introduced as n new independent variables in all the r equations. Since the first equation is satisfied by n − 1 of the new independent variables, it will contain no differential coefficients in regard to them, and will reduce therefore simply to dƒ/dx₁ = 0, expressing that any common solution of the r equations is a function only of the n − 1 remaining variables. Thereby the investigation of the common solutions is reduced to the same problem for r − 1 equations in n − 1 variables. Proceeding thus, we reach at length one equation in n − r + 1 variables, from which, by retracing the analysis, the proposition stated is seen to follow.

The analogy with the case of one equation is, however, still closer. With the coefficients c_σj, of the equations Q_σƒ = 0 in transposed array (σ = 1, ... r, j = r + 1, ... n) we can put down the (n − r) equations, dx_j = c_1jdx₁ + ... + c_rjdx_r, equivalent to System of total differential equations. the r(n − r) equations dx_j/dx_σ = c_σr. That consistent with them we may be able to regard x_r+1, ... x_n as functions of x₁, ... x_r, these being regarded as independent variables, it is clearly necessary that when we differentiate c_σj in regard to x_ρ on this hypothesis the result should be the same as when we differentiate c_ρj, in regard to x_σ on this hypothesis. The differential coefficient of a function ƒ of x₁, ... x_n on this hypothesis, in regard to x_ρj is, however,

dƒ/dx_ρ + c_ρjr+1dƒ/dx_r+1 + ... + c_ρndƒ/dx_n,

namely, is Q_ρƒ. Thus the consistence of the n − r total equations requires the conditions Q_ρc_σj − Q_σc_ρj = 0, which are, however, verified in virtue of Q_ρ(Q_σƒ) − Q_σ(Q_ρƒ) = 0. And it can in fact be easily verified that if ω_r+1, ... ω_n be the principal solutions of the Jacobian system, Q_σƒ = 0, reducing respectively to x_r+1, ... x_n when x₁ = xº₁, ... x_r = xº_r, and the equations ω_r+1 = x⁰_r+1, ... ω_n = xº_n be solved for x_r+1, ... x_n to give x_j = ψ_j(x₁, ... x_r, x⁰_r+1, ... xº_n), these values solve the total equations and reduce respectively to x⁰_r+1, ... xº_n when x₁ = xº₁ ... x_r = xº_r. And the total equations have no other solutions with these initial values. Conversely, the existence of these solutions of the total equations can be deduced a priori and the theory of the Jacobian system based upon them. The theory of such total equations, in general, finds its natural place under the heading Pfaffian Expressions, below.

A practical method of reducing the solution of the r equations of a Jacobian system to that of a single equation in n − r + 1 variables may be explained in connexion with a geometrical interpretation which will perhaps be clearer in a particular Geometrical interpretation and solution. case, say n = 3, r = 2. There is then only one total equation, say dz = adz + bdy; if we do not take account of the condition of integrability, which is in this case da/dy + bda/dz = db/dx + adb/dz, this equation may be regarded as defining through an arbitrary point (x₀, y₀, z₀) of three-dimensioned space (about which a, b are developable) a plane, namely, z − z₀ = a₀(x − x₀) + b₀(y − y₀), and therefore, through this arbitrary point ∞² directions, namely, all those in the plane. If now there be a surface z = ψ(x, y), satisfying dz = adz + bdy and passing through (x₀, y₀, z₀), this plane will touch the surface, and the operations of passing along the surface from (x₀, y₀, z₀) to

(x₀ + dx₀, y₀, z₀ + dz₀)

and then to (x₀ + dx₀, y₀ + dy₀, Z₀ + d¹z₀), ought to lead to the same value of d¹z₀ as do the operations of passing along the surface from (x₀, y₀, z₀) to (x₀, y₀ + dy₀, z₀ + δz₀), and then to

(x₀ + dx₀, y₀ + dy₀, z₀ + δ¹z₀),

namely, δ¹z₀ ought to be equal to d¹z₀. But we find

$a_{0}dx_{0}+b_{0}dy_{0}+dx_{0}dy_{0}\left({\frac {db}{dx_{0}}}+a_{0}{\frac {db}{dz_{0}}}\right),$

and so at once reach the condition of integrability. If now we put x = x_o + t, y = y_o + mt, and regard m as constant, we shall in fact be considering the section of the surface by a fixed plane y−y_o = m(x−x_o); along this section dz = dt(a + bm); if we then integrate the equation dx/dt = a + bm, where a, b are expressed as functions of m and t, with m kept constant, finding the solution which reduces to z_o for t = 0, and in the result again replace m by (y−y_o)/(x−x_o), we shall have the surface in question. In the general case the equations

dx_j＝c_1j dx₁ + . .c_rjdx_r

similarly determine through an arbitrary point x₁ᵒ, . . . x_nᵒ Mayer’s method
of integration. a planar manifold of r dimensions in space of n dimensions, and when the conditions of integrability are satisfied, every direction in this manifold through this point is tangent to the manifold of r dimensions, expressed by ω_r+1 = x⁰_r+1, . . . ω_n = x_nᵒ, which satisfies the equations and passes through this point. If we put x₁−x₁ᵒ = t, x₂−x₂ᵒ = m₂t, ... x_r−x_rᵒ = m_rt, and regard m₂, ... m_r as fixed, the (n−r) total equations take the form dx_j /dt = c_1j + m₂c_2j + ... + m_rc_rj, and their integration is equivalent to that of the single partial equation

${df}/{dt}+\sum _{j=r+1}^{n}(c_{1j}+m_{2}c_{2j}+\ldots +m_{r}c_{rj}){df}/{dx}_{j}=0$

in the n−r + 1 variables t, x_r+1, ... x_n. Determining the solutions Ω_r+1, ... Ω_n which reduce to respectively x_r+1, ... x_n when t = 0, and substituting t = x₁−x₁ᵒ, m₂ = (x₂−x₂ᵒ)/(x₁−x₁ᵒ), ... m_r = (x_r−x_rᵒ)/(x₁−x₁ᵒ), we obtain the solutions of the original system of partial equations previously denoted by ω_r+1, . . . ω_n. It is to be remarked, however, that the presence of the fixed parameters m₂, ... m_r in the single integration may frequently render it more difficult than if they were assigned numerical quantities.

We have above considered the integration of an equation

dz＝adz + bdy

on the hypothesis that the condition

da/dy + bda/dz＝db/dz + adb/dz.

It is natural to inquire what relations among x, y, z, if any, are implied by, or are consistent with, a differential relation adx + bdy + cdx = 0, when a, b, c are unrestricted functions of x, y, z. This problem leads to the consideration of the so-called Pfaffian Expression adx + bdy + cdz. It can be shown (1) if each of the Pfaffian Expressions.quantities db/dz−dc/dy, dc/dx−da/dz, da/dy−db/dz, which we shall denote respectively by u₂₃, u₃₁, u₁₂, be identically zero, the expression is the differential of a function of x, y, z, equal to dt say; (2) that if the quantity au₂₃ + bu₃₁ + cu₁₂ is identically zero, the expression is of the form udt, i.e. it can be made a perfect differential by multiplication by the factor 1/u; (3) that in general the expression is of the form dt + u₁dt₁. Consider the matrix of four rows and three columns, in which the elements of the first row are a, b, c, and the elements of the (r + 1)-th row, for r = 1, 2, 3, are the quantities u_r1, u_r2, u_r3, where u₁₁ = u₂₂ = u₃₃ = 0. Then it is easily seen that the cases (1), (2), (3) above correspond respectively to the cases when (1) every determinant of this matrix of two rows and columns is zero, (2) every determinant of three rows and columns is zero, (3) when no condition is assumed. This result can be generalized as follows: if a₁, ... a_n be any functions of x₁, ... x_n, the so-called Pfaffian expression a₁dx₁ + ... + a_ndx_n can be reduced to one or other of the two forms

u₁dt₁ + ... + u_kdt_k, dt + u₁dt₁ + ... + u_k−1dt_k−1,

wherein t, u₁ ..., t₁, ... are independent functions of x₁, ... x_n, and k is such that in these two cases respectively 2k or 2k−1 is the rank of a certain matrix of n + 1 rows and n columns, that is, the greatest number of rows and columns in a non-vanishing determinant of the matrix; the matrix is that whose first row is constituted by the quantities a₁, ... a_n, whose s-th element in the (r + 1)-th row is the quantity da_r/dx_s−da_s/dx_r. The proof of such a reduced form can be obtained from the two results: (1) If t be any given function of the 2m independent variables u₁, ... u_m, t₁, ... t_m, the expression dt + u₁dt₁ + ... + u_mdt_m can be put into the form u′₁dt′₁ + ... + u′_mdt′_m. (2) If the quantities u₁, ..., u₁, t₁, ... t_m be connected by a relation, the expression n₁dt₁ + ... + u_mdt_m can be put into the format dt′ + u′₁dt′₁ + ... + u′_m−1dt′_m−1; and if the relation connecting u₁, u_m, t₁, ... t_m be homogeneous in u₁, ... u_m, then t′ can be taken to be zero. These two results are deductions from the theory of contact transformations (see below), and their demonstration requires, beside elementary algebraical considerations, only the theory of complete systems of linear homogeneous partial differential equations of the first order. When the existence of the reduced form of the Pfaffian expression containing only independent quantities is thus once assured, the identification of the number k with that defined by the specified matrix may, with some difficulty, be made a posteriori.

In all cases of a single Pfaffian equation we are thus led to consider what is implied by a relation dt−u₁dt₁−. . .−u_mdt_m = 0, in which t, u₁, . . . u_m, t₁ . . ., t_m are, except for this equation, independent variables. This is to be satisfied in virtue of Single linear
Pfaffian equation. one or several relations connecting the variables; these must involve relations connecting t, t₁, . . . t_m only, and in one of these at least t must actually enter. We can then suppose that in one actual system of relations in virtue of which the Pfaffian equation is satisfied, all the relations connecting t, t₁ . . . t_m only are given by

t＝ψ(t_s+1 ... t_m), t₁＝ψ₁(t_s+1 ... t_m), ... t_s＝ψ_s(t_s+1 ... t_m);

so that the equation

dψ−u₁dψ₁−...−u_sdψ_s−u_s+1dt_s+1−...−u_mdt_m＝0

is identically true in regard to u₁, ... u_m, t_s+1 ..., t_m; equating to zero the coefficients of the differentials of these variables, we thus obtain m−s relations of the form

dψ/dt_j−u₁dψ₁/dt_j−. . .−u_sdψ_s/dt_j−u_j＝0;

these m−s relations, with the previous s + 1 relations, constitute a set of m + 1 relations connecting the 2m + 1 variables in virtue of which the Pfaffian equation is satisfied independently of the form of the functions ψ,ψ₁, ... ψ_s. There is clearly such a set for each of the values s = 0, s = 1, . . ., s = m−1, s = m. And for any value of s there may exist relations additional to the specified m + 1 relations, provided they do not involve any relation connecting t, t₁, . . . t_m only, and are consistent with the m−s relations connecting u₁, ... u_m. It is now evident that, essentially, the integration of a Pfaffian equation

a₁dx₁ + ... + a_ndx_n＝0,

wherein a₁, ... a_n are functions of x₁, ... x_n, is effected by the processes necessary to bring it to its reduced form, involving only independent variables. And it is easy to see that if we suppose this reduction to be carried out in all possible ways, there is no need to distinguish the classes of integrals corresponding to the various values of s; for it can be verified without difficulty that by putting t′ = t−u₁t₁−...−u_st_s, t′₁ = u₁, ... t′_s = u_s, u′₁ = −t₁, ..., u′_s = −t_s, t′_s+1 = t_s+1, ... t′_m = t_m, u′_s+1 = u_s+1, ... u′_m = u_m, the reduced equation becomes changed to dt′−u′₁dt′₁− ...−u′_mdt′_m = 0, and the general relations changed to

t′＝ψ(t′_s+1, ... t′_m)−t′₁ψ₁(t′_s+1, ... t′_m)− ...−t′_sψ_s(t′_s+1, ... t′_m),＝φ,

say, together with u′₁ = dφ/dt′₁, ..., u′_m = dφ/dt′_m, which contain only one relation connecting the variables t′, t′₁, ... t′_m only.

This method for a single Pfaffian equation can, strictly speaking, be generalized to a simultaneous system of (n−r) Pfaffian equations dx_j = c_1jdx₁ + ... + c_rjdx_r only in the case already treated, Simultaneous Pfaffian equations. when this system is satisfied by regarding x_r+1, ... x_n as suitable functions of the independent variables x₁, . . . x_r; in that case the integral manifolds are of r dimensions. When these are non-existent, there may be integral manifolds of higher dimensions; for if

dφ＝φ₁dx₁ + ... + φ_rdx_r + φ_r+1(c_1,r+1dx₁ + ... + c_r,r+1dx_r) + φ_r+2( ) + ...

be identically zero, then φσ + cσ,_r+1φ_r+1 + ... + cσ,_nφ_n ≈ 0, or φ satisfies the r partial differential equations previously associated with the total equations; when these are not a complete system, but included in a complete system of r−μ equations, having therefore n−r−μ independent integrals, the total equations are satisfied over a manifold of r + μ dimensions (see E. v. Weber, Math. Annal. 1v. (1901), p. 386).

It seems desirable to add here certain results, largely of algebraic character, which naturally arise in connexion with the theory of contact transformations. For any two functions of the 2n Contact transformations. independent variables x₁, ... x_n, p₁, ... p_n we denote by (φψ) the sum of the n terms such as dφdψ/dp_idx_i−dψdφ/dp_idx_i For two functions of the (2n + 1) independent variables z, x₁, ... x_n, p₁, ... p_n we denote by φψ the sum of the n terms such as

${\frac {d\phi }{dp_{i}}}\left({\frac {d\psi }{dx_{i}}}+p_{i}{\frac {d\psi }{dz}}\right)-{\frac {d\psi }{dp_{i}}}\left({\frac {d\phi }{dx_{i}}}+p_{i}{\frac {d\phi }{dz}}\right).$

It can at once be verified that for any three functions [ƒ[φψ]] + [φ[ψƒ]] + [ψ[ƒφ]] = dƒ/dz [φψ] + dφ/dz [ψƒ] + dψ/dz [ƒφ], which when ƒ, φ,ψ do not contain z becomes the identity (ƒ(φψ)) + (φ(ψƒ)) + (ψ(ƒφ)) = 0.Then, if X₁, ... X_n, P₁, ... P_n be such functions Of x₁, ... x_n, p₁ ... p_n that P₁dX₁ + ... + P_ndX_n is identically equal to p₁dx₁ + ... + p_ndx_n, it can be shown by elementary algebra, after equating coefficients of independent differentials, (1) that the functions X₁, ... P_n are independent functions of the 2n variables x₁, ... p_n, so that the equations x′_i = X_i, p′_i = P_i can be solved for x₁, ... x_n, p₁, ... p_n, and represent therefore a transformation, which we call a homogeneous contact transformation; (2) that the X₁, ... X_n are homogeneous functions of p₁, ... p_n of zero dimensions, the P₁, ... P_n are homogeneous functions of p₁, ... p_n of dimension one, and the 1/2n(n−1) relations (X_iX_j) = 0 are verified. So also are the n² relations (P_iX_i = 1, (P_iX_j) = 0, (P_iP_j) = 0. Conversely, if X₁, ... X_n be independent functions, each homogeneous of zero dimension in p₁, ... p_n satisfying the 1/2n(n−1) relations (X_iX_j) = 0, then P₁, ... P_n can be uniquely determined, by solving linear algebraic equations, such that P₁dX₁ + ... + P_ndX_n = p₁dx₁ + ... + p_ndx_n. If now we put n + 1 for n, put z for x_n+1, Z for X_n+1, Q_i for -P_i/P_n+1, for i = 1, ... n, put q_i for -p_i/p_n+1 and σ for q_n+1/Q_n+1, and then finally write P₁, ... P_n, p₁, ... p_n for Q₁, ... Q_n, q₁, ... q_n, we obtain the following results: If ZX₁ ... X_n, P₁, ... P_n be functions of z, x₁, ... x_n, p₁, ... p_n, such that the expression dZ−P₁dX₁−...−P_ndX_n is identically equal to σ(dz−p₁dx₁−...−p_ndx_n), and σ not zero, then (1) the functions Z, X₁, ... X_n, P₁, ... P_n are independent functions of z, x₁, ... x_n, p₁, ... p_n, so that the equations z′ = Z, x′_i = X_i, p′_i = P_i can be solved for z, x₁, ... x_n, p₁, ... p_n and determine a transformation which we call a (non-homogeneous) contact transformation; (2) the Z, X₁, ... X_n verify the 1/2n(n + 1) identities [ZX_i] = 0, [X_iX_j] = 0. And the further identities

$[\mathrm {Z} \sigma ]{=}\sigma {\frac {d\mathrm {Z} }{dz}}-\sigma ^{2},[\mathrm {X} _{i}\sigma ]{=}\sigma {\frac {d\mathrm {X} _{i}}{dz}},[\mathrm {P} _{i}\sigma ]{=}{\frac {d\mathrm {P} _{i}}{dz}}$

are also verified. Conversely, if Z, x₁, ... X_n be independent functions satisfying the identities [ZX_i] = 0, [X_iX_j] = 0, then σ, other than zero, and P₁, ... P_n can be uniquely determined, by solution of algebraic equations, such that

dZ − P₁dX₁ − ... − P_ndX_n＝σ(dz − p₁dx₁ − ... − p_ndx_n).

Finally, there is a particular case of great importance arising when σ = 1, which gives the results: (1) If U, X₁, ... X_n, P₁, ... P_n be 2n + 1 functions of the 2n independent variables x₁, ... x_n, p₁, ... p_n, satisfying the identity

dU + P₁dx₁ + ... + P_ndX_n＝p₁dx₁ + ... + p_ndx_n,

then the 2n functions P₁, ... P_n, X₁, ... X_n are independent, and we have

(X_iX_j)＝0, (X_iU)＝δX_i, (P_iX_i)＝1, (P_iX_j)＝0, (P_iP_j)＝0, (P_iU) + P_i＝δP_i,

where δ denotes the operator p₁d/dp₁ + ... + p_nd/dp_n; (2) If X₁, ... X_n be independent functions of x₁, ... x_n, p₁, ... p_n, such that (X_iX_j) = 0, then U can be found by a quadrature, such that

(X_iU)＝δX_i;

and when X_i, ... X_n, U satisfy these 1/2n(n + 1) conditions, then P₁, ... P_n can be found, by solution of linear algebraic equations, to render true the identity dU + P₁dX₁ + ... + P_ndX_n = p₁dx₁ + ... + p_ndx_n; (3) Functions X₁, ... X_n, P₁, ... P_n can be found to satisfy this differential identity when U is an arbitrary given function of x₁, ... x_n, p₁, ... p_n; but this requires integrations. In order to see what integrations, it is only necessary to verify the statement that if U be an arbitrary given function of x₁, ... x_n, p₁, ... p_n, and, for r < n, X₁, ... X_r be independent functions of these variables, such that (XσU) = δXσ, (XρXσ) = 0, for ρ, σ = 1 ... r, then the r + 1 homogeneous linear partial differential equations of the first order (Uƒ) + δƒ = 0, (Xρƒ) = 0, form a complete system. It will be seen that the assumptions above made for the reduction of Pfaffian expressions follow from the results here enunciated for contact transformations.

We pass on now to consider the solution of any partial differential equation of the first order; we attempt to explain certain ideas relatively to a single equation with any number of independent variables (in particular, an ordinary equation of the first order with one independent Partial differential equation of the first order.variable) by speaking of a single equation with two independent variables x, y, and one dependent variable z. It will be seen that we are naturally led to consider systems of such simultaneous equations, which we consider below. The central discovery of the transformation theory of the solution of an equation F(x, y, z, dz/dx, dz/dy) = 0 is that its solution can always be reduced to the solution of partial equations which are linear. For this, however, we must regard dz/dx, dz/dy, during the process of integration, not as the differential coefficients of a function z in regard to x and y, but as variables independent of x, y, z, the too great indefiniteness that might thus appear to be introduced being provided for in another way. We notice that if z = ψ(x, y) be a solution of the differential equation, then dz = dxdψ/dx + dydψ/dy; thus if we denote the equation by F(x, y, z, p, q,) = 0, and prescribe the condition dz = pdx + qdy for every solution, any solution such as z = ψ(x, y) will necessarily be associated with the equations p = dz/dx, q = dz/dy, and z will satisfy the equation in its original form. We have previously seen (under Pfaffian Expressions) that if five variables x, y, z, p, q, otherwise independent, be subject to dz − pdx − qdy = 0, they must in fact be subject to at least three mutual relations. If we associate with a point (x, y, z) the plane

Z − z＝p(X − x) + q(Y − y)

passing through it, where X, Y, Z are current co-ordinates, and call this association a surface-element; and if two consecutive elements of which the point(x + dx, y + dy, z + dz) of one lies on the plane of the other, for which, that is, the condition dz = pdx + qdy is satisfied, be said to be connected, and an infinity of connected elements following one another continuously be called a connectivity, then our statement is that a connectivity consists of not more than ∞² elements, the whole number of elements (x, y, z, p, q) that are possible being called ∞⁵. The solution of an equation F(x, y, z, dz/dx, dz/dy) = 0 is then to be understood to mean finding in all possible ways, from the ∞⁴ elements (x, y, z, p, q) which satisfy F(x, y, z, p, q) = 0 a set of ∞² elements forming a connectivity; or, more analytically, finding in all possible ways two relations G = 0, H = 0 connecting x, y, z, p, q and independent of F = 0, so that the three relations together may involve

dz＝pdx + qdy.

Such a set of three relations may, for example, be of the form z = ψ(x, y), p = dψ/dx, q = dψ/dy; but it may also, as another case, involve two relations z = ψ(y), x = ψ₁(y) connecting x, y, z, the third relation being

ψ′(y)＝pψ′₁(y) + q,

the connectivity consisting in that case, geometrically, of a curve in space taken with ∞¹ of its tangent planes; or, finally, a connectivity is constituted by a fixed point and all the planes passing through that point. This generalized view of the meaning of a solution of F = 0 is of advantage, moreover, in view of anomalies otherwise arising from special forms of the equation Meaning of a solution of the equation. itself. For instance, we may include the case, sometimes arising when the equation to be solved is obtained by transformation from another equation, in which F does not contain either p or q. Then the equation has ∞² solutions, each consisting of an arbitrary point of the surface F = 0 and all the ∞² planes passing through this point; it also has ∞² solutions, each consisting of a curve drawn on the surface F = 0 and all the tangent planes of this curve, the whole consisting of ∞² elements; finally, it has also an isolated (or singular) solution consisting of the points of the surface, each associated with the tangent plane of the surface thereat, also ∞² elements in all. Or again, a linear equation F = Pp + Qq − R = 0, wherein P, Q, R are functions of x, y, z only, has ∞² solutions, each consisting of one of the curves defined by

dx/P＝dy/Q＝dz/R

taken with all the tangent planes of this curve; and the same equation has ∞² solutions, each consisting of the points of a surface containing ∞¹ of these curves and the tangent planes of this surface. And for the case of n variables there is similarly the possibility of n + 1 kinds of solution of an equation F(x₁, ... x_n, z, p₁, ... p_n) = 0; these can, however, by a simple contact transformation be reduced to one kind, in which there is only one relation z′ = ψ(x′₁, ... x′_n) connecting the new variables x′₁, ... x′_n, z′ (see under Pfaffian Expressions); just as in the case of the solution

z＝ψ(y), x＝ψ₁(y), ψ′(y)＝pψ′₁(y) + q

of the equation Pp + Qq = R the transformation z′ = z − px, x′ = p, p′ = −x, y′ = y, q′ = q gives the solution

z′＝ψ(y′) + x′ψ₁(y′), p′＝dz′/dx′, q′＝dz′/dy′

of the transformed equation. These explanations take no account of the possibility of p and q being infinite; this can be dealt with by writing p = −u/w, q = −v/w, and considering homogeneous equations in u, v, w, with udx + vdy + wdz = 0 as the differential relation necessary for a connectivity; in practice we use the ideas associated with such a procedure more often without the appropriate notation.

In utilizing these general notions we shall first consider the theory of characteristic chains, initiated by Cauchy, which shows well the nature of the relations implied by the given differential equation; the alternative ways of carrying out the necessary integrations are suggested by considering Order of the ideas.the method of Jacobi and Mayer, while a good summary is obtained by the formulation in terms of a Pfaffian expression.

Consider a solution of F = 0 expressed by the three independent equations F = 0, G = 0, H = 0. If it be a solution in which there is more than one relation connecting x, y, z, let new variables x′, y′, z′, p′, q′ be introduced, as before explained under Pfaffian Expressions, in which z′ is of the formCharacteristic chains.

z′＝z−p₁x₁−... −p_sx_s (s＝1 or 2),

so that the solution becomes of a form z′ = ψ(x′y′), p′ = dψ/dx′, q′ = dψ/dy′, which then will identically satisfy the transformed equations F′ = 0, G′ = 0, H′ = 0. The equation F′ = 0, if x′, y′, z′ be regarded as fixed, states that the plane Z − z′ = p′(X − x′) + q′(Y − y′) is tangent to a certain cone whose vertex is (x′, y′, z′), the consecutive point (x′ + dx′, y′ + dy′, z′ + dz′) of the generator of contact being such that

$dx'\left/{\frac {d\mathrm {F} '}{dp'}}\right.=dy'\left/{\frac {d\mathrm {F} '}{dq'}}\right.=dz'\left/\left(p'{\frac {d\mathrm {F} '}{dp'}}+q'{\frac {d\mathrm {F} '}{dq'}}\right)\right..$

Passing in this direction on the surface z′ = ψ(x′, y′) the tangent plane of the surface at this consecutive point is (p′ + dp′, q′ + dq′), where, since F′(x′, y′, ψ, dψ/dx′, dψ/dy′)＝0 is identical, we have dx′ (dF′/dx′ + p′dF′/dz′) + dp′dF′/dp′＝0. Thus the equations, which we shall call the characteristic equations,

$dx'\left/{\frac {d\mathrm {F} '}{dp'}}\right.=dy'\left/{\frac {d\mathrm {F} '}{dq'}}\right.=dz'\left/\left(p'{\frac {d\mathrm {F} '}{dp'}}+q'{\frac {d\mathrm {F} '}{dq'}}\right)\right.=dp'\left/\left(-{\frac {d\mathrm {F} '}{dx'}}-p'{\frac {d\mathrm {F} '}{dz'}}\right)\right.=dq'\left/\left(-{\frac {d\mathrm {F} '}{dy'}}-q'{\frac {d\mathrm {F} '}{dz'}}\right)\right.$

are satisfied along a connectivity of ∞¹ elements consisting of a curve on z′＝ψ(x′, y′) and the tangent planes of the surface along this curve. The equation F′＝0, when p′, q′ are fixed, represents a curve in the plane Z − z′＝p′(X − x′) + q′(Y − y′) passing through (x′, y′, z′); if (x′ + δx′, y′ + δy′, z′ + δz′) be a consecutive point of this curve, we find at once

$\delta x'\left({\frac {d\mathrm {F} '}{dx'}}+p'{\frac {d\mathrm {F} '}{dz'}}\right)+\delta y'\left({\frac {d\mathrm {F} '}{dy'}}q'{\frac {d\mathrm {F} '}{dz'}}\right)=0;$

thus the equations above give δx′dp′ + δy′dq′＝0, or the tangent line of the plane curve, is, on the surface z′＝ψ(x′, y′), in a direction conjugate to that of the generator of the cone. Putting each of the fractions in the characteristic equations equal to dt, the equations enable us, starting from an arbitrary element x′₀, y′₀, z′₀, p′₀, q′₀, about which all the quantities F′, dF′/dp′, &c., occurring in the denominators, are developable, to define, from the differential equation F′＝0 alone, a connectivity of ∞¹ elements, which we call a characteristic chain; and it is remarkable that when we transform again to the original variables (x, y, z, p, q), the form of the differential equations for the chain is unaltered, so that they can be written down at once from the equation F＝0. Thus we have proved that the characteristic chain starting from any ordinary element of any integral of this equation F＝0 consists only of elements belonging to this integral. For instance, if the equation do not contain p, q, the characteristic chain, starting from an arbitrary plane through an arbitrary point of the surface F＝0, consists of a pencil of planes whose axis is a tangent line of the surface F＝0. Or if F＝0 be of the form Pp + Qq＝R, the chain consists of a curve satisfying dx/P＝dy/Q＝dz/R and a single infinity of tangent planes of this curve, determined by the tangent plane chosen at the initial point. In all cases there are ∞³ characteristic chains, whose aggregate may therefore be expected to exhaust the ∞⁴ elements satisfying F＝0.

Consider, in fact, a single infinity of connected elements each satisfying F＝0, say a chain connectivity T, consisting of elements specified by x₀, y₀, z₀, p₀, q₀, which we suppose expressed as functions of a parameter u, so that Complete integral constructed with characteristic chains.

U₀＝dz₀/du − p₀dx₀/du − q₀dy₀/du

is everywhere zero on this chain; further, suppose that each of F, dF/dp, ... , dF/dx + pdF/dz is developable about each element of this chain T, and that T is not a characteristic chain. Then consider the aggregate of the characteristic chains issuing from all the elements of T. The ∞² elements, consisting of the aggregate of these characteristic chains, satisfy F＝0, provided the chain connectivity T consists of elements satisfying F＝0; for each characteristic chain satisfies dF＝0. It can be shown that these chains are connected; in other words, that if x, y, z, p, q, be any element of one of these characteristic chains, not only is

dz/dt − pdx/dt − qdy/dt＝0,

as we know, but also U＝dz/du − pdx/du − qdy/du is also zero. For we have

${\frac {d\mathrm {U} }{dt}}={\frac {d}{dt}}\left({\frac {dz}{du}}-p{\frac {dx}{du}}-q{\frac {dy}{du}}\right)-{\frac {d}{du}}\left({\frac {dz}{dt}}-p{\frac {dx}{dt}}-q{\frac {dy}{dt}}\right)$

$={\frac {dp}{du}}{\frac {dx}{dt}}-{\frac {dp}{dt}}{\frac {dx}{du}}+{\frac {dq}{du}}{\frac {dy}{dt}}-{\frac {dq}{dt}}{\frac {dy}{du}}$

which is equal to

${\frac {dp}{du}}{\frac {d\mathrm {F} }{dp}}+{\frac {dx}{du}}\left({\frac {d\mathrm {F} }{dx}}+p{\frac {d\mathrm {F} }{dz}}\right)+{\frac {dq}{du}}{\frac {d\mathrm {F} }{dq}}+{\frac {dy}{du}}\left({\frac {d\mathrm {F} }{dy}}+q{\frac {d\mathrm {F} }{dz}}\right)=-{\frac {d\mathrm {F} }{dz}}\mathrm {U} .$

As dF/dz is a developable function of t, this, giving

$\mathrm {U} =\mathrm {U} _{0}exp\left(-\int _{t_{0}}^{t}{\frac {d\mathrm {F} }{dz}}dt\right),$

shows that U is everywhere zero. Thus integrals of F＝0 are obtainable by considering the aggregate of characteristic chains issuing from arbitrary chain connectivities T satisfying F＝0; and such connectivities T are, it is seen at once, determinable without integration. Conversely, as such a chain connectivity T can be taken out from the elements of any given integral all possible integrals are obtainable in this way. For instance, an arbitrary curve in space, given by x₀＝θ(u), y₀＝φ(u), z₀＝ψ(u), determines by the two equations F(x₀, y₀, z₀, p₀, q₀)＝0, ψ′(u)＝p₀θ′(u) + q₀φ′(u), such a chain connectivity T, through which there passes a perfectly definite integral of the equation F＝0. By taking ∞² initial chain connectivities T, as for instance by taking the curves x₀＝θ, y₀ ＝φ, z₀ ＝ψ to be the ∞² curves upon an arbitrary surface, we thus obtain ∞² integrals, and so ∞⁴ elements satisfying F＝0. In general, if functions G, H, independent of F, be obtained, such that the equations F＝0, G＝b, H＝c represent an integral for all values of the constants b, c, these equations are said to constitute a complete integral. Then ∞⁴ elements satisfying F＝0 are known, and in fact every other form of integral can be obtained without further integrations.

In the foregoing discussion of the differential equations of a characteristic chain, the denominators dF/dp, ... may be supposed to be modified in form by means of F＝0 in any way conducive to a simple integration. In the immediately following explanation of ideas, however, we consider indifferently all equations F＝constant; when a function of x, y, z, p, q is said to be zero, it is meant that this is so identically, not in virtue of F＝0; in other words, we consider the integration of F＝a, where a is an arbitrary constant. In the theory of linear partial equations we have seen that the integration Operations necessary for integration of
F＝a. of the equations of the characteristic chains, from which, as has just been seen, that of the equation F＝a follows at once, would be involved in completely integrating the single linear homogeneous partial differential equation of the first order [Fƒ]＝0 where the notation is that explained above under Contact Transformations. One obvious integral is ƒ＝F. Putting F＝a, where a is arbitrary, and eliminating one of the independent variables, we can reduce this equation [Fƒ]＝0 to one in four variables; and so on. Calling, then, the determination of a single integral of a single homogeneous partial differential equation of the first order in n independent variables, an operation of order n − 1, the characteristic chains, and therefore the most general integral of F＝a, can be obtained by successive operations of orders 3, 2, 1. If, however, an integral of F＝a be represented by F＝a, G＝b, H＝c, where b and c are arbitrary constants, the expression of the fact that a characteristic chain of F＝a satisfies dG＝0, gives [FG]＝0; similarly, [FH]＝0 and [GH]＝0, these three relations being identically true. Conversely, suppose that an integral G, independent of F, has been obtained of the equation [Fƒ]＝0, which is an operation of order three. Then it follows from the identity [ƒ[φψ]] + [φ[ψƒ]] + [ψ[ƒφ]]＝dƒ/dz [ψφ] + dφ/dz [ψƒ] + dψ/dz [ƒφ] before remarked, by putting φ＝F, ψ＝G, and then [Fƒ]＝A(ƒ), [Gƒ]＝B(ƒ), that AB(ƒ) − BA(ƒ)＝dF/dz B(ƒ) − dG/dz A(ƒ), so that the two linear equations [Fƒ]＝0, [Gƒ]＝0 form a complete system; as two integrals F, G are known, they have a common integral H, independent of F, G, determinable by an operation of order one only. The three functions F, G, H thus identically satisfy the relations [FG]＝[GH]＝[FH]＝0. The ∞² elements satisfying F＝a, G＝b, H＝c, wherein a, b, c are assigned constants, can then be seen to constitute an integral of F＝a. For the conditions that a characteristic chain of G＝b issuing from an element satisfying F＝a, G＝b, H＝c should consist only of elements satisfying these three equations are simply [FG]＝0, [GH]＝0. Thus, starting from an arbitrary element of (F＝a, G＝b, H＝c), we can single out a connectivity of elements of (F＝a, G＝b, H＝c) forming a characteristic chain of G＝b; then the aggregate of the characteristic chains of F＝a issuing from the elements of this characteristic chain of G＝b will be a connectivity consisting only of elements of

(F＝a, G＝b, H＝c),

and will therefore constitute an integral of F＝a; further, it will include all elements of (F＝a, G＝b, H＝c). This result follows also from a theorem given under Contact Transformations, which shows, moreover, that though the characteristic chains of F＝a are not determined by the three equations F＝a, G＝b, H＝c, no further integration is now necessary to find them. By this theorem, since identically [FG]＝[GH]＝[FH]＝0, we can find, by the solution of linear algebraic equations only, a non-vanishing function σ and two functions A, C, such that

dG − AdF − CdH＝σ(dz − pdz − qdy);

thus all the elements satisfying F＝a, G＝b, H＝c, satisfy dz＝pdx + qdy and constitute a connectivity, which is therefore an integral of F＝a. While, further, from the associated theorems, F, G, H, A, C are independent functions and [FC]＝0. Thus C may be taken to be the remaining integral independent of G, H, of the equation [Fƒ]＝0, whereby the characteristic chains are entirely determined.

When we consider the particular equation F＝0, neglecting the case when neither p nor q enters, and supposing p to enter, we may express p from F＝0 in terms of x, y, z, q, and then eliminate it from all other equations. Then instead of the equation [Fƒ]＝0, we have, if F＝0 give p＝ψ(x, y, z, q), the equation

$\Omega f=-\left({\frac {df}{dx}}+\psi {\frac {df}{dz}}\right)+{\frac {d\psi }{dq}}\left({\frac {df}{dy}}+q{\frac {df}{dz}}\right)-\left({\frac {d\psi }{dy}}+q{\frac {d\psi }{dz}}\right){\frac {df}{dq}}=0,$

moreover obtainable by omitting the term in dƒ/dp in [p − ψ, ƒ]＝0. Let x₀, y₀, z₀, q₀, be values about which the coefficients in this equation are developable, and let ζ, η, ω be the principal solutions reducing respectively to z, y and q when x＝x₀. Then the equations p＝ψ, ζ＝z₀, η＝y₀, ω＝q₀ The single equation F＝0 and Pfaffian formulations.represent a characteristic chain issuing from the element x₀, y₀, z₀, ψ₀, q₀; we have seen that the aggregate of such chains issuing from the elements of an arbitrary chain satisfying

dz₀＝p₀dx₀ − q₀dy₀＝0

constitute an integral of the equation p＝ψ. Let this arbitrary chain be taken so that x₀ is constant; then the condition for initial values is only

dz₀ − q₀dy₀＝0,

and the elements of the integral constituted by the characteristic chains issuing therefrom satisfy

dζ − ωdη＝0.

Hence this equation involves dz − ψdx − qdy = 0, or we have

dz − ψdx − qdy＝σ(dζ − ωdη),

where σ is not zero. Conversely, the integration of p = ψ is, essentially, the problem of writing the expression dz − ψdx − qdy in the form σ(dζ − ωdη), as must be possible (from what was said under Pfaffian Expressions).

To integrate a system of simultaneous equations of the first order X₁ = a₁, ... X_r = a_r in n independent variables x₁, ... x_n and one dependent variable z, we write p₁ for dz/dx₁, &c., and attempt to find n + 1 − r further functions Z, X_r+1 ... X_n, such that the equations Z = a, X_i = a_i,(i = 1, ... n) System of equations of the first order.involve dz − p₁dx₁ − ... − p_ndx_n = 0. By an argument already given, the common integral, if existent, must be satisfied by the equations of the characteristic chains of any one equation X_i = a_i; thus each of the expressions [X_iX_j] must vanish in virtue of the equations expressing the integral, and we may without loss of generality assume that each of the corresponding 1/2r(r − 1) expressions formed from the r given differential equations vanishes in virtue of these equations. The determination of the remaining n + 1 − r functions may, as before, be made to depend on characteristic chains, which in this case, however, are manifolds of r dimensions obtained by integrating the equations [X₁ƒ] = 0, ... [X_rƒ] = 0; or having obtained one integral of this system other than X₁, ... X_r, say X_r+1, we may consider the system [X₁ƒ] = 0, ... [X_r+1ƒ] = 0, for which, again, we have a choice; and at any stage we may use Mayer’s method and reduce the simultaneous linear equations to one equation involving parameters; while if at any stage of the process we find some but not all of the integrals of the simultaneous system, they can be used to simplify the remaining work; this can only be clearly explained in connexion with the theory of so-called function groups for which we have no space. One result arising is that the simultaneous system p₁ = φ₁, ... p_r = φ_r, wherein p₁, . . . p_r are not involved in φ₁, . . . φ_r, if it satisfies the 1/2r(r − 1) relations [p_i − φ_i, p_j − φ_j] = 0, has a solution z = ψ(x₁, ... x_n), p₁ = dψ/dx₁, ... p_n = dψ/dx_n, reducing to an arbitrary function of x_r+1, ... x_n only, when x₁ = xº₁, ... x_r = xº_r under certain conditions as to developability; a generalization of the theorem for linear equations. The problem of integration of this system is, as before, to put

dz − φ₁dx₁ − ... − φ_rdx_r − p_r+1dx_r+1 − ... − p_ndx_n

into the form σ(dζ − ω_r+1 + dξ_r+1 − ... − ω_ndξ_n); and here ζ, ξ_r+1, ... ξ_n, ω_r+1, ... ω_n may be taken, as before, to be principal integrals of a certain complete system of linear equations; those, namely, determining the characteristic chains.

If L be a function of t and of the 2n quantities x₁, ... x_n, ẋ₁, ... ẋ_n, where ẋ_i, denotes dx_i/dt, &c., and if in the n equations

${\frac {d}{dt}}\left({\frac {d\mathrm {L} }{d{\dot {x}}_{i}}}\right)={\frac {d\mathrm {L} }{dx_{i}}}$

we put p_i = dL/dẋ_i, and so express ẋ_i, ... ẋ_n in terms of t, x_i, ... x_n, p₁, ... p_n, assuming that the determinant of the quantities d²L/dx_idẋ_j is not zero; if, further, H denote the function of t, x₁, ... x_n, p₁, ... p_n, numerically equal to p₁ẋ₁ + ... + p_nẋ_n − L, it is easy Equations of dynamics. to prove that dp_i/dt = −dH/dx_i, dx_i/dt = dH/dp_i. These so-called canonical equations form part of those for the characteristic chains of the single partial equation dz/dt + H(t, x₁, ... x_n, dz/dx₁, ..., dz/dx_n) = 0, to which then the solution of the original equations for x₁ ... x_n can be reduced. It may be shown (1) that if z = ψ(t, x₁, ... x_n, c₁, .. c_n) + c be a complete integral of this equation, then p_i = dψ/dx_i, dψ/dc_i = e_i are 2n equations giving the solution of the canonical equations referred to, where c₁ ... c_n and e₁, ... e_n are arbitrary constants; (2) that if x_i = X_i(t, x⁰₁, ... pº_n), p_i = P_i(t, xº₁, ... p⁰_n) be the principal solutions of the canonical equations for t = t⁰, and ω denote the result of substituting these values in p₁dH/dp₁ + ... + p_ndH/dp_n − H, and Ω = ∫t_t0 ωdt, where, after integration, Ω is to be expressed as a function of t, x₁, ... x_n, xº₁, ... xº_n, then z = Ω + z⁰ is a complete integral of the partial equation.

A system of differential equations is said to allow a certain continuous group of transformations (see Groups, Theory of) when the introduction for the variables in the differential equations of the new variables given by the equations of the group leads, for all values of the Application of theory of continuous groups to formal theories.parameters of the group, to the same differential equations in the new variables. It would be interesting to verify in examples that this is the case in at least the majority of the differential equations which are known to be integrable in finite terms. We give a theorem of very general application for the case of a simultaneous complete system of linear partial homogeneous differential equations of the first order, to the solution of which the various differential equations discussed have been reduced. It will be enough to consider whether the given differential equations allow the infinitesimal transformations of the group.

It can be shown easily that sufficient conditions in order that a complete system Π₁ƒ = 0 ... Π_kƒ = 0, in n independent variables, should allow the infinitesimal transformation Pƒ = 0 are expressed by k equations Π_iPƒ − PΠ_iƒ = λ_i1Π₁ƒ + ... + λ_ikΠ_kƒ. Suppose now a complete system of n − r equations in n variables to allow a group of r infinitesimal transformations (P₁f, ..., P_rƒ) which has an invariant subgroup of r − 1 parameters (P₁ƒ, ..., P_r−1ƒ), it being supposed that the n quantities Π₁ƒ, ..., Π_n-rƒ, P₁ƒ, ..., P_rƒ are not connected by an identical linear equation (with coefficients even depending on the independent variables). Then it can be shown that one solution of the complete system is determinable by a quadrature. For each of Π_iPσƒ − PσΠ_if is a linear function of Π₁ƒ, ..., Π_n-rƒ and the simultaneous system of independent equations Π₁ƒ = 0, ... Π_n-rƒ = 0, P₁ƒ = 0, ... P_r−1ƒ = 0 is therefore a complete system, allowing the infinitesimal transformation P_rƒ. This complete system of n − 1 equations has therefore one common solution ω, and P_r(ω) is a function of ω. By choosing ω suitably, we can then make P_r(ω) = 1. From this equation and the n − 1 equations Π_iω = 0, P_σω = 0, we can determine ω by a quadrature only. Hence can be deduced a much more general result, that if the group of r parameters be integrable, the complete system can be entirety solved by quadratures; it is only necessary to introduce the solution found by the first quadrature as an independent variable, whereby we obtain a complete system of n − r equations in n − 1 variables, subject to an integrable group of r − 1 parameters, and to continue this process. We give some examples of the application of the theorem. (1) If an equation of the first order y′ = ψ(x, y) allow the infinitesimal transformation ξdƒ/dx + ηdƒ/dy, the integral curves ω(x, y) = y⁰, wherein ω(x, y) is the solution of dƒ/dx + ψ(x, y) dƒ/dy = 0 reducing to y for x = x⁰, are interchanged among themselves by the infinitesimal transformation, or ω(x, y) can be chosen to make ξd_ω/dx + ηd_ω/dy = 1; this, with dω/dx + ψdω/dy = 0, determines ω as the integral of the complete differential (dy − ψdx)/(η − ψξ). This result itself shows that every ordinary differential equation of the first order is subject to an infinite number of infinitesimal transformations. But every infinitesimal transformation ξdƒ/dx + ηdƒ/dy can by change of variables (after integration) be brought to the form dƒ/dy, and all differential equations of the first order allowing this group can then be reduced to the form F(x, dy/dx) = 0. (2) In an ordinary equation of the second order y ″= ψ(x, y, y′), equivalent to dy/dx = y₁, dy₁/dx = ψ(x, y, y₁), if H, H₁ be the solutions for y and y₁ chosen to reduce to y^o and y^o₁ when x = x^o, and the equations H = y, H₁= y₁ be equivalent to ω = y^o, ω₁ = y^o₁, then ω, ω₁ are the principal solutions of Πƒ = dƒ/dx + y₁dƒ/dy + ψdƒ/dy₁ = 0. If the original equation allow an infinitesimal transformation whose first extended form (see Groups) is Pƒ = ξdƒ/dx + ηdƒ/dy + η₁dƒ/dy₁, where η₁δt is the increment of dy/dx when ξδt, ηδt are the increments of x, y, and is to be expressed in terms of x, y, y₁, then each of Pω and Pω₁ must be functions of ω and ω₁, or the partial differential equation Πƒ must allow the group Pƒ. Thus by our general theorem, if the differential equation allow a group of two parameters (and such a group is always integrable), it can be solved by quadratures, our explanation sufficing, however, only provided the form Πƒ and the two infinitesimal transformations are not linearly connected. It can be shown, from the fact that η₁ is a quadratic polynomial in y₁, that no differential equation of the second order can allow more than 8 really independent infinitesimal transformations, and that every homogeneous linear differential equation of the second order allows just 8, being in fact reducible to d²y/dx² = 0. Since every group of more than two parameters has subgroups of two parameters, a differential equation of the second order allowing a group of more than two parameters can, as a rule, be solved by quadratures. By transforming the group we see that if a differential equation of the second order allows a single infinitesimal transformation, it can be transformed to the form F(x, dγ/dx, d²γ/dx²); this is not the case for every differential equation of the second order. (3) For an ordinary differential equation of the third order, allowing an integrable group of three parameters whose infinitesimal transformations are not linearly connected with the partial equation to which the solution of the given ordinary equation is reducible, the similar result follows that it can be integrated by quadratures. But if the group of three parameters be simple, this result must be replaced by the statement that the integration is reducible to quadratures and that of a so-called Riccati equation of the first order, of the form dy/dx = A + By + Cy², where A, B, C are functions of x. (4) Similarly for the integration by quadratures of an ordinary equation y_n = ψ(x, y, y₁, ... y_n−1) of any order. Moreover, the group allowed by the equation may quite well consist of extended contact transformations. An important application is to the case where the differential equation is the resolvent equation defining the group of transformations or rationality group of another differential equation (see below); in particular, when the rationality group of an ordinary linear differential equation is integrable, the equation can be solved by quadratures.

Following the practical and provisional division of theories of differential equations, to which we alluded at starting, into transformation theories and function theories, we pass now to give some account of the latter. These are both a necessary logical complement of the former, and the Consideration of function theories
of differential equations.only remaining resource when the expedients of the former have been exhausted. While in the former investigations we have dealt only with values of the independent variables about which the functions are developable, the leading idea now becomes, as was long ago remarked by G. Green, the consideration of the neighbourhood of the values of the variables for which this developable character ceases. Beginning, as before, with existence theorems applicable for ordinary values of the variables, we are to consider the cases of failure of such theorems.

When in a given set of differential equations the number of equations is greater than the number of dependent variables, the equations cannot be expected to have common solutions unless certain conditions of compatibility, obtainable by equating different forms of the same differential coefficients deducible from the equations, are satisfied. We have had examples in systems of linear equations, and in the case of a set of equations p₁ = φ₁, . . . , p_r = φ_r . For the case when the number of equations is the same as that of dependent variables, the following is a general theorem which should be referred to: Let there be r equations in r dependent variables z₁, . . . z_r and n independent A general existence theorem. variables x₁, . . . x_n; let the differential coefficient of z_σ of highest order which enters be of order h_σ, and suppose d ^hσz_σ / dx₁^hσ to enter, so that the equations can be written d ^hσz_σ / dx₁^hσ = Φ_σ, where in the general differential coefficient of z_ρ which enters in Φ_σ, say

d ^{k₁ + . . . + k_n} z_ρ / dx₁^k₁ . . . dx_n^k_n,

we have k₁ < h_ρ and k₁ + . . . + k_n ≤ h_ρ. Let a₁, . . . a_n, b₁, . . . b_r, and b^ρ_{k₁ . . . k_n} be a set of values of

x₁, . . . x_n, z₁, . . . z_r

and of the differential coefficients entering in Φ_σ about which all the functions Φ₁, . . . Φ_r, are developable. Corresponding to each dependent variable z_σ, we take now a set of h_σ functions of x₂, . . . x_n, say φ_σ, φ_σ;⁽¹⁾, . . . ,φ_σ^h−1 arbitrary save that they must be developable about a₂, a₃, . . . a_n, and such that for these values of x₂, . . . x_n, the function φ_ρ reduces to b_ρ, and the differential coefficient

d ^{k₂ + . . . + k_n} φ_ρ^k₁ / dx₂^k₂ . . . dx_n^k_n

reduces to b^ρ_{k₁ . . . k_n}. Then the theorem is that there exists one, and only one, set of functions z₁, . . . z_r, of x₂, . . . x_n developable about a₁, . . . a_n satisfying the given differential equations, and such that for x₁ = a₁ we have

z_σ＝φ_σ, dz_σ / dx₁＝φ_σ⁽¹⁾, . . . d ^hσ−1z_σ / d ^hσ−1x₁＝φ_σ^hσ−1.

And, moreover, if the arbitrary functions φ_σ, φ_σ⁽¹⁾ . . . contain a certain number of arbitrary variables t₁, . . . t_m, and be developable about the values tº₁, . . . tº_m of these variables, the solutions z₁, . . . z_r will contain t₁, . . . t_m, and be developable about tº₁, . . . tº_m.

The proof of this theorem may be given by showing that if ordinary power series in x₁ − a₁, . . . x_n − a_n, t₁ − tº₁, . . . t_m − tº_m be substituted in the equations wherein in z_σ the coefficients of (x₁ − a₁)º, x₁ − a₁, . . ., (x₁ − a₁)^hσ−1 are the arbitrary functions φ_σ, φ_σ⁽¹⁾, . . . φ_σ^h−1, divided respectively by 1, 1!, 2!, &c., then the differential equations determine uniquely all the other coefficients, and that the resulting series are convergent. We rely, in fact, upon the theory of monogenic analytical functions (see Function), a function being determined entirely by its development in the neighbourhood of one set of values of the independent variables, from which all its other values arise by continuation; it being of course understood that the coefficients in the differential equations are to be continued at the same time. But it is to be remarked that there is no ground for believing, if this method of continuation be utilized, that the function is single-valued; we may quite well return to the same values of the independent variables with a different Singular points
of solutions. value of the function; belonging, as we say, to a different branch of the function; and there is even no reason for assuming that the number of branches is finite, or that different branches have the same singular points and regions of existence. Moreover, and this is the most difficult consideration of all, all these circumstances may be dependent upon the values supposed given to the arbitrary constants of the integral; in other words, the singular points may be either fixed, being determined by the differential equations themselves, or they may be movable with the variation of the arbitrary constants of integration. Such difficulties arise even in establishing the reversion of an elliptic integral, in solving the equation

(dx/ds)2＝(x − a₁)(x − a₂)(x − a₃)(x − a₄);

about an ordinary value the right side is developable; if we put x − a₁ = t₁², the right side becomes developable about t₁ = 0; if we put x = 1/t, the right side of the changed equation is developable about t = 0; it is quite easy to show that the integral reducing to a definite value x₀ for a value s₀ is obtainable by a series in integral powers; this, however, must be supplemented by showing that for no value of s does the value of x become entirely undetermined.

These remarks will show the place of the theory now to be sketched of a particular class of ordinary linear homogeneous differential equations whose importance arises from the completeness and generality with which they can be discussed. We have seen that if in the equationsLinear differential equations with rational coefficients.

dy/dx＝y₁, dy₁/dx＝y₂, . . ., dy_n−2/dx＝y_n−1,
dy_n−1/dx＝a_ny + a_n−1y₁ + . . . + a₁y_n−1,

where a₁, a₂, . . ., a_n are now to be taken to be rational functions of x, the value x = xº be one for which no one of these rational functions is infinite, and yº, yº₁, . . ., yº_n−1 be quite arbitrary finite values, then the equations are satisfied by

y＝yºu + yº₁u₁ + . . . + yº_n−1u_n−1,

where u, u₁, . . ., u_n−1 are functions of x, independent of yº, . . . yº_n−1, developable about x = xº; this value of y is such that for x = xº the functions y, y₁ . . . y_n−1 reduce respectively to yº, yº₁, . . . yº_n−1; it can be proved that the region of existence of these series extends within a circle centre xº and radius equal to the distance from xº of the nearest point at which one of a₁, . . . a_n becomes infinite. Now consider a region enclosing xº and only one of the places, say Σ, at which one of a₁, . . . a_n becomes infinite. When x is made to describe a closed curve in this region, including this point Σ in its interior, it may well happen that the continuations of the functions u, u₁, . . ., u_n−1 give, when we have returned to the point x, values v, v₁, . . ., v_n−1, so that the integral under consideration becomes changed to yº + yº₁v₁ + . . . + yº_n−1v_n−1. At xº let this branch and the corresponding values of y₁, . . . y_n−1 be ηº, ηº₁, . . . ηº_n−1; then, as there is only one series satisfying the equation and reducing to (ηº, ηº₁, . . . ηº_n−1) for x = xº and the coefficients in the differential equation are single-valued functions, we must have ηºu + ηº₁u₁ + . . . + ηº_n−1u_n−1 = yºv + yº₁v₁ + . . . + yº_n−1v_n−1; as this holds for arbitrary values of yº . . . yº_n−1, upon which u, . . . u_n−1 and v, . . . v_n−1 do not depend, it follows that each of v, . . . v_n−1 is a linear function of u, . . . u_n−1 with constant coefficients, say v_i = A_i1u + . . . + A_inu_n−1. Then

yºv + . . . + yº_n−1v_n−1＝(Σ_i A_i1 yº_i)u + . . . + (Σ_i A_in yº_i) u_n−1;

this is equal to μ(yºu + . . . + yº_n−1u_n−1) if Σ_i A_ir yº_i = μyº_r−1; eliminating yº . . . yº_n−1 from these linear equations, we have a determinantal equation of order n for μ; let μ₁ be one of its roots; determining the ratios of yº, y₁º, . . . yº_n−1 to satisfy the linear equations, we have thus proved that there exists an integral, H, of the equation, which when continued round the point Σ and back to the starting-point, becomes changed to H₁ = μ₁H. Let now ξ be the value of x at Σ and r₁ one of the values of (1/2πi) log μ₁; consider the function (x − ξ)^−r₁H; when x makes a circuit round x = ξ, this becomes changed to

exp (−2πir₁) (x − ξ)^−r1 μH,

that is, is unchanged; thus we may put H = (x − ξ)^r₁φ₁, φ₁ being a function single-valued for paths in the region considered described about Σ, and therefore, by Laurent’s Theorem (see Function), capable of expression in the annular region about this point by a series of positive and negative integral powers of x − ξ, which in general may contain an infinite number of negative powers; there is, however, no reason to suppose r₁ to be an integer, or even real. Thus, if all the roots of the determinantal equation in μ are different, we obtain n integrals of the forms (x − ξ)^r₁φ₁, . . ., (x − ξ)^r_nφ_n. In general we obtain as many integrals of this form as there are really different roots; and the problem arises to discover, in case a root be k times repeated, k − 1 equations of as simple a form as possible to replace the k − 1 equations of the form yº + . . . + yº_n−1v_n−1 = μ(yº + . . . + yº_n−1u_n−1) which would have existed had the roots been different. The most natural method of obtaining a suggestion lies probably in remarking that if r₂ = r₁ + h, there is an integral [(x − ξ)^{r1 + h}φ₂ − (x − ξ)^r₁φ₁] / h, where the coefficients in φ₂ are the same functions of r₁ + h as are the coefficients in φ₁ of r₁; when h vanishes, this integral takes the form

(x − ξ)^r1 [dφ₁/dr₁ + φ₁ log (x − ξ)],

or say

(x − ξ)^r₁ [φ₁ + ψ₁ log (x − ξ)];

denoting this by 2πiμ₁K, and (x − ξ)^r₁ φ₁ by H, a circuit of the point ξ changes K into

$\mathrm {K} {=}{\frac {1}{2\pi i\mu _{1}}}\left[e^{2\pi ir_{1}}(x-\xi )^{r_{1}}\psi _{1}+e^{2\pi ir_{1}}(x-\xi )^{r_{1}}\phi _{1}(2\pi i+\log(x-\xi ))\right]{=}\mu _{1}\mathrm {K} +\mathrm {H} .$

A similar artifice suggests itself when three of the roots of the determinantal equation are the same, and so on. We are thus led to the result, which is justified by an examination of the algebraic conditions, that whatever may be the circumstances as to the roots of the determinantal equation, n integrals exist, breaking up into batches, the values of the constituents H₁, H₂, ... of a batch after circuit about x = ξ being H₁′ = μ₁H₁, H₂′ = μ₁H₂ + H₁, H₃′ = μ₁H₃ + H₂, and so on. And this is found to lead to the forms (x − ξ)^r₁φ₁, (x − ξ)^r₁ [ψ₁ + φ₁ log (x − ξ)], (x − ξ)^r₁ [χ₁ + χ₂ log (x − ξ) + φ₁(log(x − ξ) )²], and so on. Here each of φ₁, ψ₁, χ₁, χ₂, ... is a series of positive and negative integral powers of x − ξ in which the number of negative powers may be infinite.

It appears natural enough now to inquire whether, under proper conditions for the forms of the rational functions a₁, ... a_n, it may be possible to ensure that in each of the series φ₁, ψ₁, [χ]₁, ... the number of negative powers shall be finite. Herein Regular equations. lies, in fact, the limitation which experience has shown to be justified by the completeness of the results obtained. Assuming n integrals in which in each of φ₁, ψ₁, χ₁ ... the number of negative powers is finite, there is a definite homogeneous linear differential equation having these integrals; this is found by forming it to have the form

y′ ⁿ = (x − ξ)⁻¹ b₁y′ ⁽ⁿ⁻¹⁾ + (x − ξ)⁻² b₂y′ ⁽ⁿ⁻²⁾ + ... + (x − ξ)⁻ⁿ b_ny,

where b₁, ... b_n are finite for x = ξ. Conversely, assume the equation to have this form. Then on substituting a series of the form (x − ξ)^r [1 + A₁(x − ξ) + A₂(x − ξ)² + ... ] and equating the coefficients of like powers of x − ξ, it is found that r must be a root of an algebraic equation of order n; this equation, which we shall call the index equation, can be obtained at once by substituting for y only (x − ξ)^r and replacing each of b₁, ... b_n by their values at x = ξ; arrange the roots r₁, r₂, ... of this equation so that the real part of r_i is equal to, or greater than, the real part of r_i+1, and take r equal to r₁; it is found that the coefficients A₁, A₂ ... are uniquely determinate, and that the series converges within a circle about x = ξ which includes no other of the points at which the rational functions a₁ ... a_n become infinite. We have thus a solution H₁ = (x − ξ)^r1φ₁ of the differential equation. If we now substitute in the equation y = H₁∫ηdx, it is found to reduce to an equation of order n − 1 for η of the form

η′ ⁽ⁿ⁻¹⁾ = (x − ξ)⁻¹ c₁η′ ⁽ⁿ⁻²⁾ + ... + (x − ξ)⁽ⁿ⁻¹⁾ c_n−1η,

where c₁, ... c_n−1 are not infinite at x = ξ. To this equation precisely similar reasoning can then be applied; its index equation has in fact the roots r₂ − r₁ − 1, ..., r_n − r₁ − 1; if r₂ − r₁ be zero, the integral (x − ξ)⁻¹ψ₁ of the η equation will give an integral of the original equation containing log (x − ξ); if r₂ − r₁ be an integer, and therefore a negative integer, the same will be true, unless in ψ₁ the term in (x − ξ)^{r₁ − r₂} be absent; if neither of these arise, the original equation will have an integral (x − ξ)^r₂φ₂. The η equation can now, by means of the one integral of it belonging to the index r₂ − r₁ − 1, be similarly reduced to one of order n − 2, and so on. The result will be that stated above. We shall say that an equation of the form in question is regular about x = ξ.

We may examine in this way the behaviour of the integrals at all the points at which any one of the rational functions a₁ ... a_n becomes infinite; in general we must expect that beside these the value x = ∞ will be a singular point for the Fuchsian equations. solutions of the differential equation. To test this we put x = 1/t throughout, and examine as before at t = 0. For instance, the ordinary linear equation with constant coefficients has no singular point for finite values of x; at x = ∞ it has a singular point and is not regular; or again, Bessel’s equation x²y″ + xy′ + (x² − n²)y = 0 is regular about x = 0, but not about x = ∞. An equation regular at all the finite singularities and also at x = ∞ is called a Fuchsian equation. We proceed to examine particularly the case of an equation of the second order

y″ + ay′ + by = 0.

Putting x = 1/t, it becomes

d²y/dt² + (2t⁻¹ − at⁻²) dy/dt + bt⁻⁴ y = 0,

which is not regular about t = 0 unless 2 − at⁻¹ and bt⁻², that is, unless ax and bx² are finite at x = ∞; which we thus assume; putting y = t^r(1 + A₁t + ... ), we find for the index equation at x = ∞ the equation r(r − 1) + r(2 − ax)₀ + (bx²)₀ = 0. If there be Equation of the second order. finite singular points at ξ₁, ... ξ_m, where we assume m > 1, the cases m = 0, m = 1 being easily dealt with, and if φ(x) = (x − ξ₁) ... (x − ξ_m), we must have a·φ(x) and b·[φ(x)]² finite for all finite values of x, equal say to the respective polynomials ψ(x) and θ(x), of which by the conditions at x = ∞ the highest respective orders possible are m − 1 and 2(m − 1). The index equation at x = ξ₁ is r(r − 1) + rψ(ξ₁) / φ′ (ξ₁) + θ(ξ)₁ / [φ′(ξ₁)]² = 0, and if α₁, β₁ be its roots, we have α₁ + β₁ = 1 − ψ(ξ₁) / φ′ (ξ₁) and α₁β₁ = θ(ξ)₁ / [φ′(ξ₁)]². Thus by an elementary theorem of algebra, the sum Σ(1 − α_i − β_i) / (x − ξ_i), extended to the m finite singular points, is equal to ψ(x) / φ(x), and the sum Σ(1 − α_i − β_i) is equal to the ratio of the coefficients of the highest powers of x in ψ(x) and φ(x), and therefore equal to 1 + α + β, where α, β are the indices at x = ∞. Further, if (x, 1)_m−2 denote the integral part of the quotient θ(x) / φ(x), we have Σ α_iβ_iφ′ (ξ_i) / (x = ξ_i) equal to −(x, 1)_m−2 + θ(x)/φ(x), and the coefficient of x^m−2 in (x, 1)_m−2 is αβ. Thus the differential equation has the form

y″ + y′Σ (1 − α_i − β_i) / (x − ξ_i) + y[(x, 1)_m-2 + Σ α_iβ_iφ′(ξ_i) / (x − ξ_i)]/φ(x) = 0.

If, however, we make a change in the dependent variable, putting y = (x − ξ₁)^α1 ... (x − ξ_m)^{α mη}, it is easy to see that the equation changes into one having the same singular points about each of which it is regular, and that the indices at x = ξ_i become 0 and β_i − α_i, which we shall denote by λ_i, for (x − ξ_i)^αj can be developed in positive integral powers of x − ξ_i about x = ξ_i; by this transformation the indices at x = ∞ are changed to

α + α₁ + ... + α_m, β + β₁ + ... + β_m

which we shall denote by λ, μ. If we suppose this change to have been introduced, and still denote the independent variable by y, the equation has the form

y″ + y′Σ (1 − λ_i) / (x − ξ_i) + y(x, 1)_m−2 / φ(x) = 0,

while λ + μ + λ₁ + ... + λ_m = m − 1. Conversely, it is easy to verify that if λμ be the coefficient of x^m−2 in (x, 1)_m−2, this equation has the specified singular points and indices whatever be the other coefficients in (x, 1)_m−2.

Thus we see that (beside the cases m = 0, m = 1) the “Fuchsian equation” of the second order with two finite singular points is distinguished by the fact that it has a definite form when the singular points and the indices are assigned. Hypergeometric equation. In that case, putting (x − ξ₁) / (x − ξ₂) = t / (t − 1), the singular points are transformed to 0, 1, ∞, and, as is clear, without change of indices. Still denoting the independent variable by x, the equation then has the form

x(1 − x)y″ + y′[1 − λ₁ − x(1 + λ + μ)] − λμy = 0,

which is the ordinary hypergeometric equation. Provided none of λ₁, λ₂, λ − μ be zero or integral about x = 0, it has the solutions

F(λ, μ, 1 − λ₁, x), x^λ1 F(λ + λ₁, μ + λ₁, 1 + λ₁, x);

about x = 1 it has the solutions

F(λ, μ, 1 − λ₂, 1 − x), (1 − x)^λ2 F(λ + λ₂, μ + λ₂, 1 + λ₂, 1 − x),

where λ + μ + λ₁ + λ₂ = 1; about x = ∞ it has the solutions

x^−λ F(λ, λ + λ₁, λ − μ + 1, x⁻¹), x^−μ F(μ, μ + λ₁, μ − λ + 1, x⁻¹),

where F(α, β, γ, x) is the series

$1+{\frac {\alpha \beta x}{\gamma }}+{\frac {\alpha (\alpha +1)\beta (\beta +1)x^{2}}{1\cdot 2\cdot \gamma (\gamma +1)}}\ldots ,$

which converges when |x| < 1, whatever α, β, γ may be, converges for all values of x for which |x| = 1 provided the real part of γ − α − β < 0 algebraically, and converges for all these values except x = 1 provided the real part of γ − α − β > −1 algebraically.

In accordance with our general theory, logarithms are to be expected in the solution when one of λ₁, λ₂, λ − μ is zero or integral. Indeed when λ₁ is a negative integer, not zero, the second solution about x = 0 would contain vanishing factors in the denominators of its coefficients; in case λ or μ be one of the positive integers 1, 2, ... (−λ₁), vanishing factors occur also in the numerators; and then, in fact, the second solution about x = 0 becomes x^λ1 times an integral polynomial of degree (−λ₁) − λ or of degree (−λ₁) − μ. But when λ₁ is a negative integer including zero, and neither λ nor μ is one of the positive integers 1, 2 ... (−λ₁), the second solution about x = 0 involves a term having the factor log x. When λ₁ is a positive integer, not zero, the second solution about x = 0 persists as a solution, in accordance with the order of arrangement of the roots of the index equation in our theory; the first solution is then replaced by an integral polynomial of degree -λ or −μ₁, when λ or μ is one of the negative integers 0, −1, −2, ..., 1 − λ₁, but otherwise contains a logarithm. Similarly for the solutions about x = 1 or x = ∞; it will be seen below how the results are deducible from those for x = 0.

Denote now the solutions about x = 0 by u₁, u₂; those about x = 1 by v₁, v₂; and those about x = ∞ by w₁, w₂; in the region (S₀S₁) common to the circles S₀, S₁ of radius 1 whose centres are the points x = 0, x = 1, all the first four are valid, March of the Integral. and there exist equations u₁ =Av₁ + Bv₂, u₂ = Cv₁ + Dv₂ where A, B, C, D are constants; in the region (S₁S) lying inside the circle S₁ and outside the circle S₀, those that are valid are v₁, v₂, w₁, w₂, and there exist equations v₁ = Pw₁ + Qw₂, v₂ = Rw₁ + Tw₂, where P, Q, R, T are constants; thus considering any integral whose expression within the circle S₀ is au₁ + bu₂, where a, b are constants, the same integral will be represented within the circle S₁ by (aA + bC)v₁ + (aB + bD)v₂, and outside these circles will be represented by

[aA + bC)P + (aB + bD)R]w₁ + [(aA + bC)Q + (aB + bD)T]w₂.

A single-valued branch of such integral can be obtained by making a barrier in the plane joining ∞ to 0 and 1 to ∞; for instance, by excluding the consideration of real negative values of x and of real positive values greater than 1, and defining the phase of x and x−1 for real values between 0 and 1 as respectively 0 and π.

We can form the Fuchsian equation of the second order with three arbitrary singular points ξ₁, ξ₂, ξ₃, and no singular point at x＝∞, and with respective indices α₁, β₁, α₂, β₂, α₃, β₃ such that α₁+β₁+α₂+β₂+α₃+β₃＝1. This equation can then be Transformation of the equation into itself.transformed into the hypergeometric equation in 24 ways; for out of ξ₁, ξ₂, ξ₃ we can in six ways choose two, say ξ₁, ξ₂, which are to be transformed respectively into 0 and 1, by (x−ξ₁)/(x−ξ₂)＝t(t−1); and then there are four possible transformations of the dependent variable which will reduce one of the indices at t＝0 to zero and one of the indices at t＝1 also to zero, namely, we may reduce either α₁ or β₁ at t＝0, and simultaneously either α₂ or β₂ at t＝1. Thus the hypergeometric equation itself can be transformed into itself in 24 ways, and from the expression F(λ, μ, 1−λ₁, x) which satisfies it follow 23 other forms of solution; they involve four series in each of the arguments, x, x−1, 1/x, 1/(1−x), (x−1)/x, x/(x−1). Five of the 23 solutions agree with the fundamental solutions already described about x＝0, x＝1, x＝∞; and from the principles by which these were obtained it is immediately clear that the 24 forms are, in value, equal in fours.

The quarter periods K, K′ of Jacobi’s theory of elliptic functions, of which K＝∫π/20 (1−h sin ²θ)^− 1/2dθ, and K′ is the same function of 1–h, can easily be proved to be the solutions of a hypergeometric equation of which h is the independent variable. When K, K′ are Inversion. Modular functions.regarded as defined in terms of h by the differential equation, the ratio K′/K is an infinitely many valued function of h. But it is remarkable that Jacobi’s own theory of theta functions leads to an expression for h in terms of K′/K (see Function) in terms of single-valued functions. We may then attempt to investigate, in general, in what cases the independent variable x of a hypergeometric equation is a single-valued function of the ratio ς of two independent integrals of the equation. The same inquiry is suggested by the problem of ascertaining in what cases the hypergeometric series F(α, β, γ, x) is the expansion of an algebraic (irrational) function of x. In order to explain the meaning of the question, suppose that the plane of x is divided along the real axis from −∞ to 0 and from 1 to +∞, and, supposing logarithms not to enter about x＝0, choose two quite definite integrals y₁, y₂ of the equation, say

y₁＝F(λ, μ, 1−λ₁, x), y₂＝x^λ₁ F(λ+λ₁, μ+λ₁, 1+λ₁, x),

with the condition that the phase of x is zero when x is real and between 0 and 1. Then the value of ς＝y₂/y₁ is definite for all values of x in the divided plane, ς being a single-valued monogenic branch of an analytical function existing and without singularities all over this region. If, now, the values of ς that so arise be plotted on to another plane, a value p+iq of ς being represented by a point (p, q) of this ς-plane, and the value of x from which it arose being mentally associated with this point of the σ-plane, these points will fill a connected region therein, with a continuous boundary formed of four portions corresponding to the two sides of the two barriers of the x-plane. The question is then, firstly, whether the same value of ς can arise for two different values of x, that is, whether the same point (p, q) of the ς-plane can arise twice, or in other words, whether the region of the ς-plane overlaps itself or not. Supposing this is not so, a second part of the question presents itself. If in the x-plane the barrier joining −∞ to 0 be momentarily removed, and x describe a small circle with centre at x＝0 starting from a point x＝−h−ik, where h, k are small, real, and positive and coming back to this point, the original value ς at this point will be changed to a value σ, which in the original case did not arise for this value of x, and possibly not at all. If, now, after restoring the barrier the values arising by continuation from σ be similarly plotted on the ς-plane, we shall again obtain a region which, while not overlapping itself, may quite possibly overlap the former region. In that case two values of x would arise for the same value or values of the quotient y₂/y₁, arising from two different branches of this quotient. We shall understand then, by the condition that x is to be a single-valued function of x, that the region in the ς-plane corresponding to any branch is not to overlap itself, and that no two of the regions corresponding to the different branches are to overlap. Now in describing the circle about x＝0 from x＝−h−ik to −h+ik, where h is small and k evanescent,

ς＝x^λ₁ F(λ＋λ₁, μ＋λ₁, 1+λ₁, x)/F(λ, μ, 1−λ₁, x)

is changed to σ＝ςe^2πiλ₁. Thus the two portions of boundary of the ς-region corresponding to the two sides of the barrier (−∞, 0) meet (at ς＝0 if the real part of λ₁ be positive) at an angle 2πL₁, where L₁ is the absolute value of the real part of λ₁; the same is true for the σ-region representing the branch σ. The condition that the ς-region shall not overlap itself requires, then, L₁＝1. But, further, we may form an infinite number of branches σ＝ςe^2πiλ₁, σ₁＝e^2πiλ₁, . . . in the same way, and the corresponding regions in the plane upon which y₂/y₁ is represented will have a common point and each have an angle 2πL₁; if neither overlaps the preceding, it will happen, if L₁ is not zero, that at length one is reached overlapping the first, unless for some positive integer α we have 2παL₁＝2π, in other words L₁＝1/α. If this be so, the branch σ_α−1＝ςe^2πiαλ₁ will be represented by a region having the angle at the common point common with the region for the branch ς; but not altogether coinciding with this last region unless λ₁ be real, and therefore＝±1/α; then there is only a finite number, α, of branches obtainable in this way by crossing the barrier (−∞, 0). In precisely the same way, if we had begun by taking the quotient

ς′＝(x−1)^λ₂ F(λ＋λ₂, μ＋λ₂, 1+λ₂, 1−x) /F(λ, μ, 1−λ₂, 1−x)

of the two solutions about x＝1, we should have found that x is not a single-valued function of ς′ unless λ₂ is the inverse of an integer, or is zero; as ς′ is of the form (Aς+B)/(Cς+D), A, B, C, D constants, the same is true in our case; equally, by considering the integrals about x＝∞ we find, as a third condition necessary in order that x may be a single-valued function of ς, that λ−μ must be the inverse of an integer or be zero. These three differences of the indices, namely, λ₁, λ₂, λ−μ, are the quantities which enter in the differential equation satisfied by x as a function of ς, which is easily found to be

−	x₁₁₁	+	3x²₁₁	＝1/2(h−h₁−h₂)x⁻¹(x−1)⁻¹+1/2h₁x⁻²+1/2h₂(x−1)⁻²,
	x₁³		2x₁⁴

where x₁＝dx/dς, &c.; and h₁＝1−y₁², h₂＝1−λ₂², h₃＝1−(λ−μ)². Into the converse question whether the three conditions are sufficient to ensure (1) that the ς region corresponding to any branch does not overlap itself, (2) that no two such regions overlap, we have no space to enter. The second question clearly requires the inquiry whether the group (that is, the monodromy group) of the differential equation is properly discontinuous. (See Groups, Theory of.)

The foregoing account will give an idea of the nature of the function theories of differential equations; it appears essential not to exclude some explanation of a theory intimately related both to such theories and to transformation theories, which is a generalization of Galois’s theory of algebraic equations. We deal only with the application to homogeneous linear differential equations.

In general a function of variables x₁, x₂ . . . is said to be rational when it can be formed from them and the integers 1, 2, 3, . . . by a finite number of additions, subtractions, multiplications and divisions. We generalize this definition. Assume that we have assigned a fundamental series of quantities and functions Rationality group of a linear equation.of x, in which x itself is included, such that all quantities formed by a finite number of additions, subtractions, multiplications, divisions and differentiations in regard to x, of the terms of this series, are themselves members of this series. Then the quantities of this series, and only these, are called rational. By a rational function of quantities p, q, r, . . . is meant a function formed from them and any of the fundamental rational quantities by a finite number of the five fundamental operations. Thus it is a function which would be called, simply, rational if the fundamental series were widened by the addition to it of the quantities p, q, r, . . . and those derivable from them by the five fundamental operations. A rational ordinary differential equation, with x as independent and y as dependent variable, is then one which equates to zero a rational function of y, the order k of the differential equation being that of the highest differential coefficient y^(k) which enters; only such equations are here discussed. Such an equation P＝0 is called irreducible when, firstly, being arranged as an integral polynomial in y^(k), this polynomial Irreducibility of a rational equation.is not the product of other polynomials in y^(k) also of rational form; and, secondly, the equation has no solution satisfying also a rational equation of lower order. From this it follows that if an irreducible equation P＝0 have one solution satisfying another rational equation Q＝0 of the same or higher order, then all the solutions of P＝0 also satisfy Q＝0. For from the equation P＝0 we can by differentiation express y^(k+1), y^(k+2), . . . in terms of x, y, y⁽¹⁾, . . . , y^(k), and so put the function Q rationally in terms of these quantities only. It is sufficient, then, to prove the result when the equation Q＝0 is of the same order as P＝0. Let both the equations be arranged as integral polynomials in y^(k); their algebraic eliminant in regard to y^(k) must then vanish identically, for they are known to have one common solution not satisfying an equation of lower order; thus the equation P＝0 involves Q＝0 for all solutions of P＝0.

Now let y⁽ⁿ⁾＝a₁y⁽ⁿ⁻¹⁾+ . . . +a_ny be a given rational homogeneous linear differential equation; let y₁, . . . y_n be n particular functions of x, unconnected by any equation with constant coefficients of the form c₁y₁+ . . . +c_ny_n＝0, all satisfying the differential equation; let η₁, . . . η_n be linear functions The variant function for a linear equation.of y₁, . . . y_n, say η_i＝A_i1y₁+ . . . +A_iny_n, where the constant coefficients A_ij have a non-vanishing determinant; write (η)＝A(y), these being the equations of a general linear homogeneous group whose transformations may be denoted by A, B, . . . . We desire to form a rational function φ(η), or say φ(A(y)), of η₁, . . . η, in which the η² constants A_ij shall all be essential, and not reduce effectively to a fewer number, as they would, for instance, if the y₁, . . . y_n were connected by a linear equation with constant coefficients. Such a function is in fact given, if the solutions y₁, . . . y_n be developable in positive integral powers about x = a, by φ(η) = η₁ + (x − a)ⁿ η₂ + . . . + (x − a)⁽ⁿ⁻¹⁾ⁿ η_n. Such a function, V, we call a variant.

Then differentiating V in regard to x, and replacing η_i⁽ⁿ⁾ by its value a₁η⁽ⁿ⁻¹⁾ + . . . + a_nη, we can arrange dV/dx, and similarly each of d ²/dx² . . . d ^NV/dx^N, where N = n², as a linear function of the N quantities η₁, . . . η_n, . . . η₁⁽ⁿ⁻¹⁾, . . . η_n⁽ⁿ⁻¹⁾, and thence by elimination obtain a linear differential equation The resolvent equation.for V of order N with rational coefficients. This we denote by F = 0. Further, each of η₁ . . . η_n is expressible as a linear function of V, dV/dx, . . . d ^N−1V / dx^N−1, with rational coefficients not involving any of the n² coefficients A_ij, since otherwise V would satisfy a linear equation of order less than N, which is impossible, as it involves (linearly) the n² arbitrary coefficients A_ij, which would not enter into the coefficients of the supposed equation. In particular, y₁,.. y_n are expressible rationally as linear functions of ω, dω/dx, . . . d ^N−1ω / dx^N−1, where ω is the particular function φ(y). Any solution W of the equation F = 0 is derivable from functions ζ₁, . . . ζ_n, which are linear functions of y₁, . . . y_n, just as V was derived from η₁, . . . η_n; but it does not follow that these functions ζ_i, . . . ζ_n are obtained from y₁, . . . y_n by a transformation of the linear group A, B, . . . ; for it may happen that the determinant d(ζ₁, . . . ζ_n) / (dy₁, . . . y_n) is zero. In that case ζ₁, . . . ζ_n may be called a singular set, and W a singular solution; it satisfies an equation of lower than the N-th order. But every solution V, W, ordinary or singular, of the equation F = 0, is expressible rationally in terms of ω, dω / dx, . . . d ^N−1ω / dx^N−1; we shall write, simply, V = r(ω). Consider now the rational irreducible equation of lowest order, not necessarily a linear equation, which is satisfied by ω; as y₁, . . . y_n are particular functions, it may quite well be of order less than N; we call it the resolvent equation, suppose it of order p, and denote it by γ(v). Upon it the whole theory turns. In the first place, as γ(v) = 0 is satisfied by the solution ω of F = 0, all the solutions of γ(v) are solutions F = 0, and are therefore rationally expressible by ω; any one may then be denoted by r(ω). If this solution of F = 0 be not singular, it corresponds to a transformation A of the linear group (A, B, ...), effected upon y₁, . . . y_n. The coefficients A_ij of this transformation follow from the expressions before mentioned for η₁ . . . η_n in terms of V, dV/dx, d ²V/dx², . . . by substituting V = r(ω); thus they depend on the p arbitrary parameters which enter into the general expression for the integral of the equation γ(v) = 0. Without going into further details, it is then clear enough that the resolvent equation, being irreducible and such that any solution is expressible rationally, with p parameters, in terms of the solution ω, enables us to define a linear homogeneous group of transformations of y₁ . . . y_n depending on p parameters; and every operation of this (continuous) group corresponds to a rational transformation of the solution of the resolvent equation. This is the group called the rationality group, or the group of transformations of the original homogeneous linear differential equation.

The group must not be confounded with a subgroup of itself, the monodromy group of the equation, often called simply the group of the equation, which is a set of transformations, not depending on arbitrary variable parameters, arising for one particular fundamental set of solutions of the linear equation (see Groups, Theory of).

The importance of the rationality group consists in three propositions. (1) Any rational function of y₁, . . . y_n which is unaltered in value by the transformations of the group can be written in rational form. (2) If any rational function be changed in form, becoming a rational function of y₁, . . . y_n, a The fundamental theorem in regard to the rationality group.transformation of the group applied to its new form will leave its value unaltered. (3) Any homogeneous linear transformation leaving unaltered the value of every rational function of y₁, . . . y_n which has a rational value, belongs to the group. It follows from these that any group of linear homogeneous transformations having the properties (1) (2) is identical with the group in question. It is clear that with these properties the group must be of the greatest importance in attempting to discover what functions of x must be regarded as rational in order that the values of y₁ . . . y_n may be expressed. And this is the problem of solving the equation from another point of view.

Literature.—(α) Formal or Transformation Theories for Equations of the First Order:—E. Goursat, Leçons sur l’intégration des équations aux dérivées partielles du premier ordre (Paris, 1891); E. v. Weber, Vorlesungen über das Pfaff’sche Problem und die Theorie der partiellen Differentialgleichungen erster Ordnung (Leipzig, 1900); S. Lie und G. Scheffers, Geometrie der Berührungstransformationen, Bd. i. (Leipzig, 1896); Forsyth, Theory of Differential Equations, Part i., Exact Equations and Pfaff’s Problem (Cambridge, 1890); S. Lie, “Allgemeine Untersuchungen über Differentialgleichungen, die eine continuirliche endliche Gruppe gestatten” (Memoir), Mathem. Annal.xxv. (1885), pp. 71-151; S. Lie und G. Scheffers, Vorlesungen über Differentialgleichungen mit bekannten infinitesimalen Transformationen (Leipzig, 1891). A very full bibliography is given in the book of E. v. Weber referred to; those here named are perhaps sufficiently representative of modern works. Of classical works may be named: Jacobi, Vorlesungen über Dynamik (von A. Clebsch, Berlin, 1866); Werke, Supplementband; G Monge, Application de l’analyse à la géométrie (par M. Liouville, Paris, 1850); J. L. Lagrange, Leçons sur le calcul des fonctions (Paris, 1806), and Théorie des fonctions analytiques (Paris, Prairial, an V); G. Boole, A Treatise on Differential Equations (London, 1859); and Supplementary Volume (London, 1865); Darboux, Leçons sur la théorie générale des surfaces, tt. i.-iv. (Paris, 1887–1896); S. Lie, Théorie der transformationsgruppen ii. (on Contact Transformations) (Leipzig, 1890).

(β) Quantitative or Function Theories for Linear Equations:—C. Jordan, Cours d’analyse, t. iii. (Paris, 1896); E. Picard, Traité d’analyse, tt. ii. and iii. (Paris, 1893, 1896); Fuchs, Various Memoirs, beginning with that in Crelle’s Journal, Bd. lxvi. p. 121; Riemann, Werke, 2^r Aufl. (1892); Schlesinger, Handbuch der Theorie der linearen Differentialgleichungen, Bde. i.-ii. (Leipzig, 1895–1898); Heffter, Einleitung in die Theorie der linearen Differentialgleichungen mit einer unabhängigen Variablen (Leipzig, 1894); Klein, Vorlesungen über lineare Differentialgleichungen der zweiten Ordnung (Autographed, Göttingen, 1894); and Vorlesungen über die hypergeometrische Function (Autographed, Göttingen, 1894); Forsyth, Theory of Differential Equations, Linear Equations.

(γ) Rationality Group (of Linear Differential Equations):—Picard, Traité d’Analyse, as above, t. iii.; Vessiot, Annales de l’École Normale, série III. t. ix. p. 199 (Memoir); S. Lie, Transformationsgruppen, as above, iii. A connected account is given in Schlesinger, as above, Bd. ii., erstes Theil.

(δ) Function Theories of Non-Linear Ordinary Equations:—Painlevé, Leçons sur la théorie analytique des équations différentielles (Paris, 1897, Autographed); Forsyth, Theory of Differential Equations, Part ii., Ordinary Equations not Linear (two volumes, ii. and iii.) (Cambridge, 1900); Königsberger, Lehrbuch der Theorie der Differentialgleichungen (Leipzig, 1889); Painlevé, Leçons sur l’intégration des équations différentielles de la mécanique et applications (Paris, 1895).

(ε) Formal Theories of Partial Equations of the Second and Higher Orders:—E. Goursat, Leçons sur l’intégration des équations aux dérivées partielles du second ordre, tt. i. and ii. (Paris, 1896, 1898); Forsyth, Treatise on Differential Equations (London, 1889); and Phil. Trans. Roy. Soc. (A.), vol. cxci. (1898), pp. 1-86.

(ζ) See also the six extensive articles in the second volume of the German Encyclopaedia of Mathematics. (H. F. Ba.)

1911 Encyclopædia Britannica/Differential Equation

Navigation menu

Search