1911 Encyclopædia Britannica/Algebraic Forms
ALGEBRAIC FORMS. The subject-matter of algebraic forms is to a large extent connected with the linear transformation of algebraical polynomials which involve two or more variables. The theories of determinants and of symmetric functions and of the algebra of differential operations have an important bearing upon this comparatively new branch of mathematics. They are the chief instruments of research, and have themselves much benefited by being so employed. When a homogeneous polynomial is transformed by general linear substitutions as hereafter explained, and is then expressed in the original form with new coefficients affecting the new variables, certain functions of the new coefficients and variables are numerical multiples of the same functions of the original coefficients and variables. The investigation of the properties of these functions, as well for a single form as for a simultaneous set of forms, and as well for one as for many series of variables, is included in the theory of invariants. As far back as 1773 Joseph Louis Lagrange, and later Carl Friedrich Gauss, had met with simple cases of such functions; George Boole, in 1841 (Camb. Math. Journ. iii. pp. 1-20), made important steps, but it was not till 1845 that Arthur Cayley (Coll. Math. Papers, i. pp. 80-94, 95-112) showed by his calculus of hyper-determinants that an infinite series of such functions might be obtained systematically. The subject was carried on over a long series of years by himself, J. J. Sylvester, G. Salmon, L. O. Hesse, S. H. Aronhold, C. Hermite, Francesco Brioschi, R. F. A. Clebsch, P. Gordan, &c. The year 1868 saw a considerable enlargement of the field of operations. This arose from the study by Felix Klein and Sophus Lie of a new theory of groups of substitutions; it was shown that there exists an invariant theory connected with every group of linear substitutions.
The invariant theory then existing was classified by them as appertaining to “finite continuous groups.” Other “Galois” groups were defined whose substitution coefficients have fixed numerical values, and are particularly associated with the theory of equations. Arithmetical groups, connected with the theory of quadratic forms and other branches of the theory of numbers, which are termed “discontinuous,” and infinite groups connected with differential forms and equations, came into existence, and also particular linear and higher transformations connected with analysis and geometry. The effect of this was to co-ordinate many branches of mathematics and greatly to increase the number of workers. The subject of transformation in general has been treated by Sophus Lie in the classical work Theorie der Transformationsgruppen. The present article is merely concerned with algebraical linear transformation. Two methods of treatment have been carried on in parallel lines, the unsymbolic and the symbolic; both of these originated with Cayley, but he with Sylvester and the English school have in the main confined themselves to the former, whilst Aronhold, Clebsch, Gordan, and the continental schools have principally restricted themselves to the latter. The two methods have been conducted so as to be in constant touch, though the nature of the results obtained by the one differs much from those which flow naturally from the other. Each has been singularly successful in discovering new lines of advance and in encouraging the other to renewed efforts. P. Gordan first proved that for any system of forms there exists a finite number of covariants, in terms of which all others are expressible as rational and integral functions. This enabled David Hilbert to produce a very simple unsymbolic proof of the same theorem. So the theory of the forms appertaining to a binary form of unrestricted order was first worked out by Cayley and P. A. MacMahon by unsymbolic methods, and later G. E. 
Stroh, from a knowledge of the results, was able to verify and extend the results by the symbolic method. The partition method of treating symmetrical algebra is one which has been singularly successful in indicating new paths of advance in the theory of invariants; the important theorem of expressibility is, directly we exclude unity from the partitions, a theorem concerning the expressibility of covariants, and involves the theory of the reducible forms and of the syzygies. The theory brought forward has not yet found a place in any systematic treatise in any language, so that it has been judged proper to give a fairly complete account of it.^{[1]}
I. The Theory Of Determinants.^{[1]}
Let there be given n^{2} quantities
a_{11} | a_{12} | a_{13} | ... | a_{1n} |
a_{21} | a_{22} | a_{23} | ... | a_{2n} |
a_{31} | a_{32} | a_{33} | ... | a_{3n} |
. | . | . | ... | . |
a_{n1} | a_{n2} | a_{n3} | ... | a_{nn} |
and form from them a product of n quantities
a_{1α} a_{2β} a_{3γ} ... a_{nν},
where the first suffixes are the natural numbers 1, 2, 3, ...n taken in order, and α, β, γ, ...ν is some permutation of these n numbers. This permutation by a transposition of two numbers, say α, β, becomes β, α, γ, ... ν, and by successively transposing pairs of letters the permutation can be reduced to the form 1, 2, 3, ... n. Let k such transpositions be necessary; then the expression
Σ(−)^{k}a_{1α}a_{2β}a_{3γ}...a_{nν},
the summation being for all permutations of the n numbers, is called the determinant of the n^{2} quantities. The quantities a_{1α}, a_{2β} ... are called the elements of the determinant; the term (−)^{k}a_{1α}a_{2β}a_{3γ}...a_{nν} is called a member of the determinant, and there are evidently n! members corresponding to the n! permutations of the n numbers 1, 2, 3, ... n. The determinant is usually written
Δ = | a_{11}  a_{12}  a_{13}  ...  a_{1n} |
    | a_{21}  a_{22}  a_{23}  ...  a_{2n} |
    | a_{31}  a_{32}  a_{33}  ...  a_{3n} |
    |   .       .       .     ...    .    |
    | a_{n1}  a_{n2}  a_{n3}  ...  a_{nn} |
the square array being termed the matrix of the determinant. A matrix has in many parts of mathematics a signification apart from its evaluation as a determinant. A theory of matrices has been constructed by Cayley in connexion particularly with the theory of linear transformation. The matrix consists of n rows and n columns. Each row as well as each column supplies one and only one element to each member of the determinant. Consideration of the definition of the determinant shows that the value is unaltered when the suffixes in each element are transposed.
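The definition by permutations and transpositions may be illustrated by a short Python sketch. The function names are illustrative assumptions and not part of the article, and suffixes run from 0 instead of 1.

```python
from itertools import permutations

def transpositions_to_sort(perm):
    """Count transpositions used to reduce perm to the natural order 0, 1, 2, ..."""
    p = list(perm)
    k = 0
    for i in range(len(p)):
        while p[i] != i:
            j = p[i]
            p[i], p[j] = p[j], p[i]  # one transposition puts the value j in place j
            k += 1
    return k

def leibniz_det(a):
    """Sum of the n! members (-1)^k a[0][p0] a[1][p1] ... over all permutations p."""
    n = len(a)
    total = 0
    for perm in permutations(range(n)):
        member = (-1) ** transpositions_to_sort(perm)
        for row, col in enumerate(perm):
            member *= a[row][col]
        total += member
    return total
```

The number k of transpositions is not unique, but its parity is, so the sign of each member is well defined. The cost grows as n!, so the sketch is only suited to small orders.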
Theorem.—If the determinant is transformed so as to read by columns as it formerly did by rows its value is unchanged. The leading member of the determinant is a_{11}a_{22}a_{33}...a_{nn}, and corresponds to the principal diagonal of the matrix.
We write frequently
Δ = Σ ± a_{11}a_{22}a_{33}...a_{nn} = (a_{11}a_{22}a_{33}...a_{nn}).
If the first two columns of the determinant be transposed the expression for the determinant becomes Σ(−)^{k}a_{1β}a_{2α}a_{3γ}...a_{nν}, viz. α and β are transposed, and it is clear that the number of transpositions necessary to convert the permutation βαγ...ν of the second suffixes to the natural order is changed by unity. Hence the transposition of columns merely changes the sign of the determinant. Similarly it is shown that the transposition of any two columns or of any two rows merely changes the sign of the determinant.
Theorem—Interchange of any two rows or of any two columns merely changes the sign of the determinant.
Corollary—If any two rows or any two columns of a determinant be identical the value of the determinant is zero.
Minors of a Determinant—From the value of Δ we may separate those members which contain a particular element a_{ik} as a factor, and write the portion a_{ik}A_{ik}; A_{ik}, the cofactor of a_{ik}, is called a minor of order n−1 of the determinant.
Now a_{11}A_{11} = Σ ± a_{11}a_{22}a_{33}...a_{nn}, wherein a_{11} is not to be changed, but the second suffixes in the product a_{22}a_{33}...a_{nn} assume all permutations, the number of transpositions necessary determining the sign to be affixed to the member.
Hence a_{11}A_{11} = a_{11} Σ ± a_{22}a_{33}...a_{nn}, where the cofactor of a_{11} is clearly the determinant obtained by erasing the first row and the first column.
Hence
A_{11} = | a_{22}  a_{23}  ...  a_{2n} |
         | a_{32}  a_{33}  ...  a_{3n} |
         |   .       .     ...    .    |
         | a_{n2}  a_{n3}  ...  a_{nn} |
Similarly A_{ik}, the cofactor of a_{ik}, is shown to be the product of (−)^{i+k} and the determinant obtained by erasing from Δ the i^{th} row and k^{th} column. No member of a determinant can involve more than one element from the first row. Hence we have the development
Δ = a_{11}A_{11} +a_{12}A_{12} +a_{13}A_{13}+...+a_{1n}A_{1n}, proceeding according to the elements of the first row and the corresponding minors.
Similarly we have a development proceeding according to the elements contained in any row or in any column, viz.
Δ = a_{i1}A_{i1} + a_{i2}A_{i2} + a_{i3}A_{i3} + ... + a_{in}A_{in},     (A)
Δ = a_{1k}A_{1k} + a_{2k}A_{2k} + a_{3k}A_{3k} + ... + a_{nk}A_{nk}.
This theory enables the evaluation of a determinant by successive reduction of the orders of the determinants involved.
Ex. gr.
| 1   0  3 |
| 2   1  6 |
| 0  −5  3 |
= 1 · |  1  6 |  + 3 · | 2   1 |
      | −5  3 |        | 0  −5 |
= 3 + 30 − 30 − 0 = 3.
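The development by the elements of a row can be sketched as a recursive routine. This is a minimal illustration with assumed names (`minor_matrix`, `cofactor`, `develop`) and 0-based suffixes, not a method prescribed by the article.

```python
def minor_matrix(a, i, k):
    """Erase row i and column k (0-based) from the square array a."""
    return [row[:k] + row[k + 1:] for r, row in enumerate(a) if r != i]

def det(a):
    """Develop along the first row, reducing the order at each step."""
    if len(a) == 1:
        return a[0][0]
    return sum(a[0][k] * cofactor(a, 0, k) for k in range(len(a)))

def cofactor(a, i, k):
    """A_ik: (-1)^(i+k) times the determinant left after erasing row i, column k."""
    return (-1) ** (i + k) * det(minor_matrix(a, i, k))

def develop(a, i, j):
    """Sum over k of a[i][k] * A_jk: the determinant when i == j."""
    return sum(a[i][k] * cofactor(a, j, k) for k in range(len(a)))

example = [[1, 0, 3], [2, 1, 6], [0, -5, 3]]
```

Calling `develop(example, i, i)` for any row i reproduces the value 3 found in the worked example, whereas pairing the elements of one row with the cofactors of a different row gives zero.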
Since the determinant
| a_{21}  a_{22}  a_{23}  ...  a_{2n} |
| a_{21}  a_{22}  a_{23}  ...  a_{2n} |
| a_{31}  a_{32}  a_{33}  ...  a_{3n} |
|   .       .       .     ...    .    |
| a_{n1}  a_{n2}  a_{n3}  ...  a_{nn} |,
having two identical rows,
vanishes identically; we have by development according to the elements of the first row
a_{21}A_{11}+a_{22}A_{12}+a_{23}A_{13}+...a_{2n}A_{1n}=0;
and, in general, since
a_{i1}A_{i1}+a_{i2}A_{i2}+a_{i3}A_{i3}+...a_{in}A_{in}=Δ,
if we suppose the i^{th} and k^{th} rows identical
a_{k1}A_{i1}+a_{k2}A_{i2}+a_{k3}A_{i3}+...a_{kn}A_{in}=0 (k ≷ i);
and proceeding by columns instead of rows,
a_{1i}A_{1k}+a_{2i}A_{2k}+a_{3i}A_{3k}+...a_{ni}A_{nk}=0 (k ≷ i)
identical relations always satisfied by these minors.
If in the first relation of (A) we write a_{is} = b_{is}+c_{is}+d_{is}+... we find that Σa_{is}A_{is} = Σb_{is}A_{is} + Σc_{is}A_{is} + Σd_{is}A_{is} +... so that Δ breaks up into a sum of determinants, and we also obtain a theorem for the addition of determinants which have n – 1 rows in common. If we multiply the elements of the second row by an arbitrary magnitude λ, and add to the corresponding elements of the first row, Δ becomes Σa_{1s}A_{1s} + λΣa_{2s}A_{1s} = Δ, showing that the value of the determinant is unchanged. In general we can prove in the same way the—
Theorem.—The value of a determinant is unchanged if we add to the elements of any row or column the corresponding elements of the other rows or other columns respectively each multiplied by an arbitrary magnitude, such magnitude remaining constant in respect of the elements in a particular row or a particular column.
Observation.—Every factor common to all the elements of a row or of a column is obviously a factor of the determinant, and may be taken outside the determinant brackets.
Ex. gr.
| α²  β²  γ² |   | α²  β² − α²  γ² − α² |
| α   β   γ  | = | α   β − α    γ − α   |
| 1   1   1  |   | 1   0        0       |
= (β − α)(γ − α) | β + α  γ + α | = (β − α)(γ − α) | β − γ  γ + α |
                 | 1      1     |                  | 0      1     |
= (β − α)(γ − α)(β − γ).
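The example can be checked numerically. The sketch below assumes integer values for α, β, γ and uses illustrative function names that are not part of the article.

```python
def det3(m):
    """Determinant of a 3 x 3 array by development along the first row."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def alternant(al, be, ga):
    """The determinant of the example, rows (squares), (first powers), (units)."""
    return det3([[al**2, be**2, ga**2], [al, be, ga], [1, 1, 1]])

def after_column_subtraction(al, be, ga):
    """Same determinant after subtracting the first column from the other two."""
    return det3([[al**2, be**2 - al**2, ga**2 - al**2],
                 [al, be - al, ga - al],
                 [1, 0, 0]])

def product_form(al, be, ga):
    """The factored value obtained in the example."""
    return (be - al) * (ga - al) * (be - ga)
```

For α, β, γ = 1, 2, 3 all three functions return −2, in agreement with the theorem that adding multiples of one column to another leaves the value unchanged.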
The minor A_{ik} is ∂Δ/∂a_{ik}, and is itself a determinant of order n−1. We may therefore differentiate again in regard to any element a_{rs} where r≷i, s≷k; we will thus obtain a minor of A_{ik}, which is a minor also of Δ of order n−2. It will be denoted A_{ik;rs}, where
A_{ik;rs} = ∂A_{ik}/∂a_{rs} = ∂²Δ/∂a_{ik}∂a_{rs},
and will be obtained by erasing from the determinant A_{ik} the row and column containing the element a_{rs}; this was originally the r^{th} row and the s^{th} column of Δ; the r^{th} row of Δ is the r^{th} or (r–1)^{th} row of A_{ik} according as r≷i and the s^{th} column of Δ is the s^{th} or (s−1)^{th} column of A_{ik} according as s≷k. Hence, if T_{ri} denote the number of transpositions necessary to bring the succession ri into ascending order of magnitude, the sign to be attached to the determinant arrived at by erasing the i^{th} and r^{th} rows and the k^{th} and s^{th} columns from Δ in order to produce A_{ik;rs} will be −1 raised to the power of T_{ri}+T_{ks}+i+k+r+s.
Similarly proceeding to the minors of order n−3, we find that
A_{ik;rs;tu} = ∂A_{ik;rs}/∂a_{tu} = ∂²A_{ik}/∂a_{rs}∂a_{tu} = ∂³Δ/∂a_{ik}∂a_{rs}∂a_{tu}
is obtained from Δ by erasing the i^{th}, r^{th}, t^{th} rows, the k^{th}, s^{th}, u^{th} columns, and multiplying the resulting determinant by −1 raised to the power T_{tri}+T_{usk}+i+k+r+s+t+u, and the general law is clear.
Corresponding Minors.—In obtaining the minor A_{ik;rs} (the minor of order n−2 with the two pairs of suffixes ik and rs) in the form of a determinant we erased certain rows and columns, and we would have erased in an exactly similar manner had we been forming the determinant associated with A_{is;rk}, since the deleting lines intersect in two pairs of points. The two minors differ only in sign,
A_{ik;rs} = −A_{is;rk}.
Moreover
a_{ik}a_{rs}A_{ik;rs} + a_{is}a_{rk}A_{is;rk} = | a_{ik}  a_{is} | A_{ik;rs},
                                                | a_{rk}  a_{rs} |
where the determinant factor is formed from the four elements in which the deleting lines intersect. This determinant and that associated with A_{ik;rs} are termed corresponding determinants.
Similarly p lines of deletion intersecting in p² points yield corresponding determinants of orders p and n−p respectively. Recalling the formula
Δ=a_{11}A_{11}+a_{12}A_{12}+a_{13}A_{13}+...+a_{1n}A_{1n},
it will be seen that a_{1k} and A_{1k} involve corresponding determinants. Since A_{1k} is a determinant we similarly obtain
A_{1k} = a_{21}A_{1k;21} + ... + a_{2,k−1}A_{1k;2,k−1} + a_{2,k+1}A_{1k;2,k+1} + ... + a_{2n}A_{1k;2n},
and thence
Δ = Σ_{i,k} a_{1i}a_{2k}A_{1i;2k}, where i ≷ k;
and as before
Δ = Σ_{i,k} | a_{1i}  a_{2i} | A_{1i;2k},   i > k,
            | a_{1k}  a_{2k} |
an important expansion of Δ.
Similarly
Δ = Σ_{i,k,r} | a_{1i}  a_{2i}  a_{3i} |
              | a_{1k}  a_{2k}  a_{3k} | A_{1i;2k;3r},   i > k > r,
              | a_{1r}  a_{2r}  a_{3r} |
and the general theorem is manifest, and yields a development in a sum of products of corresponding determinants. If the j^{th} column be identical with the i^{th} the determinant Δ vanishes identically; hence if j be not equal to i, k, or r,
0 = Σ | a_{1j}  a_{2j}  a_{3j} |
      | a_{1k}  a_{2k}  a_{3k} | A_{1i;2k;3r}.
      | a_{1r}  a_{2r}  a_{3r} |
Similarly, by putting one or more of the deleted rows or columns equal to rows or columns which are not deleted, we obtain, with Laplace, a number of identities between products of determinants of complementary orders.
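The development in products of corresponding determinants can be sketched for the case of the first two rows. The sketch is written in the usual modern form of Laplace's expansion, with the sign of each product carried explicitly instead of being absorbed into the second-order minor A_{1i;2k}. The function names and 0-based suffixes are assumptions for illustration.

```python
from itertools import combinations

def det(a):
    """Development along the first row; the empty array has determinant 1."""
    if not a:
        return 1
    return sum((-1) ** k * a[0][k] * det([r[:k] + r[k + 1:] for r in a[1:]])
               for k in range(len(a)))

def laplace_two_rows(a):
    """Sum over column pairs k < i of (2 x 2 determinant) * (complementary determinant)."""
    n = len(a)
    total = 0
    for k, i in combinations(range(n), 2):
        two_by_two = a[0][k] * a[1][i] - a[0][i] * a[1][k]
        rest = [c for c in range(n) if c not in (k, i)]
        complement = [[row[c] for c in rest] for row in a[2:]]
        sign = (-1) ** (0 + 1 + k + i)  # rows 0, 1 and columns k, i
        total += sign * two_by_two * det(complement)
    return total
```

Each term pairs a determinant of order 2, taken from the two deleted rows, with the complementary determinant of order n−2 taken from the remaining rows and columns.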
Multiplication.—From the theorem given above for the expansion of a determinant as a sum of products of pairs of corresponding determinants it will be plain that the product of Δ = (a_{11}, a_{22}, ... a_{nn}) and D = (b_{11}, b_{22}, ... b_{nn}) may be written as a determinant of order 2n, viz.
| a_{11}  a_{21}  a_{31}  ...  a_{n1}   −1    0    0   ...   0   |
| a_{12}  a_{22}  a_{32}  ...  a_{n2}    0   −1    0   ...   0   |
| a_{13}  a_{23}  a_{33}  ...  a_{n3}    0    0   −1   ...   0   |
|   .       .       .     ...    .       .    .    .   ...   .   |
| a_{1n}  a_{2n}  a_{3n}  ...  a_{nn}    0    0    0   ...  −1   |
|   0       0       0     ...    0    b_{11}  b_{12}  b_{13}  ...  b_{1n} |
|   0       0       0     ...    0    b_{21}  b_{22}  b_{23}  ...  b_{2n} |
|   0       0       0     ...    0    b_{31}  b_{32}  b_{33}  ...  b_{3n} |
|   .       .       .     ...    .       .       .       .    ...    .    |
|   0       0       0     ...    0    b_{n1}  b_{n2}  b_{n3}  ...  b_{nn} |
Multiply the 1^{st}, 2^{nd} ... n^{th} rows by b_{11}, b_{12}, ... b_{1n} respectively, and add to the (n+1)^{th} row; by b_{21}, b_{22} ... b_{2n}, and add to the (n+2)^{th} row; by b_{31}, b_{32} ... b_{3n} and add to the (n+3)^{rd} row, &c. C then becomes
a_{11}b_{11}+a_{12}b_{12}+...+a_{1n}b_{1n}, a_{21}b_{11}+a_{22}b_{12}+...+a_{2n}b_{1n}, ...a_{n1}b_{11}+a_{n2}b_{12}+...+a_{nn}b_{1n} |
a_{11}b_{21}+a_{12}b_{22}+...+a_{1n}b_{2n}, a_{21}b_{21}+a_{22}b_{22}+...+a_{2n}b_{2n}, ...a_{n1}b_{21}+a_{n2}b_{22}+...+a_{nn}b_{2n} |
a_{11}b_{31}+a_{12}b_{32}+...+a_{1n}b_{3n}, a_{21}b_{31}+a_{22}b_{32}+...+a_{2n}b_{3n}, ...a_{n1}b_{31}+a_{n2}b_{32}+...+a_{nn}b_{3n} |
........ |
a_{11}b_{n1}+a_{12}b_{n2}+...+a_{1n}b_{nn}, a_{21}b_{n1}+a_{22}b_{n2}+...+a_{2n}b_{nn}, ...a_{n1}b_{n1}+a_{n2}b_{n2}+...+a_{nn}b_{nn} |
and all the elements of D become zero. Now by the expansion theorem the determinant becomes
(−)^{1+2+3+..+2n}B.C = (−1)^{n(2n+1)+n}C=C.
We thus obtain for the product a determinant of order n. We may say that, in the resulting determinant, the element in the i^{th} row and k^{th} column is obtained by multiplying the elements in the k^{th} row of the first determinant severally by the elements in the i^{th} row of the second, and has the expression
a_{k1}b_{i1}+a_{k2}b_{i2}+a_{k3}b_{i3}... +a_{kn}b_{in},
and we obtain other expressions by transforming either or both determinants so as to read by columns as they formerly did by rows.
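The rule for the element in the i^{th} row and k^{th} column of the product can be sketched as follows. The names `product_matrix` and `det` are illustrative assumptions, and suffixes are 0-based.

```python
def det(a):
    """Development along the first row."""
    if len(a) == 1:
        return a[0][0]
    return sum((-1) ** k * a[0][k] * det([r[:k] + r[k + 1:] for r in a[1:]])
               for k in range(len(a)))

def product_matrix(a, b):
    """Element (i, k) is a[k][0]*b[i][0] + a[k][1]*b[i][1] + ..., rows into rows."""
    n = len(a)
    return [[sum(a[k][j] * b[i][j] for j in range(n)) for k in range(n)]
            for i in range(n)]

a = [[1, 2], [3, 4]]   # determinant -2
b = [[0, 1], [5, 2]]   # determinant -5
c = product_matrix(a, b)
```

Here `c` is `[[2, 4], [9, 23]]`, whose determinant 10 equals the product of the two determinants. Multiplying rows into columns, or columns into columns, gives other arrays with the same determinant.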
Remark.—In particular the square of a determinant is a determinant of the same order (b_{11}b_{22}b_{33} ...b_{nn}) such that b_{ik} = b_{ki}; it is for this reason termed symmetrical.
The Adjoint or Reciprocal Determinant arises from Δ = (a_{11}a_{22}a_{33} ...a_{nn}) by substituting for each element a_{ik} the corresponding minor A_{ik} so as to form D = (A_{11}A_{22}A_{33} ...A_{nn}). If we form the product Δ.D by the theorem for the multiplication of determinants we find that the element in the i^{th} row and k^{th} column of the product is
a_{k1}A_{i1}+a_{k2}A_{i2}+...+a_{kn}A_{in},
the value of which is zero when k is different from i, whilst it has the value Δ when k=i. Hence the product determinant has the principal diagonal elements each equal to Δ and the remaining elements zero. Its value is therefore Δ^{n} and we have the identity
D.Δ = Δ^{n} or D=Δ^{n–1}.
It can now be proved that the first minor of the adjoint determinant, say B_{rs}, is equal to Δ^{n–2}a_{rs}.
From the equations
a_{11}x_{1}+ a_{12}x_{2}+ a_{13}x_{3} +... = ξ_{1} ,
a_{21}x_{1}+ a_{22}x_{2}+ a_{23}x_{3} +... = ξ_{2} ,
a_{31}x_{1}+ a_{32}x_{2}+ a_{33}x_{3} +... = ξ_{3} ,
we derive.......
Δx_{1} = A_{11}ξ_{1}+A_{21}ξ_{2}+A_{31}ξ_{3}+... ,
Δx_{2} = A_{12}ξ_{1}+A_{22}ξ_{2}+A_{32}ξ_{3}+... ,
Δx_{3} = A_{13}ξ_{1}+A_{23}ξ_{2}+A_{33}ξ_{3}+... ,
and thence.......
Δ^{n–1}ξ_{1}=B_{11}Δx_{1}+B_{12}Δx_{2}+B_{13}Δx_{3}+... ,
Δ^{n–1}ξ_{2}=B_{21}Δx_{1}+B_{22}Δx_{2}+B_{23}Δx_{3}+... ,
Δ^{n–1}ξ_{3}=B_{31}Δx_{1}+B_{32}Δx_{2}+B_{33}Δx_{3}+... ,
.......
and comparison of the first and third systems yields
B_{rs} = Δ^{n–2}a_{rs}.
In general it can be proved that any minor of order p of the adjoint is equal to the complementary of the corresponding minor of the original multiplied by the (p – 1)^{th} power of the original determinant.
Theorem.—The adjoint determinant is the (n – 1)^{th} power of the original determinant. The adjoint determinant will be seen subsequently to present itself in the theory of linear equations and in the theory of linear transformation.
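Both results on the adjoint can be verified on a small numerical case. The sketch below uses assumed helper names and 0-based suffixes; it is an illustration, not a proof.

```python
def det(a):
    """Development along the first row."""
    if len(a) == 1:
        return a[0][0]
    return sum(a[0][k] * cofactor(a, 0, k) for k in range(len(a)))

def cofactor(a, i, k):
    """Signed minor A_ik obtained by erasing row i and column k."""
    reduced = [row[:k] + row[k + 1:] for r, row in enumerate(a) if r != i]
    return (-1) ** (i + k) * det(reduced)

def adjoint(a):
    """Array whose element (i, k) is the minor A_ik of a."""
    n = len(a)
    return [[cofactor(a, i, k) for k in range(n)] for i in range(n)]

a = [[1, 0, 3], [2, 1, 6], [0, -5, 3]]   # determinant 3, order n = 3
d = adjoint(a)
```

For this array `det(d)` is 9, the square of 3, and each first minor `cofactor(d, r, s)` equals 3 times `a[r][s]`, as the relation B_{rs} = Δ^{n–2}a_{rs} requires when n = 3.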
Determinants of Special Forms.—It was observed above that the square of a determinant when expressed as a determinant of the same order is such that its elements have the property expressed by a_{ik} = a_{ki}. Such determinants are called symmetrical. It is easy to see that the adjoint determinant is also symmetrical, viz. such that A_{ik} = A_{ki}, for the determinant got by suppressing the i^{th} row and k^{th} column differs only by an interchange of rows and columns from that got by suppressing the k^{th} row and i^{th} column. If any symmetrical determinant vanish and be bordered as shown below
a_{11} | a_{12} | a_{13} | λ_{1} |
a_{21} | a_{22} | a_{23} | λ_{2} |
a_{31} | a_{32} | a_{33} | λ_{3} |
λ_{1} | λ_{2} | λ_{3} | . |
it is a perfect square when considered as a function of λ_{1}, λ_{2}, λ_{3}. For since A_{11}A_{22} − A_{12}^{2} = Δa_{33}, with similar relations, we have a number of relations similar to A_{11}A_{22} = A_{12}^{2}, and either A_{rs} = +√(A_{rr}A_{ss}) or −√(A_{rr}A_{ss}) for all different values of r and s. Now the determinant has the value
−{λ_{1}^{2}A_{11} + λ_{2}^{2}A_{22} + λ_{3}^{2}A_{33} + 2λ_{2}λ_{3}A_{23} + 2λ_{3}λ_{1}A_{31} + 2λ_{1}λ_{2}A_{12}}
= −Σλ_{r}^{2}A_{rr} − 2Σλ_{r}λ_{s}A_{rs} in general, and hence by substitution
−{±λ_{1}√A_{11} ± λ_{2}√A_{22} ± ... ± λ_{n}√A_{nn}}^{2}.
A skew symmetric determinant has a_{rr} = 0 and a_{rs} = – a_{sr} for all values of r and s. Such a determinant when of uneven degree vanishes, for if we multiply each row by –1 we multiply the determinant by (–1)^{n} = –1, and the effect of this is otherwise merely to transpose the determinant so that it reads by rows as it formerly did by columns, an operation which we know leaves the determinant unaltered. Hence Δ = –Δ or Δ = 0. When a skew symmetric determinant is of even degree it is a perfect square. This theorem is due to Cayley, and reference may be made to Salmon’s Higher Algebra, 4th ed. Art. 39. In the case of the determinant of order 4 the square root is
a_{12}a_{34} – a_{13}a_{24} + a_{14}a_{23}.
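Both statements about skew symmetric determinants can be tried on numerical cases. In the sketch the square root for order 4 is written in terms of the elements a_{rs} above the leading diagonal; the function names are assumptions for illustration.

```python
def det(a):
    """Development along the first row."""
    if len(a) == 1:
        return a[0][0]
    return sum((-1) ** k * a[0][k] * det([r[:k] + r[k + 1:] for r in a[1:]])
               for k in range(len(a)))

def skew4(a12, a13, a14, a23, a24, a34):
    """Skew symmetric array of order 4 built from its six upper elements."""
    return [[0, a12, a13, a14],
            [-a12, 0, a23, a24],
            [-a13, -a23, 0, a34],
            [-a14, -a24, -a34, 0]]

def square_root4(a):
    """a12*a34 - a13*a24 + a14*a23, with 0-based suffixes."""
    return a[0][1] * a[2][3] - a[0][2] * a[1][3] + a[0][3] * a[1][2]

even = skew4(1, 2, 3, 4, 5, 6)
odd = [[0, 1, 2], [-1, 0, 3], [-2, -3, 0]]
```

For `even` the square root is 6 − 10 + 12 = 8 and the determinant is 64; for `odd`, of uneven degree, the determinant vanishes.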
A skew determinant is one which is skew symmetric in all respects, except that the elements of the leading diagonal are not all zero. Such a determinant is of importance in the theory of orthogonal substitution. In the theory of surfaces we transform from one set of three rectangular axes to another by the substitutions
X =ax +by +cz,
Y = a′x +b′y +c′z,
Z = a″x + b″y + c″z,
where X^{2} + Y^{2} + Z^{2} = x^{2} + y^{2} + z^{2}. This relation implies six equations between the coefficients, so that only three of them are independent. Further we find
x = aX + a′Y + a″Z,
y = bX + b′Y + b″Z,
z = cX + c′Y + c″Z,
and the problem is to express the nine coefficients in terms of three independent quantities.
In general in space of n dimensions we have n substitutions similar to
X_{1} = a_{11}x_{1} +a_{12}x_{2} + ... + a_{1n}x_{n},
and we have to express the n^{2} coefficients in terms of ½n(n – 1) independent quantities; which must be possible, because
X_{1}^{2} + X_{2}^{2} + ... + X_{n}^{2} = x_{1}^{2} + x_{2}^{2} + x_{3}^{2} + ... + x_{n}^{2}.
Let there be 2n equations
x_{1}= b_{11}ξ_{1} + b_{12}ξ_{2} + b_{13}ξ_{3} + ...,
x_{2}= b_{21}ξ_{1} + b_{22}ξ_{2} + b_{23}ξ_{3} + ...,
.....
X_{1}= b_{11}ξ_{1} + b_{21}ξ_{2} + b_{31}ξ_{3} + ...,
X_{2}= b_{12}ξ_{1} + b_{22}ξ_{2} + b_{32}ξ_{3} + ...,
.....
where b_{rr} = 1 and b_{rs} = – b_{sr} for all values of r and s. There are then ½n(n–1) quantities b_{rs} . Let the determinant of the b’s be Δ_{b} and B_{rs}, the minor corresponding to b_{rs} . We can eliminate the quantities ξ_{1}, ξ_{2}, ... ξ_{n} and obtain n relations
Δ_{b}X_{1} = (2B_{11} – Δ_{b})x_{1} +2B_{21}x_{2}+2B_{31}x_{3}+...,
Δ_{b}X_{2} = 2B_{12}x_{1}+ (2B_{22} – Δ_{b})x_{2} + 2B_{32}x_{3}+...,
........
and from these another equivalent set
Δ_{b}x_{1} = (2B_{11} – Δ_{b})X_{1} +2B_{12}X_{2}+2B_{13}X_{3}+...,
Δ_{b}x_{2} = 2B_{21}X_{1}+ (2B_{22} – Δ_{b})X_{2} + 2B_{23}X_{3}+...,
........
and now writing
(2B_{ii} – Δ_{b})/Δ_{b} = a_{ii}, 2B_{ik}/Δ_{b} = a_{ik},
we have a transformation which is orthogonal, because ΣX^{2} = Σx^{2} and the elements a_{ii}, a_{ik} are functions of the ½n(n − 1) independent quantities b. We may therefore form an orthogonal transformation in association with every skew determinant which has its leading diagonal elements unity, for the ½n(n − 1) quantities b are clearly arbitrary.
For the second order we may take
Δ_{b} = |  1   λ | = 1 + λ²,
        | –λ   1 |
and the adjoint determinant is the same; hence
(1 + λ²)x_{1} = (1 – λ²)X_{1} + 2λX_{2},
(1 + λ²)x_{2} = –2λX_{1} + (1 – λ²)X_{2}.
Similarly, for the order 3, we take
Δ_{b} = |  1   ν  –μ |
        | –ν   1   λ | = 1 + λ² + μ² + ν²,
        |  μ  −λ   1 |
and the adjoint is
| 1+λ²    ν+λμ    −μ+λν |
| −ν+λμ   1+μ²    λ+μν  |
| μ+λν    −λ+μν   1+ν²  |
leading to the orthogonal substitution
Δ_{b}x_{1} = (1 +λ² − μ² − ν²)X_{1}+2(ν + λμ)X_{2}+2(− μ + λν)X_{3}
Δ_{b}x_{2} = 2(λμ − ν)X_{1} + (1 + μ² − λ² − ν²)X_{2}+2(μν + λ)X_{3}
Δ_{b}x_{3} = 2(λν + μ)X_{1}+2(μν − λ)X_{2}+(1 +ν² − λ² − μ²)X_{3}.
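The substitution of order 3 can be checked with exact rational arithmetic. The sketch below builds the array of coefficients from chosen values of λ, μ, ν and tests that rows multiplied into rows give unity or zero; the function names are illustrative assumptions.

```python
from fractions import Fraction

def cayley_orthogonal(lam, mu, nu):
    """Coefficients of X1, X2, X3 in x1, x2, x3, each divided by the determinant."""
    lam, mu, nu = map(Fraction, (lam, mu, nu))
    d = 1 + lam**2 + mu**2 + nu**2
    rows = [
        [1 + lam**2 - mu**2 - nu**2, 2 * (nu + lam * mu), 2 * (-mu + lam * nu)],
        [2 * (lam * mu - nu), 1 + mu**2 - lam**2 - nu**2, 2 * (mu * nu + lam)],
        [2 * (lam * nu + mu), 2 * (mu * nu - lam), 1 + nu**2 - lam**2 - mu**2],
    ]
    return [[entry / d for entry in row] for row in rows]

def is_orthogonal(m):
    """True when the sum of products of rows i and k is 1 for i == k, else 0."""
    n = len(m)
    return all(sum(m[i][j] * m[k][j] for j in range(n)) == (1 if i == k else 0)
               for i in range(n) for k in range(n))
```

With λ = 1, μ = ν = 0 the array reduces to rows (1, 0, 0), (0, 0, 1), (0, −1, 0), a quarter turn about the first axis.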
Functional determinants were first investigated by Jacobi in a work De Determinantibus Functionalibus. Suppose n dependent variables y_{1}, y_{2},...y_{n}, each of which is a function of n independent variables x_{1}, x_{2},...x_{n}, so that y_{s}= ƒ_{s} (x_{1}, x_{2},...x_{n}). From the differential coefficients of the y’s with regard to the x’s we form the functional determinant
R = | ∂y_{1}/∂x_{1}  ∂y_{1}/∂x_{2}  ...  ∂y_{1}/∂x_{n} |
    | ∂y_{2}/∂x_{1}  ∂y_{2}/∂x_{2}  ...  ∂y_{2}/∂x_{n} |
    |       .              .        ...        .       |
    | ∂y_{n}/∂x_{1}  ∂y_{n}/∂x_{2}  ...  ∂y_{n}/∂x_{n} |
If we have new variables z such that z_{s}=φ_{s}(y_{1}, y_{2},...y_{n}), we have also z_{s} = ψ_{s}(x_{1}, x_{2},...x_{n}), and we may consider the three determinants
(y_{1}, y_{2},...y_{n} / x_{1}, x_{2},...x_{n}), (z_{1}, z_{2},...z_{n} / y_{1}, y_{2},...y_{n}), (z_{1}, z_{2},...z_{n} / x_{1}, x_{2},...x_{n}).
Forming the product of the first two by the product theorem, we obtain for the element in the i^{th} row and k^{th} column
∂z_{i}/∂y_{1} · ∂y_{1}/∂x_{k} + ∂z_{i}/∂y_{2} · ∂y_{2}/∂x_{k} + ... + ∂z_{i}/∂y_{n} · ∂y_{n}/∂x_{k},
which is ∂z_{i}/∂x_{k}, the partial differential coefficient of z_{i} with regard to x_{k}. Hence the product theorem
(z_{1}, z_{2},...z_{n} / y_{1}, y_{2},...y_{n}) (y_{1}, y_{2},...y_{n} / x_{1}, x_{2},...x_{n}) = (z_{1}, z_{2},...z_{n} / x_{1}, x_{2},...x_{n});
and as a particular case
(y_{1}, y_{2},...y_{n} / x_{1}, x_{2},...x_{n}) (x_{1}, x_{2},...x_{n} / y_{1}, y_{2},...y_{n}) = 1.
Theorem.—If the functions y_{1}, y_{2},...y_{n} be not independent of one another the functional determinant vanishes, and conversely if the determinant vanishes, y_{1}, y_{2},...y_{n} are not independent functions of x_{1}, x_{2},...x_{n}.
Linear Equations.—It is of importance to study the application of the theory of determinants to the solution of a system of linear equations. Suppose given the n equations
ƒ_{1} = a_{11}x_{1}+ a_{12}x_{2}+ ... a_{1n}x_{n}=0,
ƒ_{2} = a_{21}x_{1}+ a_{22}x_{2}+ ... a_{2n}x_{n}=0,
.......
ƒ_{n} = a_{n1}x_{1}+ a_{n2}x_{2}+ ... a_{nn}x_{n}=0.
Denote by Δ the determinant (a_{11}a_{22}...a_{nn}).
Multiplying the equations by the minors A_{1μ}, A_{2μ},...A_{nμ} respectively, and adding, we obtain
x_{μ}(a_{1μ}A_{1μ}+a_{2μ}A_{2μ}+...+a_{nμ}A_{nμ}) = x_{μ}Δ = 0,
since from results already given the remaining coefficients of x_{1}, x_{2},...x_{μ–1}, x_{μ+1},...x_{n} vanish identically.
Hence if Δ does not vanish x_{1} = x_{2} = ... = x_{n} = 0 is the only solution; but if Δ vanishes the equations can be satisfied by a system of values other than zeros. For in this case the n equations are not independent since identically
A_{1μ}ƒ_{1} + A_{2μ}ƒ_{2}+...+A_{nμ}ƒ_{n} = 0,
and assuming that the minors do not all vanish the satisfaction of n–1 of the equations implies the satisfaction of the n^{th}.
Consider then the system of n–1 equations
a_{21}x_{1}+ a_{22}x_{2} +...+ a_{2n}x_{n} = 0
a_{31}x_{1}+ a_{32}x_{2} +...+ a_{3n}x_{n} = 0
......
a_{n1}x_{1}+ a_{n2}x_{2} +...+ a_{nn}x_{n} = 0,
which becomes on writing x_{s}/x_{n} = y_{s},
a_{21}y_{1}+ a_{22}y_{2} +...+ a_{2,n−1}y_{n−1} +a_{2n} = 0
a_{31}y_{1}+ a_{32}y_{2} +...+ a_{3,n−1}y_{n−1} +a_{3n} = 0
.......
a_{n1}y_{1}+ a_{n2}y_{2} +...+ a_{n,n−1}y_{n−1} +a_{nn} = 0.
We can solve these, assuming them independent, for the n−1 ratios y_{1}, y_{2},...y_{n−1}.
Now
a_{21}A_{11} + a_{22}A_{12}+...+a_{2n}A_{1n} = 0
a_{31}A_{11} + a_{32}A_{12}+...+a_{3n}A_{1n} = 0
.......
a_{n1}A_{11} + a_{n2}A_{12}+...+a_{nn}A_{1n} = 0
and therefore, by comparison with the given equations, x_{i} = ρA_{1i}, where ρ is an arbitrary factor which remains constant as i varies.
Hence y_{i} = A_{1i}/A_{1n}, where A_{1i} and A_{1n} are minors of the complete determinant
(a_{11}a_{22}...a_{nn}).
∴ y_{i} = (−)^{i+n} ×
| a_{21}  a_{22}  ...  a_{2,i–1}  a_{2,i+1}  ...  a_{2n} |
| a_{31}  a_{32}  ...  a_{3,i–1}  a_{3,i+1}  ...  a_{3n} |
|   .       .     ...      .          .      ...    .    |
| a_{n1}  a_{n2}  ...  a_{n,i–1}  a_{n,i+1}  ...  a_{nn} |
÷
| a_{21}  a_{22}  ...  a_{2,n–1} |
| a_{31}  a_{32}  ...  a_{3,n–1} |
|   .       .     ...      .     |
| a_{n1}  a_{n2}  ...  a_{n,n–1} |,
or, in words, y_{i} is the quotient of the determinant obtained by erasing the i^{th} column by that obtained by erasing the n^{th} column, multiplied by (–1)^{i+n}. For further information concerning the compatibility and independence of a system of linear equations, see Gordan, Vorlesungen über Invariantentheorie, Bd. 1, § 8.
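The solution x_{i} = ρA_{1i} of a system with vanishing determinant can be sketched numerically. The function names and the sample array are assumptions for illustration, and the sketch presumes that the minors of the first row do not all vanish.

```python
from fractions import Fraction

def det(a):
    """Development along the first row."""
    if len(a) == 1:
        return a[0][0]
    return sum(a[0][k] * cofactor(a, 0, k) for k in range(len(a)))

def cofactor(a, i, k):
    """Signed minor A_ik obtained by erasing row i and column k."""
    reduced = [row[:k] + row[k + 1:] for r, row in enumerate(a) if r != i]
    return (-1) ** (i + k) * det(reduced)

def first_row_minors(a):
    """The values A_11, A_12, ..., A_1n, a solution when det(a) vanishes."""
    return [cofactor(a, 0, k) for k in range(len(a))]

def ratios(a):
    """y_i = A_1i / A_1n for i = 1 ... n-1, assuming A_1n is not zero."""
    minors = first_row_minors(a)
    return [Fraction(m, minors[-1]) for m in minors[:-1]]

a = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]   # determinant 0
x = first_row_minors(a)
```

Here `x` is `[-3, 6, -3]`, which satisfies all three equations, and the ratios y_{1}, y_{2} are 1 and −2.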
Resultants.—When we are given k homogeneous equations in k variables or k non-homogeneous equations in k − 1 variables, the equations being independent, it is always possible to derive from them a single equation R = 0, where in R the variables do not appear. R is a function of the coefficients which is called the "resultant" or "eliminant" of the k equations, and the process by which it is obtained is termed "elimination." We cannot combine the equations so as to eliminate the variables unless on the supposition that the equations are simultaneous, i.e. each of them satisfied by a common system of values; hence the equation R = 0 is derived on this supposition, and the vanishing of R expresses the condition that the equations can be satisfied by a common system of values assigned to the variables.
Consider two binary equations of orders m and n respectively expressed in non-homogeneous form, viz.
ƒ(x) = ƒ = a_{0}x^{m} – a_{1}x^{m–1} + a_{2}x^{m–2} – ... = 0,
φ(x) = φ = b_{0}x^{n} – b_{1}x^{n–1} + b_{2}x^{n–2} – ... = 0.
If α_{1}, α_{2}, ...α_{m} be the roots of ƒ=0, β_{1}, β_{2}, ...β_{n} the roots of φ=0, the condition that some root of φ=0 may cause ƒ to vanish is clearly
R_{ƒ,φ} = ƒ(β_{1})ƒ(β_{2})...ƒ(β_{n}) = 0;
so that R_{ƒ,φ} is the resultant of ƒ and φ, and expressed as a function of the roots, it is of degree m in each root β, and of degree n in each root α, and also a symmetric function alike of the roots α and of the roots β; hence, expressed in terms of the coefficients, it is homogeneous and of degree n in the coefficients of ƒ, and homogeneous and of degree m in the coefficients of φ.
Ex. gr.
ƒ = a_{0}x² − a_{1}x + a_{2} = 0, φ = b_{0}x² − b_{1}x + b_{2} = 0.
We have to multiply a_{0}β_{1}^{2} − a_{1}β_{1} + a_{2} by a_{0}β_{2}^{2} − a_{1}β_{2} + a_{2}, and we obtain
a_{0}^{2}β_{1}^{2}β_{2}^{2} − a_{0}a_{1}(β_{1}^{2}β_{2} + β_{1}β_{2}^{2}) + a_{0}a_{2}(β_{1}^{2} + β_{2}^{2}) + a_{1}^{2}β_{1}β_{2} − a_{1}a_{2}(β_{1} + β_{2}) + a_{2}^{2},
where
β_{1} + β_{2} = b_{1}/b_{0}, β_{1}β_{2} = b_{2}/b_{0}, β_{1}^{2} + β_{2}^{2} = (b_{1}^{2} – 2b_{0}b_{2})/b_{0}^{2},
and clearing of fractions
R_{ƒ,φ} = (a_{0}b_{2} – a_{2}b_{0})² + (a_{1}b_{0} – a_{0}b_{1})(a_{1}b_{2} – a_{2}b_{1}).
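The closed form for two quadratics can be compared with the product ƒ(β_{1})ƒ(β_{2}) cleared of fractions. The sketch keeps the article's alternating signs, ƒ = a_{0}x² − a_{1}x + a_{2}, and assumes the roots of φ are given; the function names are illustrative.

```python
def resultant_quadratics(a0, a1, a2, b0, b1, b2):
    """(a0 b2 - a2 b0)^2 + (a1 b0 - a0 b1)(a1 b2 - a2 b1)."""
    return (a0 * b2 - a2 * b0) ** 2 + (a1 * b0 - a0 * b1) * (a1 * b2 - a2 * b1)

def resultant_from_roots(a0, a1, a2, b0, beta1, beta2):
    """b0^2 f(beta1) f(beta2), where f(x) = a0 x^2 - a1 x + a2."""
    f = lambda x: a0 * x * x - a1 * x + a2
    return b0 ** 2 * f(beta1) * f(beta2)

def phi_coefficients(b0, beta1, beta2):
    """Coefficients b0, b1, b2 of phi = b0 x^2 - b1 x + b2 with the given roots."""
    return b0, b0 * (beta1 + beta2), b0 * beta1 * beta2
```

Taking ƒ with roots 1, 2 (a = 1, 3, 2) and φ with roots 2, 5 (b = 1, 7, 10), both functions give 0, as they must when the two equations share the root 2.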
We may equally express the result as
φ(α_{1})φ(α_{2})...φ(α_{m}) = 0,
or as
Π_{s,t}(α_{s} – β_{t}) = 0.
This expression of R shows that, as will afterwards appear, the resultant is a simultaneous invariant of the two forms.
The resultant, being a product of mn root differences, is of degree mn in the roots, and hence is of weight mn in the coefficients of the forms; i.e. the sum of the suffixes in each term of the resultant is equal to mn.
Resultant Expressible as a Determinant.—From the theory of linear equations it can be gathered that the condition that p linear equations in p variables (homogeneous and independent) may be simultaneously satisfied is expressible as a determinant, viz. if
a_{11}x_{1} + a_{12}x_{2} +...+ a_{1p}x_{p} = 0,
a_{21}x_{1} + a_{22}x_{2} +...+ a_{2p}x_{p} = 0,
......
a_{p1}x_{1} + a_{p2}x_{2} +...+ a_{pp}x_{p} = 0,
be the system the condition is, in determinant form,
(a_{11}a_{22}...a_{pp}) = 0;
in fact the determinant is the resultant of the equations.
Now, suppose ƒ and φ to have a common factor x – γ,
ƒ(x) =ƒ_{1}(x)(x – γ); φ(x) = φ_{1}(x)(x – γ),
ƒ_{1} and φ_{1} being of degrees m – 1 and n – 1 respectively; we have the identity φ_{1}(x)ƒ(x) = ƒ_{1}(x)φ(x) of degree m + n – 1.
Assuming then φ_{1} to have the coefficients B_{1}, B_{2},...B_{n}
and ƒ_{1} the coefficients A_{1}, A_{2},...A_{m},
we may equate coefficients of like powers of x in the identity, and obtain m + n homogeneous linear equations satisfied by the m + n quantities B_{1}, B_{2},...B_{n}, A_{1}, A_{2},...A_{m}. Forming the resultant of these equations we evidently obtain the resultant of ƒ and φ.
Thus to obtain the resultant of
ƒ = a_{0}x^{3} + a_{1}x^{2} + a_{2}x + a_{3}, φ = b_{0}x^{2} + b_{1}x + b_{2}
we assume the identity
(B_{0}x + B_{1})(a_{0}x^{3} + a_{1}x^{2} + a_{2}x+ a_{3}) = (A_{0}x^{2} + A_{1}x+ A_{2})(b_{0}x^{2} + b_{1}x+ b_{2}),
and derive the linear equations
B_{0}a_{0} − A_{0}b_{0} = 0,
B_{0}a_{1} + B_{1}a_{0} − A_{0}b_{1} − A_{1}b_{0} = 0,
B_{0}a_{2} + B_{1}a_{1} − A_{0}b_{2} − A_{1}b_{1} − A_{2}b_{0} = 0,
B_{0}a_{3} + B_{1}a_{2} − A_{1}b_{2} − A_{2}b_{1} = 0,
B_{1}a_{3} − A_{2}b_{2} = 0,
and by elimination we obtain the resultant
a_{0} | 0 | b_{0} | 0 | 0 |
a_{1} | a_{0} | b_{1} | b_{0} | 0 |
a_{2} | a_{1} | b_{2} | b_{1} | b_{0} |
a_{3} | a_{2} | 0 | b_{2} | b_{1} |
0 | a_{3} | 0 | 0 | b_{2} |
a numerical factor being disregarded.
This is Euler’s method. Sylvester’s leads to the same expression, but in a simpler manner.
He forms n equations from ƒ by separate multiplication by x^{n–1}, x^{n–2}, ... x, 1, in succession, and similarly treats φ with m multipliers x^{m–1}, x^{m–2}, ... x, 1. From these m + n equations he eliminates the m + n powers x^{m+n–1}, x^{m+n–2}, ... x, 1, treating them as independent unknowns. Taking the same example as before the process leads to the system of equations
a_{0}x^{4} + a_{1}x^{3} + a_{2}x^{2} + a_{3}x = 0,
a_{0}x^{3} + a_{1}x^{2} + a_{2}x + a_{3} = 0,
b_{0}x^{4} + b_{1}x^{3} + b_{2}x^{2} = 0,
b_{0}x^{3} + b_{1}x^{2} + b_{2}x = 0,
b_{0}x^{2} + b_{1}x + b_{2} = 0,
whence by elimination the resultant
a_{0} | a_{1} | a_{2} | a_{3} | 0 |
0 | a_{0} | a_{1} | a_{2} | a_{3} |
b_{0} | b_{1} | b_{2} | 0 | 0 |
0 | b_{0} | b_{1} | b_{2} | 0 |
0 | 0 | b_{0} | b_{1} | b_{2} |
which reads by columns as the former determinant reads by rows, and is therefore identical with the former. E. Bézout’s method gives the resultant in the form of a determinant of order m or n, according as m is ≷ n. As modified by Cayley it takes a very simple form. He forms the equation
ƒ(x)φ(x′) − ƒ(x′)φ(x) = 0,
which can be satisfied when ƒ and φ possess a common factor. He first divides by the factor x − x′, reducing it to the degree m − 1 in both x and x′ where m > n; he then forms m equations by equating to zero the coefficients of the various powers of x′; these equations involve the m powers x^{0}, x, x^{2},... x^{m−1} of x, and regarding these as the unknowns of a system of linear equations the resultant is reached in the form of a determinant of order m. Ex. gr. Put
(a_{0}x^{3}+a_{1}x^{2}+a_{2}x +a_{3}) (b_{0}x′^{2}+b_{1}x′+b_{2}) − (a_{0}x′^{3}+a_{1}x′^{2}+a_{2}x′ +a_{3}) (b_{0}x^{2}+b_{1}x+b_{2}) = 0;
after division by x − x′ the three equations are formed
a_{0}b_{0}x² + a_{0}b_{1}x + a_{0}b_{2} = 0,
a_{0}b_{1}x² + (a_{0}b_{2} + a_{1}b_{1} − a_{2}b_{0})x + a_{1}b_{2} − a_{3}b_{0} = 0,
a_{0}b_{2}x² + (a_{1}b_{2} − a_{3}b_{0})x + a_{2}b_{2} − a_{3}b_{1} = 0,
and thence the resultant
a_{0}b_{0} | a_{0}b_{1} | a_{0}b_{2} |
a_{0}b_{1} | a_{0}b_{2} + a_{1}b_{1} − a_{2}b_{0} | a_{1}b_{2} − a_{3}b_{0} |
a_{0}b_{2} | a_{1}b_{2} − a_{3}b_{0} | a_{2}b_{2} − a_{3}b_{1} |
which is a symmetrical determinant.
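The determinant forms for a cubic and a quadratic can be compared numerically. The sketch below builds Sylvester's array of order 5 and the symmetrical Bézout–Cayley array of order 3, taking the middle entry of the latter as a_{0}b_{2} + a_{1}b_{1} − a_{2}b_{0}, which is what the division by x − x′ yields. The helper names and the sample polynomials are assumptions made for illustration; the two determinants are expected to agree only up to sign, numerical factors being disregarded as in the text.

```python
from fractions import Fraction

def determinant(rows):
    """Determinant by Gaussian elimination over exact fractions."""
    m = [[Fraction(x) for x in row] for row in rows]
    n, det = len(m), Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det
        det *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n):
                m[r][c] -= factor * m[col][c]
    return det

def sylvester_matrix(f, phi):
    """f shifted n times, then phi shifted m times (descending powers)."""
    m, n = len(f) - 1, len(phi) - 1
    size = m + n
    rows = [[0] * i + list(f) + [0] * (size - m - 1 - i) for i in range(n)]
    rows += [[0] * i + list(phi) + [0] * (size - n - 1 - i) for i in range(m)]
    return rows

def bezout_cayley_matrix(a, b):
    """Symmetrical array of order 3 for a cubic (a0..a3) and a quadratic (b0..b2)."""
    a0, a1, a2, a3 = a
    b0, b1, b2 = b
    return [
        [a0 * b0, a0 * b1, a0 * b2],
        [a0 * b1, a0 * b2 + a1 * b1 - a2 * b0, a1 * b2 - a3 * b0],
        [a0 * b2, a1 * b2 - a3 * b0, a2 * b2 - a3 * b1],
    ]

f = (1, -6, 11, -6)        # (x - 1)(x - 2)(x - 3)
phi = (1, 0, 1)            # x^2 + 1, no root in common with f
phi_common = (1, -2, -3)   # (x - 3)(x + 1), shares the root 3

print(determinant(sylvester_matrix(f, phi)), determinant(bezout_cayley_matrix(f, phi)))
print(determinant(sylvester_matrix(f, phi_common)), determinant(bezout_cayley_matrix(f, phi_common)))
```

Both arrays vanish when the forms share a root, and otherwise agree in absolute value.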
Case of Three Variables.—In the next place we consider the resultants of three homogeneous polynomials in three variables. We can prove that if the three equations be satisfied by a system of values of the variable, the same system will also satisfy the Jacobian or functional determinant. For if u, v, w be the polynomials of orders m, n, p respectively, the Jacobian is (u_{1} v_{2} w_{3}), and by Euler’s theorem of homogeneous functions
xu_{1} + yu_{2} + zu_{3} = mu
xv_{1} + yv_{2} + zv_{3} = nv
xw_{1} + yw_{2} + zw_{3} = pw;
denoting now the reciprocal determinant by (U_{1} V_{2} W_{3}) we obtain Jx = muU_{1} + nvV_{1} + pwW_{1}; Jy=..., Jz=..., and it appears that the vanishing of u, v, and w implies the vanishing of J. Further, if m = n = p, we obtain by differentiation
J + x ∂J/∂x = m(u ∂U_{1}/∂x + v ∂V_{1}/∂x + w ∂W_{1}/∂x + u_{1}U_{1} + v_{1}V_{1} + w_{1}W_{1}),
or, since u_{1}U_{1} + v_{1}V_{1} + w_{1}W_{1} = J,
x ∂J/∂x = (m – 1)J + m(u ∂U_{1}/∂x + v ∂V_{1}/∂x + w ∂W_{1}/∂x).
Hence the system of values also causes ∂J/∂x to vanish in this case; and by symmetry ∂J/∂y and ∂J/∂z also vanish.
The proof being of general application we may state that a system of values which causes the vanishing of k polynomials in k variables causes also the vanishing of the Jacobian, and in particular, when the forms are of the same degree, the vanishing also of the differential coefficients of the Jacobian in regard to each of the variables.
There is no difficulty in expressing the resultant by the method of symmetric functions. Taking two of the equations
ax^{m} + (by + cz) x^{m–1} +... =0,
a′x^{n} + (b′y + c′z) x^{n–1} +... =0,
we find that, eliminating x, the resultant is a homogeneous function of y and z of degree mn; equating this to zero and solving for the ratio of y to z we obtain mn solutions; if values of y and z, given by any solution, be substituted in each of the two equations, they will possess a common factor which gives a value of x which, combined with the chosen values of y and z, yields a system of values which satisfies both equations. Hence in all there are mn such systems. If, therefore, we have a third equation, and we substitute each system of values in it successively and form the product of the mn expressions thus formed, we obtain a function which vanishes if any one system of values, common to the first two equations, also satisfies the third. Hence this product is the required resultant of the three equations.
Now by the theory of symmetric functions, any symmetric function of the mn systems of values which satisfy the two equations can be expressed in terms of the coefficients of those equations. Hence, finally, the resultant is expressed in terms of the coefficients of the three equations, and since it is at once seen to be of degree mn in the coefficients of the third equation, by symmetry it must be of degrees np and pm in the coefficients of the first and second equations respectively. Its weight will be mnp (see Salmon’s Higher Algebra, 4th ed. § 77). The general theory of the resultant of k homogeneous equations in k variables presents no further difficulties when viewed in this manner.
The expression in form of a determinant presents in general considerable difficulties. If three equations, each of the second degree, in three variables be given, we have merely to eliminate the six products x², y², z², yz, zx, xy from the six equations
u = v = w = ∂J/∂x = ∂J/∂y = ∂J/∂z = 0; if we apply the same process to these equations each of degree three, we obtain similarly a determinant of order 21, but thereafter the process fails. Cayley, however, has shown that, whatever be the degrees of the three equations, it is possible to represent the resultant as the quotient of two determinants (Salmon, l.c. p. 89).
Discriminants.—The discriminant of a homogeneous polynomial in k variables is the resultant of the k polynomials formed by differentiations in regard to each of the variables.
It is the resultant of k polynomials each of degree m–1, and thus contains the coefficients of each form to the degree (m–1)^{k–1}; hence the total degrees in the coefficients of the k forms is, by addition, k(m–1)^{k–1}; it may further be shown that the weight of each term of the resultant is constant and equal to m(m–1)^{k–1} (Salmon, l.c. p. 100).
A binary form which has a square factor has its discriminant equal to zero. This can be seen at once because the factor in question being once repeated in both differentials, the resultant of the latter must vanish.
Similarly, if a form in k variables be expressible as a quadratic function of k – 1 linear functions X_{1}, X_{2}, ... X_{k – 1}, the coefficients being any polynomials, it is clear that the k differentials have, in common, the system of roots derived from X_{1} = X_{2} = ... = X_{k – 1} = 0, and have in consequence a vanishing resultant. This implies the vanishing of the discriminant of the original form.
Expression in Terms of Roots.—Since x ∂ƒ/∂x + y ∂ƒ/∂y = mƒ, if we take any root x_{1}, y_{1} of ∂ƒ/∂x and substitute in mƒ we must obtain y_{1}(∂ƒ/∂y)_{x=x_{1}, y=y_{1}}; hence the resultant of ∂ƒ/∂x and ƒ is, disregarding numerical factors, y_{1}y_{2}...y_{m–1} × discriminant of ƒ = a_{0} × disct. of ƒ. Now
ƒ = (xy_{1} – x_{1}y)(xy_{2} – x_{2}y) ... (xy_{m} – x_{m}y),
∂ƒ/∂x = Σ y_{1}(xy_{2} – x_{2}y) ... (xy_{m} – x_{m}y),
and substituting in the latter any root of ƒ and forming the product, we find the resultant of ƒ and ∂ƒ∂x, viz.
y_{1}y_{2}...y_{m}(x_{1}y_{2} – x_{2}y_{1})²(x_{1}y_{3} – x_{3}y_{1})²...(x_{r}y_{s} – x_{s}y_{r})²...
and, dividing by y_{1}y_{2}...y_{m}, the discriminant of ƒ is seen to be equal to the product of the squares of all the differences of any two roots of the equation. The discriminant of the product of two forms is equal to the product of their discriminants multiplied by the square of their resultant. This follows at once from the fact that the discriminant is
Π(α_{r} – α_{s})² Π(β_{r} – β_{s})² {Π(α_{r} – β_{s})}².
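Both statements about the discriminant can be illustrated with small integer roots. The sketch below assumes monic forms written by their roots; the function names are hypothetical, and the coefficient formula −4p³ − 27q² for the cubic x³ + px + q is the familiar one, brought in here only as an independent check.

```python
from itertools import combinations
from math import prod

def discriminant_from_roots(roots):
    """Product of the squares of all the differences of any two roots."""
    return prod((r - s) ** 2 for r, s in combinations(roots, 2))

def resultant_from_roots(alphas, betas):
    """Product of all differences alpha - beta."""
    return prod(a - b for a in alphas for b in betas)

def cubic_discriminant(p, q):
    """Discriminant of x^3 + p x + q expressed by the coefficients."""
    return -4 * p ** 3 - 27 * q ** 2

alphas = (1, 2, -3)        # roots of x^3 - 7x + 6
betas = (0, 5)             # roots of x^2 - 5x

lhs = discriminant_from_roots(alphas + betas)          # discriminant of the product
rhs = (discriminant_from_roots(alphas)
       * discriminant_from_roots(betas)
       * resultant_from_roots(alphas, betas) ** 2)
print(discriminant_from_roots(alphas), cubic_discriminant(-7, 6), lhs == rhs)
```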
II. The Theory Of Symmetric Functions
Consider n quantities α_{1}, α_{2}, α_{3},...α_{n}.
Every rational integral function of these quantities, which does not alter its value however the n suffixes 1, 2, 3, ... n be permuted, is a rational integral symmetric function of the quantities. If we write (1 + α_{1}x)(1 + α_{2}x)...(1 + α_{n}x) = 1 + a_{1}x + a_{2}x² + ... + a_{n}x^{n}, a_{1}, a_{2}, ...a_{n} are called the elementary symmetric functions.
a_{1} = α_{1} + α_{2} + ... + α_{n} = Σα_{1}
a_{2} = α_{1}α_{2} + α_{1}α_{3} + ... + α_{2}α_{3} + ... = Σα_{1}α_{2}
⋅⋅⋅⋅⋅
a_{n} = α_{1}α_{2}α_{3}...α_{n}.
The general monomial symmetric function is
Σα_{1}^{p_{1}}α_{2}^{p_{2}}α_{3}^{p_{3}}...α_{n}^{p_{n}},
the summation being for all permutations of the indices which result in different terms. The function is written
(p_{1}p_{2}p_{3}...p_{n})
for brevity, and repetitions of numbers in the bracket are indicated by exponents, so that (p_{1}p_{1}p_{2}) is written (p_{1}²p_{2}). The weight of the function is the sum of the numbers in the bracket, and the degree the highest of those numbers.
Ex. gr. The elementary functions are denoted by
(1), (1²), (1³), ... (1^{n}),
are all of the first degree, and are of weights 1, 2, 3,...n respectively.
Remark.—In this notation (0) = Σα_{1}^{0} = \binom{n}{1}; (0²) = Σα_{1}^{0}α_{2}^{0} = \binom{n}{2}; ... (0^{s}) = \binom{n}{s}, &c. The binomial coefficients appear, in fact, as symmetric functions, and this is frequently of importance.
The order of the numbers in the bracket (p_{1}p_{2} ...p_{n}) is immaterial; we may therefore always place them, as is most convenient, in descending order of magnitude; the numbers then constitute an ordered partition of the weight w, and the leading number denotes the degree.
The sum of the monomial functions of a given weight is called the homogeneous-product-sum or complete symmetric function of that weight; it is denoted by h_{w}; it is connected with the elementary functions by the formula
1/(1 – a_{1}x + a_{2}x² – a_{3}x³ + ...) = 1 + h_{1}x + h_{2}x² + h_{3}x³ + ...,
which remains true when the symbols a and h are interchanged, as is at once evident by writing –x for x. This proves, also, that in any formula connecting a_{1}, a_{2}, a_{3},... with h_{1}, h_{2}, h_{3},... the symbols a and h may be interchanged.
Ex. gr. from h_{2} = a_{1}² – a_{2} we derive a_{2} = h_{1}² – h_{2}.
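The interchange of a and h can be tried on numbers. In the sketch below the function names `elementary` and `complete` and the sample quantities are assumptions for illustration; the last check is the relation obtained by multiplying out the generating formula, namely that Σ(–)^{k}a_{k}h_{w–k} vanishes for every weight w ≥ 1.

```python
from itertools import combinations, combinations_with_replacement
from math import prod

def elementary(k, xs):
    """a_k: sum of products of k distinct quantities."""
    return sum(prod(c) for c in combinations(xs, k))

def complete(k, xs):
    """h_k: sum of all products of k quantities, repetitions allowed."""
    return sum(prod(c) for c in combinations_with_replacement(xs, k))

xs = (2, 3, 5, 7)
a1, a2 = elementary(1, xs), elementary(2, xs)
h1, h2 = complete(1, xs), complete(2, xs)

print(h2 == a1 ** 2 - a2, a2 == h1 ** 2 - h2)

def convolution(w, xs):
    """Coefficient of x^w in (1 - a1 x + a2 x^2 - ...)(1 + h1 x + h2 x^2 + ...)."""
    return sum((-1) ** k * elementary(k, xs) * complete(w - k, xs) for k in range(w + 1))

print([convolution(w, xs) for w in range(1, 6)])
```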
The function Σα_{1}^{p_{1}}α_{2}^{p_{2}}...α_{n}^{p_{n}} being as above denoted by a partition of the weight, viz. (p_{1}p_{2}...p_{n}), it is necessary to bring under view other functions associated with the same series of numbers: such, for example, as
Σα_{1}^{p_{1}}α_{2}^{p_{3}} Σα_{1}^{p_{2}}α_{2}^{p_{4}}...α_{n–2}^{p_{n}} = (p_{1}p_{3})(p_{2}p_{4}...p_{n}).
The expression just written is in fact a partition of a partition, and to avoid confusion of language will be termed a separation of a partition. A partition is separated into separates so as to produce a separation of the partition by writing down a set of partitions, each separate partition in its own brackets, so that when all the parts of these partitions are reassembled in a single bracket the partition which is separated is reproduced. It is convenient to write the distinct partitions or separates in descending order as regards weight. If the successive weights of the separates w_{1}, w_{2}, w_{3},... be enclosed in a bracket we obtain a partition of the weight w which appertains to the separated partition. This partition is termed the specification of the separation. The degree of the separation is the sum of the degrees of the component separates. A separation is the symbolic representation of a product of monomial symmetric functions. A partition (p_{1}p_{1}p_{1}p_{2}p_{2}p_{3}) = (p_{1}³p_{2}²p_{3}) can be separated in the manner (p_{1}p_{2})(p_{1}p_{2})(p_{1}p_{3}) = (p_{1}p_{2})²(p_{1}p_{3}), and we may take the general form of a partition to be (p_{1}^{π_{1}}p_{2}^{π_{2}}p_{3}^{π_{3}}...) and that of a separation (J_{1})^{j_{1}}(J_{2})^{j_{2}}(J_{3})^{j_{3}}... where J_{1}, J_{2}, J_{3}... denote the distinct separates involved.
Theorem.—The function symbolized by (n), viz. the sum of the n^{th} powers of the quantities, is expressible in terms of functions which are symbolized by separations of any partition (n_{1}^{ν_{1}}n_{2}^{ν_{2}}n_{3}^{ν_{3}}...) of the number n. The expression is—
(–)^{ν_{1}+ν_{2}+ν_{3}+...} {(ν_{1}+ν_{2}+ν_{3}+...–1)! / (ν_{1}!ν_{2}!ν_{3}!...)} (n)
= Σ (–)^{j_{1}+j_{2}+j_{3}+...} {(j_{1}+j_{2}+j_{3}+...–1)! / (j_{1}!j_{2}!j_{3}!...)} (J_{1})^{j_{1}}(J_{2})^{j_{2}}(J_{3})^{j_{3}}...,
(J_{1})^{j_{1}}(J_{2})^{j_{2}}(J_{3})^{j_{3}}... being a separation of (n_{1}^{ν_{1}}n_{2}^{ν_{2}}n_{3}^{ν_{3}}...) and the summation being in regard to all such separations. For the particular case (n_{1}^{ν_{1}}n_{2}^{ν_{2}}n_{3}^{ν_{3}}...) = (1^{n})
(–)^{n}(1/n)(n) = Σ (–)^{j_{1}+j_{2}+j_{3}+...} {(j_{1}+j_{2}+j_{3}+...–1)! / (j_{1}!j_{2}!j_{3}!...)} (1)^{j_{1}}(1²)^{j_{2}}(1³)^{j_{3}}...
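The particular case (1^{n}) expresses the sum of n^{th} powers by the elementary functions, and may be tested directly. The sketch below is an illustration under stated assumptions: the generator `partitions_as_multiplicities` and the other names are hypothetical helpers, j_{i} is read as the number of times the separate (1^{i}) occurs, and the quantities 2, 3, 5 are arbitrary.

```python
from fractions import Fraction
from itertools import combinations
from math import factorial, prod

def partitions_as_multiplicities(n, largest=None):
    """Yield {part: multiplicity} for every partition of n."""
    if largest is None:
        largest = n
    if n == 0:
        yield {}
        return
    for part in range(min(n, largest), 0, -1):
        for count in range(1, n // part + 1):
            for rest in partitions_as_multiplicities(n - part * count, part - 1):
                yield {part: count, **rest}

def elementary(k, xs):
    return sum(prod(c) for c in combinations(xs, k))

def power_sum_from_elementary(n, xs):
    """Solve (-)^n (1/n)(n) = sum (-)^(j1+j2+..) (j1+j2+..-1)!/(j1! j2! ..) a1^j1 a2^j2 .. for (n)."""
    total = Fraction(0)
    for mult in partitions_as_multiplicities(n):
        j = sum(mult.values())
        coeff = Fraction((-1) ** j * factorial(j - 1),
                         prod(factorial(c) for c in mult.values()))
        total += coeff * prod(elementary(part, xs) ** count for part, count in mult.items())
    return (-1) ** n * n * total

xs = (2, 3, 5)
for n in range(1, 7):
    print(n, power_sum_from_elementary(n, xs), sum(x ** n for x in xs))
```

For n = 3 the sum reduces to the familiar (3) = a_{1}³ – 3a_{1}a_{2} + 3a_{3}.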
To establish this write—
1 + μX_{1} + μ²X_{2} + μ³X_{3} + ... = Π_{α}(1 + μα_{1}x_{1} + μ²α_{1}²x_{2} + μ³α_{1}³x_{3} + ...),
the product on the right involving a factor for each of the quantities α_{1}, α_{2}, α_{3}..., and μ being arbitrary.
Multiplying out the right-hand side and comparing coefficients
X_{1} = (1)x_{1},
X_{2} = (2)x_{2} + (1²)x_{1}²,
X_{3} = (3)x_{3} + (21)x_{2}x_{1} + (1³)x_{1}³,
X_{4} = (4)x_{4} + (31)x_{3}x_{1} + (2²)x_{2}² + (21²)x_{2}x_{1}² + (1^{4})x_{1}^{4},
•••••••
X_{m} = Σ(m_{1}^{μ_{1}}m_{2}^{μ_{2}}m_{3}^{μ_{3}}...)x_{m_{1}}^{μ_{1}}x_{m_{2}}^{μ_{2}}x_{m_{3}}^{μ_{3}}...,
the summation being for all partitions of m.
Auxiliary Theorem.—The coefficient of x_{l_{1}}^{λ_{1}}x_{l_{2}}^{λ_{2}}x_{l_{3}}^{λ_{3}}... in the product
X_{m_{1}}^{μ_{1}}X_{m_{2}}^{μ_{2}}X_{m_{3}}^{μ_{3}}... / (μ_{1}!μ_{2}!μ_{3}!...) is Σ (J_{1})^{j_{1}}(J_{2})^{j_{2}}(J_{3})^{j_{3}}... / (j_{1}!j_{2}!j_{3}!...), where (J_{1})^{j_{1}}(J_{2})^{j_{2}}(J_{3})^{j_{3}}... is a separation of (l_{1}^{λ_{1}}l_{2}^{λ_{2}}l_{3}^{λ_{3}}...) of specification (m_{1}^{μ_{1}}m_{2}^{μ_{2}}m_{3}^{μ_{3}}...), and the sum is for all such separations.
To establish this observe the result.
(1/p!)X_{3}^{p} = Σ {(3)^{π_{1}}(21)^{π_{2}}(1³)^{π_{3}} / (π_{1}!π_{2}!π_{3}!)} x_{3}^{π_{1}}x_{2}^{π_{2}}x_{1}^{π_{2}+3π_{3}}
and remark that (3)^{π_{1}}(21)^{π_{2}}(1³)^{π_{3}} is a separation of (3^{π_{1}}2^{π_{2}}1^{π_{2}+3π_{3}}) of specification (3^{p}). A similar remark may be made in respect of
(1/μ_{1}!)X_{m_{1}}^{μ_{1}}, (1/μ_{2}!)X_{m_{2}}^{μ_{2}}, (1/μ_{3}!)X_{m_{3}}^{μ_{3}}, ...
and therefore of the product of those expressions. Hence the theorem.
Now
log (1 + μX_{1} + μ²X_{2} + μ³X_{3} + ...)
= Σ_{α} log (1 + μα_{1}x_{1} + μ²α_{1}²x_{2} + μ³α_{1}³x_{3} + ...),
whence, expanding by the exponential and multinomial theorems, a comparison of the coefficients of μ^{n} gives
(n) Σ (–)^{ν_{1}+ν_{2}+ν_{3}+...–1} {(ν_{1}+ν_{2}+ν_{3}+...–1)! / (ν_{1}!ν_{2}!ν_{3}!...)} x_{n_{1}}^{ν_{1}}x_{n_{2}}^{ν_{2}}x_{n_{3}}^{ν_{3}}...
= Σ (–)^{μ_{1}+μ_{2}+μ_{3}+...–1} {(μ_{1}+μ_{2}+μ_{3}+...–1)! / (μ_{1}!μ_{2}!μ_{3}!...)} X_{m_{1}}^{μ_{1}}X_{m_{2}}^{μ_{2}}X_{m_{3}}^{μ_{3}}...,
and, by the auxiliary theorem, any term X_{m_{1}}^{μ_{1}}X_{m_{2}}^{μ_{2}}X_{m_{3}}^{μ_{3}}... on the right-hand side is such that the coefficient of x_{n_{1}}^{ν_{1}}x_{n_{2}}^{ν_{2}}x_{n_{3}}^{ν_{3}}... in X_{m_{1}}^{μ_{1}}X_{m_{2}}^{μ_{2}}X_{m_{3}}^{μ_{3}}... / (μ_{1}!μ_{2}!μ_{3}!...) is
Σ (J_{1})^{j_{1}}(J_{2})^{j_{2}}(J_{3})^{j_{3}}... / (j_{1}!j_{2}!j_{3}!...),
where, since (m_{1}^{μ_{1}}m_{2}^{μ_{2}}m_{3}^{μ_{3}}...) is the specification of (J_{1})^{j_{1}}(J_{2})^{j_{2}}(J_{3})^{j_{3}}..., μ_{1}+μ_{2}+μ_{3}+... = j_{1}+j_{2}+j_{3}+.... Comparison of the coefficients of x_{n_{1}}^{ν_{1}}x_{n_{2}}^{ν_{2}}x_{n_{3}}^{ν_{3}}... therefore yields the result
(–)^{ν_{1}+ν_{2}+ν_{3}+...} {(ν_{1}+ν_{2}+ν_{3}+...–1)! / (ν_{1}!ν_{2}!ν_{3}!...)} (n)
= Σ (–)^{j_{1}+j_{2}+j_{3}+...} {(j_{1}+j_{2}+j_{3}+...–1)! / (j_{1}!j_{2}!j_{3}!...)} (J_{1})^{j_{1}}(J_{2})^{j_{2}}(J_{3})^{j_{3}}...,
for the expression of Σα^{n} in terms of products of symmetric functions symbolized by separations of (n_{1}^{ν_{1}}n_{2}^{ν_{2}}n_{3}^{ν_{3}}...).
Let (n)_{a}, (n)_{x}, (n)_{X} denote the sums of the n^{th} powers of quantities whose elementary symmetric functions are a_{1}, a_{2}, a_{3},...; x_{1}, x_{2}, x_{3},...; X_{1}, X_{2}, X_{3},... respectively: then the result arrived at above from the logarithmic expansion may be written
(n)_{a}(n)_{x} = (n)_{X},
exhibiting (n)_{x} as an invariant of the transformation given by the expressions of X_{1}, X_{2}, X_{3}... in terms of x_{1}, x_{2}, x_{3},....
The inverse question is the expression of any monomial symmetric function by means of the power functions (r) = s_{r}.
Theorem of Reciprocity.—If
X_{m_{1}}^{μ_{1}}X_{m_{2}}^{μ_{2}}X_{m_{3}}^{μ_{3}}... = ... + θ(s_{1}^{σ_{1}}s_{2}^{σ_{2}}s_{3}^{σ_{3}}...)x_{l_{1}}^{λ_{1}}x_{l_{2}}^{λ_{2}}x_{l_{3}}^{λ_{3}}... + ...,
where θ is a numerical coefficient, then also
X_{s_{1}}^{σ_{1}}X_{s_{2}}^{σ_{2}}X_{s_{3}}^{σ_{3}}... = ... + θ(m_{1}^{μ_{1}}m_{2}^{μ_{2}}m_{3}^{μ_{3}}...)x_{l_{1}}^{λ_{1}}x_{l_{2}}^{λ_{2}}x_{l_{3}}^{λ_{3}}... + ....
We have found above that the coefficient of x_{l_{1}}^{λ_{1}}x_{l_{2}}^{λ_{2}}x_{l_{3}}^{λ_{3}}... in the product X_{m_{1}}^{μ_{1}}X_{m_{2}}^{μ_{2}}X_{m_{3}}^{μ_{3}}... is
μ_{1}!μ_{2}!μ_{3}!... Σ (J_{1})^{j_{1}}(J_{2})^{j_{2}}(J_{3})^{j_{3}}... / (j_{1}!j_{2}!j_{3}!...),
the sum being for all separations of (l_{1}^{λ_{1}}l_{2}^{λ_{2}}l_{3}^{λ_{3}}...) which have the specification (m_{1}^{μ_{1}}m_{2}^{μ_{2}}m_{3}^{μ_{3}}...). We can multiply out this expression so as to obtain a series of monomials of the form θ(s_{1}^{σ_{1}}s_{2}^{σ_{2}}s_{3}^{σ_{3}}...). It can be shown that the number θ enumerates distributions of a certain nature defined by the partitions (m_{1}^{μ_{1}}m_{2}^{μ_{2}}...), (s_{1}^{σ_{1}}s_{2}^{σ_{2}}...), (l_{1}^{λ_{1}}l_{2}^{λ_{2}}...), and it is seen intuitively that the number θ remains unaltered when the first two of these partitions are interchanged (see Combinatorial Analysis). Hence the theorem is established.
Putting x_{1}= 1 and x_{2} = x_{3} = x_{4} = ... = 0, we find a particular law of reciprocity given by Cayley and Betti,
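(1^{m_{1}})^{μ_{1}}(1^{m_{2}})^{μ_{2}}(1^{m_{3}})^{μ_{3}}... = ... + θ(s_{1}^{σ_{1}}s_{2}^{σ_{2}}s_{3}^{σ_{3}}...) + ...,
(1^{s_{1}})^{σ_{1}}(1^{s_{2}})^{σ_{2}}(1^{s_{3}})^{σ_{3}}... = ... + θ(m_{1}^{μ_{1}}m_{2}^{μ_{2}}m_{3}^{μ_{3}}...) + ...;
and another by putting x_{1} = x_{2} = x_{3} = ... = 1, for then X_{m} becomes h_{m}, and we have
h_{m_{1}}^{μ_{1}}h_{m_{2}}^{μ_{2}}h_{m_{3}}^{μ_{3}}... = ... + θ(s_{1}^{σ_{1}}s_{2}^{σ_{2}}s_{3}^{σ_{3}}...) + ...,
h_{s_{1}}^{σ_{1}}h_{s_{2}}^{σ_{2}}h_{s_{3}}^{σ_{3}}... = ... + θ(m_{1}^{μ_{1}}m_{2}^{μ_{2}}m_{3}^{μ_{3}}...) + ....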
Theorem of Expressibility.—“If a symmetric function be symbolized by (λμν...) and (λ_{1}λ_{2}λ_{3}...), (μ_{1}μ_{2}μ_{3}...), (ν_{1}ν_{2}ν_{3}...)... be any partitions of λ, μ, ν... respectively, the function (λμν...) is expressible by means of functions symbolized by separations of
(λ_{1}λ_{2}λ_{3}...μ_{1}μ_{2}μ_{3}...ν_{1}ν_{2}ν_{3}...).”
For, writing as before,
X_{m_{1}}^{μ_{1}}X_{m_{2}}^{μ_{2}}X_{m_{3}}^{μ_{3}}... = ΣΣθ(s_{1}^{σ_{1}}s_{2}^{σ_{2}}s_{3}^{σ_{3}}...)x_{l_{1}}^{λ_{1}}x_{l_{2}}^{λ_{2}}x_{l_{3}}^{λ_{3}}... = ΣP x_{l_{1}}^{λ_{1}}x_{l_{2}}^{λ_{2}}x_{l_{3}}^{λ_{3}}...,
P is a linear function of separations of (l_{1}^{λ_{1}}l_{2}^{λ_{2}}l_{3}^{λ_{3}}...) of specification (m_{1}^{μ_{1}}m_{2}^{μ_{2}}m_{3}^{μ_{3}}...), and if
X_{s_{1}}^{σ_{1}}X_{s_{2}}^{σ_{2}}X_{s_{3}}^{σ_{3}}... = ΣP′x_{l_{1}}^{λ_{1}}x_{l_{2}}^{λ_{2}}x_{l_{3}}^{λ_{3}}...,
P′ is a linear function of separations of (l_{1}^{λ_{1}}l_{2}^{λ_{2}}l_{3}^{λ_{3}}...) of specification (s_{1}^{σ_{1}}s_{2}^{σ_{2}}s_{3}^{σ_{3}}...).
Suppose the separations of (l_{1}^{λ_{1}}l_{2}^{λ_{2}}l_{3}^{λ_{3}}...) to involve k different specifications and form the k identities
X_{m_{1s}}^{μ_{1s}}X_{m_{2s}}^{μ_{2s}}X_{m_{3s}}^{μ_{3s}}... = ΣP^{(s)}x_{l_{1}}^{λ_{1}}x_{l_{2}}^{λ_{2}}x_{l_{3}}^{λ_{3}}... (s = 1, 2, ...k),
where (m_{1s}^{μ_{1s}}m_{2s}^{μ_{2s}}m_{3s}^{μ_{3s}}...) is one of the k specifications.
The law of reciprocity shows that
P^{(s)} = Σ_{t=1}^{k} θ_{st}(m_{1t}^{μ_{1t}}m_{2t}^{μ_{2t}}m_{3t}^{μ_{3t}}...),
viz.: a linear function of symmetric functions symbolized by the k specifications; and that θ_{st} = θ_{ts}. A table may be formed expressing the k expressions P^{(1)}, P^{(2)},...P^{(k)} as linear functions of the k expressions (m_{1s}^{μ_{1s}}m_{2s}^{μ_{2s}}m_{3s}^{μ_{3s}}...), s = 1, 2, ...k, and the numbers θ_{st} occurring therein possess row and column symmetry. By solving k linear equations we similarly express the latter functions as linear functions of the former, and this table will also be symmetrical.
Theorem.—“The symmetric function (m_{1s}^{μ_{1s}}m_{2s}^{μ_{2s}}m_{3s}^{μ_{3s}}...) whose partition is a specification of a separation of the function symbolized by (l_{1}^{λ_{1}}l_{2}^{λ_{2}}l_{3}^{λ_{3}}...) is expressible as a linear function of symmetric functions symbolized by separations of (l_{1}^{λ_{1}}l_{2}^{λ_{2}}l_{3}^{λ_{3}}...) and a symmetrical table may be thus formed.” It is now to be remarked that the partition (l_{1}^{λ_{1}}l_{2}^{λ_{2}}l_{3}^{λ_{3}}...) can be derived from (m_{1s}^{μ_{1s}}m_{2s}^{μ_{2s}}m_{3s}^{μ_{3s}}...) by substituting for the numbers m_{1s}, m_{2s}, m_{3s}, ... certain partitions of those numbers (vide the definition of the specification of a separation).
Hence the theorem of expressibility enunciated above. A new statement of the law of reciprocity can be arrived at as follows: Since
P^{(s)} = μ_{1s}!μ_{2s}!μ_{3s}!... Σ_{s} (J_{1})^{j_{1}}(J_{2})^{j_{2}}(J_{3})^{j_{3}}... / (j_{1}!j_{2}!j_{3}!...),
where (J_{1})^{j_{1}}(J_{2})^{j_{2}}(J_{3})^{j_{3}}... is a separation of (l_{1}^{λ_{1}}l_{2}^{λ_{2}}l_{3}^{λ_{3}}...) of specification (m_{1s}^{μ_{1s}}m_{2s}^{μ_{2s}}m_{3s}^{μ_{3s}}...), placing s under the summation sign to denote the specification involved,
μ_{1s}!μ_{2s}!μ_{3s}!... Σ_{s} (J_{1})^{j_{1}}(J_{2})^{j_{2}}(J_{3})^{j_{3}}... / (j_{1}!j_{2}!j_{3}!...) = Σ_{t=1}^{k} θ_{st}(m_{1t}^{μ_{1t}}m_{2t}^{μ_{2t}}m_{3t}^{μ_{3t}}...),
where θ_{st} = θ_{ts}.
Theorem of Symmetry.—If we form the separation function
Σ (J_{1})^{j_{1}}(J_{2})^{j_{2}}(J_{3})^{j_{3}}... / (j_{1}!j_{2}!j_{3}!...)
appertaining to the function (l_{1}^{λ_{1}}l_{2}^{λ_{2}}l_{3}^{λ_{3}}...), each separation having a specification (m_{1s}^{μ_{1s}}m_{2s}^{μ_{2s}}m_{3s}^{μ_{3s}}...), multiply by μ_{1s}!μ_{2s}!μ_{3s}!... and take therein the coefficient of the function (m_{1t}^{μ_{1t}}m_{2t}^{μ_{2t}}m_{3t}^{μ_{3t}}...), we obtain the same result as if we formed the separation function in regard to the specification (m_{1t}^{μ_{1t}}m_{2t}^{μ_{2t}}m_{3t}^{μ_{3t}}...), multiplied by μ_{1t}!μ_{2t}!μ_{3t}!... and took therein the coefficient of the function (m_{1s}^{μ_{1s}}m_{2s}^{μ_{2s}}m_{3s}^{μ_{3s}}...).
Ex. gr., take (l_{1}^{λ_{1}}l_{2}^{λ_{2}}...) = (21^{4}); (m_{1s}^{μ_{1s}}m_{2s}^{μ_{2s}}...) = (321); (m_{1t}^{μ_{1t}}m_{2t}^{μ_{2t}}...) = (31³); we find
(21)(1²)(1) + (1³)(2)(1) = ... + 13(31³) + ...,
(21)(1)³ = ... + 13(321) + ....
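The two coefficients in this example can be recovered by direct multiplication of monomial symmetric functions. The sketch below is a brute-force illustration, not the method of the text: the names `distinct_terms` and `coefficient` are assumptions, and each function is expanded in only as many quantities as the target partition has parts, since no others can contribute to that term.

```python
from itertools import permutations

def distinct_terms(partition, nvars):
    """Exponent vectors of the monomial symmetric function (partition) in nvars quantities."""
    if len(partition) > nvars:
        return set()
    padded = tuple(partition) + (0,) * (nvars - len(partition))
    return set(permutations(padded))

def coefficient(factors, target):
    """Coefficient of the monomial function (target) in the product of the given functions."""
    nvars = len(target)
    term_sets = [distinct_terms(p, nvars) for p in factors]

    def count(i, remaining):
        # The last factor must supply exactly the exponents still outstanding.
        if i == len(term_sets) - 1:
            return 1 if remaining in term_sets[i] else 0
        total = 0
        for term in term_sets[i]:
            rest = tuple(r - e for r, e in zip(remaining, term))
            if min(rest) >= 0:
                total += count(i + 1, rest)
        return total

    return count(0, tuple(target))

# Coefficient of (31^3) in (21)(1^2)(1) + (1^3)(2)(1)
lhs = (coefficient([(2, 1), (1, 1), (1,)], (3, 1, 1, 1))
       + coefficient([(1, 1, 1), (2,), (1,)], (3, 1, 1, 1)))
# Coefficient of (321) in (21)(1)^3
rhs = coefficient([(2, 1), (1,), (1,), (1,)], (3, 2, 1))
print(lhs, rhs)
```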
The Differential Operators.—Starting with the relation
(1 + α_{1}x)(1 + α_{2}x)...(1 + α_{n}x) = 1 + a_{1}x + a_{2}x² + ... + a_{n}x^{n}
multiply each side by 1 + μx, thus introducing a new quantity μ; we obtain
(1 + α_{1}x)(1 + α_{2}x)...(1 + α_{n}x)(1 + μx) = 1 + (a_{1} + μ)x + (a_{2} + μa_{1})x² + ...
so that ƒ(a_{1}, a_{2}, a_{3},...a_{n}) = ƒ, a rational integral function of the elementary functions, is converted into
ƒ(a_{1} + μ, a_{2} + μa_{1},...a_{n} + μa_{n–1}) = ƒ + μd_{1}ƒ + (μ²/2!)d_{1}²ƒ + (μ³/3!)d_{1}³ƒ + ...
where
d_{1} = ∂/∂a_{1} + a_{1}∂/∂a_{2} + a_{2}∂/∂a_{3} + ... + a_{n–1}∂/∂a_{n}
and d_{1}^{s} denotes, not s successive operations of d_{1}, but the operator of order s obtained by raising d_{1} to the s^{th} power symbolically as in Taylor's theorem in the Differential Calculus.
Write also (1/s!)d_{1}^{s} = D_{s}, so that
ƒ(a_{1} + μ, a_{2} + μa_{1},...a_{n} + μa_{n–1}) = ƒ + μD_{1}ƒ + μ²D_{2}ƒ + μ³D_{3}ƒ + ....
The introduction of the quantity μ converts the symmetric function (λ_{1}λ_{2}λ_{3}...) into
(λ_{1}λ_{2}λ_{3}...) + μ^{λ_{1}}(λ_{2}λ_{3}...) + μ^{λ_{2}}(λ_{1}λ_{3}...) + μ^{λ_{3}}(λ_{1}λ_{2}...) + ....
Hence, if ƒ(a_{1}, a_{2},...a_{n}) = (λ_{1}λ_{2}λ_{3}...),
(λ_{1}λ_{2}λ_{3}...) + μ^{λ_{1}}(λ_{2}λ_{3}...) + μ^{λ_{2}}(λ_{1}λ_{3}...) + μ^{λ_{3}}(λ_{1}λ_{2}...) + ... = (1 + μD_{1} + μ²D_{2} + μ³D_{3} + ...)(λ_{1}λ_{2}λ_{3}...).
Comparing coefficients of like powers of μ we obtain
D_{λ_{1}}(λ_{1}λ_{2}λ_{3}...) = (λ_{2}λ_{3}...),
while D_{s}(λ_{1}λ_{2}λ_{3}...) = 0 unless the partition (λ_{1}λ_{2}λ_{3}...) contains a part s. Further, if D_{λ_{1}}D_{λ_{2}} denote successive operations of D_{λ_{1}} and D_{λ_{2}},
D_{λ_{1}}D_{λ_{2}}(λ_{1}λ_{2}λ_{3}...) = (λ_{3}...),
and the operations are evidently commutative.
Also D_{p_{1}}^{π_{1}}D_{p_{2}}^{π_{2}}D_{p_{3}}^{π_{3}}...(p_{1}^{π_{1}}p_{2}^{π_{2}}p_{3}^{π_{3}}...) = 1, and the law of operation of the operators D upon a monomial symmetric function is clear.
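The effect of adjoining a new quantity μ upon a monomial symmetric function can be tried numerically. In the sketch below the names `monomial_symmetric` and `adjoined` are illustrative assumptions; one term μ^{p} is taken for each distinct part p of the partition, which is the reading of the expansion under which D_{p} strikes out a single part p.

```python
from itertools import permutations

def monomial_symmetric(partition, xs):
    """Value of the monomial symmetric function (partition) of the quantities xs."""
    if len(partition) > len(xs):
        return 0
    padded = tuple(partition) + (0,) * (len(xs) - len(partition))
    total = 0
    for exps in set(permutations(padded)):
        term = 1
        for x, e in zip(xs, exps):
            term *= x ** e
        total += term
    return total

def adjoined(partition, xs, mu):
    """(partition) + sum over distinct parts p of mu^p * (partition with one part p removed)."""
    total = monomial_symmetric(partition, xs)
    for p in set(partition):
        reduced = list(partition)
        reduced.remove(p)
        total += mu ** p * monomial_symmetric(tuple(reduced), xs)
    return total

xs, mu = (2, 3, 5), 7
for partition in [(2, 1), (2, 2, 1), (3, 1, 1)]:
    print(partition, monomial_symmetric(partition, xs + (mu,)), adjoined(partition, xs, mu))
```

The coefficient of μ^{s} in the right-hand value is the function that D_{s} produces from the operand.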
We have obtained the equivalent operations
1 + μD_{1} + μ²D_{2} + μ³D_{3} + ... = \overline{exp} μd_{1}
where \overline{exp} denotes (by the rule over exp) that the multiplication of operators is symbolic as in Taylor's theorem. d_{1}^{s} denotes, in fact, an operator of order s, but we may transform the right-hand side so that we are only concerned with the successive performance of linear operations. For this purpose write
d_{s} = ∂/∂a_{s} + a_{1}∂/∂a_{s+1} + a_{2}∂/∂a_{s+2} + ....
It has been shown (vide “Memoir on Symmetric Functions of the Roots of Systems of Equations,” Phil. Trans. 1890, p. 490) that
\overline{exp}(m_{1}d_{1} + m_{2}d_{2} + m_{3}d_{3} + ...) = exp (M_{1}d_{1} + M_{2}d_{2} + M_{3}d_{3} + ...),
where now the multiplications on the dexter denote successive operations, provided that
exp (M_{1}ξ + M_{2}ξ² + M_{3}ξ³ + ...) = 1 + m_{1}ξ + m_{2}ξ² + m_{3}ξ³ + ...,
ξ being an undetermined algebraic quantity.
Hence we derive the particular cases
\overline{exp} d_{1} = exp (d_{1} – (1/2)d_{2} + (1/3)d_{3} – ...);
\overline{exp} μd_{1} = exp (μd_{1} – (1/2)μ²d_{2} + (1/3)μ³d_{3} – ...),
and we can express D_{s} in terms of d_{1}, d_{2}, d_{3},..., products denoting successive operations, by the same law which expresses the elementary function a_{s} in terms of the sums of powers s_{1}, s_{2}, s_{3},...
Further, we can express d_{s} in terms of D_{1}, D_{2}, D_{3},... by the same law which expresses the power function s_{s} in terms of the elementary functions a_{1}, a_{2}, a_{3},...
Operation of D_{s} upon a Product of Symmetric Functions.—Suppose ƒ to be a product of symmetric functions ƒ_{1}ƒ_{2}...ƒ_{m}. If in the identity ƒ = ƒ_{1}ƒ_{2}...ƒ_{m} we introduce a new root μ we change a_{s} into a_{s} + μa_{s–1}, and we obtain
(1 + μD_{1} + μ²D_{2} + ... + μ^{s}D_{s} + ...)ƒ
= (1 + μD_{1} + μ²D_{2} + ... + μ^{s}D_{s} + ...)ƒ_{1}
× (1 + μD_{1} + μ²D_{2} + ... + μ^{s}D_{s} + ...)ƒ_{2}
× ...
× (1 + μD_{1} + μ²D_{2} + ... + μ^{s}D_{s} + ...)ƒ_{m},
and now expanding and equating coefficients of like powers of μ
D_{1}ƒ = Σ(D_{1}ƒ_{1})ƒ_{2}ƒ_{3}...ƒ_{m},
D_{2}ƒ = Σ(D_{2}ƒ_{1})ƒ_{2}ƒ_{3}...ƒ_{m} + Σ(D_{1}ƒ_{1})(D_{1}ƒ_{2})ƒ_{3}...ƒ_{m},
D_{3}ƒ = Σ(D_{3}ƒ_{1})ƒ_{2}ƒ_{3}...ƒ_{m} + Σ(D_{2}ƒ_{1})(D_{1}ƒ_{2})ƒ_{3}...ƒ_{m} + Σ(D_{1}ƒ_{1})(D_{1}ƒ_{2})(D_{1}ƒ_{3})ƒ_{4}...ƒ_{m},
the summation in a term covering every distribution of the operators of the type presenting itself in the term.
Writing these results
D_{1}ƒ = D_{(1)}ƒ,
D_{2}ƒ = D_{(2)}ƒ + D_{(1²)}ƒ,
D_{3}ƒ = D_{(3)}ƒ + D_{(21)}ƒ + D_{(1³)}ƒ,
we may write in general
D_{s}ƒ = ΣD_{(p_{1}p_{2}p_{3}...)}ƒ,
the summation being for every partition (p_{1}p_{2}p_{3}...) of s, and D_{(p_{1}p_{2}p_{3}...)}ƒ being = Σ(D_{p_{1}}ƒ_{1})(D_{p_{2}}ƒ_{2})(D_{p_{3}}ƒ_{3})ƒ_{4}...ƒ_{m}.
Ex. gr. To operate with D_{2} upon (21³)(21^{4})(1^{5}), we have
D_{(2)}ƒ = (1³)(21^{4})(1^{5}) + (21³)(1^{4})(1^{5}),
D_{(1²)}ƒ = (21²)(21³)(1^{5}) + (21³)(21³)(1^{4}) + (21²)(21^{4})(1^{4}),
and hence
D_{2}ƒ = (21^{4})(1^{5})(1³) + (21³)(1^{5})(1^{4}) + (21³)(21²)(1^{5}) + (21³)²(1^{4}) + (21^{4})(21²)(1^{4}).
Application to Symmetric Function Multiplication.—An example will explain this. Suppose we wish to find the coefficient of (52^{4}1³) in the product (21³)(21^{4})(1^{5}).
Write (21³)(21^{4})(1^{5}) = ... + A(52^{4}1³) + ...; then D_{5}D_{2}^{4}D_{1}³(21³)(21^{4})(1^{5}) = A; every other term disappearing by the fundamental property of D_{s}. Since D_{5}(21³)(21^{4})(1^{5}) = (1³)(1^{4})(1^{4}), we have:
D_{2}^{4}D_{1}³(1^{4})(1^{4})(1³) = A,
D_{2}³D_{1}³{(1³)³ + 2(1^{4})(1³)(1²)} = A,
D_{2}D_{1}³{12(1²)²(1) + 7(1³)(1)² + 2(1^{4})(1) + 6(1³)(1²)} = A,
D_{1}³{12(1)³ + 44(1²)(1) + 9(1³)} = A.
Finally, since D_{1}³ performed upon (1)³, (1²)(1) and (1³) gives 6, 3 and 1 respectively, A = 12·6 + 44·3 + 9·1 = 213.
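The successive operations of this example can be traced mechanically. The sketch below is an illustration under stated assumptions: a product of monomial functions is stored as a tuple of partitions, D_{s} is distributed over the factors so that each factor loses at most one part, and the names `distributions` and `apply_D` are hypothetical. No terms are struck out, so the value of A is the full result of the operations.

```python
from collections import Counter

def distributions(s, factors):
    """Ways of striking out parts of total weight s, at most one part from each factor."""
    if not factors:
        if s == 0:
            yield ()
        return
    head, rest = factors[0], factors[1:]
    for tail in distributions(s, rest):           # leave the first factor untouched
        yield (head,) + tail
    for part in set(head):                        # or strike one part out of it
        if part <= s:
            reduced = list(head)
            reduced.remove(part)
            for tail in distributions(s - part, rest):
                yield (tuple(reduced),) + tail

def apply_D(s, expression):
    """Operate with D_s upon a Counter {product of monomial functions: coefficient}."""
    result = Counter()
    for factors, coeff in expression.items():
        for new_factors in distributions(s, factors):
            key = tuple(sorted(f for f in new_factors if f))   # drop factors equal to 1
            result[key] += coeff
    return result

expr = Counter({((2, 1, 1, 1), (2, 1, 1, 1, 1), (1, 1, 1, 1, 1)): 1})
expr = apply_D(5, expr)                 # (1^3)(1^4)(1^4)
for _ in range(3):
    expr = apply_D(2, expr)
stage = dict(expr)                      # the expression still to receive D_2 D_1^3
expr = apply_D(2, expr)
for _ in range(3):
    expr = apply_D(1, expr)
A = expr[()]
print(stage, A)
```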
The operator d_{1} = a_{0}∂/∂a_{1} + a_{1}∂/∂a_{2} + a_{2}∂/∂a_{3} + ..., which causes to vanish every symmetric function whose partition contains no unit (called by Cayley non-unitary symmetric functions), is of particular importance in algebraic theories. This arises from the circumstance that the general operator
λ_{0}a_{0}∂/∂a_{1} + λ_{1}a_{1}∂/∂a_{2} + λ_{2}a_{2}∂/∂a_{3} + ...
is transformed into the operator d_{1} by the substitution
(a_{0}, a_{1}, a_{2}, ...a_{s}, ...) = (a_{0}, λ_{0}a_{1}, λ_{0}λ_{1}a_{2}, ..., λ_{0}λ_{1}...λ_{s–1}a_{s}, ...),
so that the theory of the general operator is coincident with that of the particular operator d_{1}. For example, the theory of invariants may be regarded as depending upon the consideration of the symmetric functions of the differences of the roots of the equation
a_{0}x^{n} – \binom{n}{1}a_{1}x^{n–1} + \binom{n}{2}a_{2}x^{n–2} – ... = 0;
and such functions satisfy the differential equation
a_{0}∂/∂a_{1} + 2a_{1}∂/∂a_{2} + 3a_{2}∂/∂a_{3} + ... + na_{n–1}∂/∂a_{n} = 0.
For such functions remain unaltered when each root receives the same infinitesimal increment h; but writing x – h for x causes a_{0}, a_{1}, a_{2}, a_{3},... to become respectively a_{0}, a_{1} + ha_{0}, a_{2} + 2ha_{1}, a_{3} + 3ha_{2},... and ƒ(a_{0}, a_{1}, a_{2}, a_{3},...) becomes
ƒ + h(a_{0}∂/∂a_{1} + 2a_{1}∂/∂a_{2} + 3a_{2}∂/∂a_{3} + ...)ƒ,
and hence the functions satisfy the differential equation. The important result is that the theory of invariants is from a certain point of view coincident with the theory of non-unitary symmetric functions. On the one hand we may state that non-unitary symmetric functions of the roots of
a_{0}x^{n} – a_{1}x^{n–1} + a_{2}x^{n–2} – ... = 0
are symmetric functions of differences of the roots of
a_{0}x^{n} – 1!\binom{n}{1}a_{1}x^{n–1} + 2!\binom{n}{2}a_{2}x^{n–2} – ... = 0;
and on the other hand that symmetric functions of the differences of the roots of
a_{0}x^{n} – \binom{n}{1}a_{1}x^{n–1} + \binom{n}{2}a_{2}x^{n–2} – ... = 0
are non-unitary symmetric functions of the roots of
a_{0}x^{n} – (a_{1}/1!)x^{n–1} + (a_{2}/2!)x^{n–2} – ... = 0.
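As a small worked instance of the differential equation for functions of the differences of the roots, the binary quadratic may be taken. The choice n = 2 is an assumption made here only because it is the simplest case; it is not an example given in the text.

```latex
\begin{aligned}
&a_0x^2 - 2a_1x + a_2 = 0, \qquad
 \alpha_1+\alpha_2 = \frac{2a_1}{a_0}, \quad \alpha_1\alpha_2 = \frac{a_2}{a_0},\\
&a_0^2(\alpha_1-\alpha_2)^2 = 4\,(a_1^2 - a_0a_2),\\
&\Bigl(a_0\frac{\partial}{\partial a_1} + 2a_1\frac{\partial}{\partial a_2}\Bigr)(a_1^2 - a_0a_2)
 = a_0\cdot 2a_1 + 2a_1\cdot(-a_0) = 0.
\end{aligned}
```

The function a_{1}² – a_{0}a_{2}, being a function of the difference of the roots, is thus annihilated by the operator, as the general argument requires.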
An important notion in the theory of linear operators in general is that of MacMahon's multilinear operator (“Theory of a Multilinear partial Differential Operator with Applications to the Theories of Invariants and Reciprocants,” Proc. Lond. Math. Soc. t. xviii. (1886), pp. 61-88). It is defined as having four elements, and is written
(μ, ν; m, n) = (1/m)[μa_{0}^{m}∂/∂a_{n} + (μ + ν){m!/((m–1)!1!)}a_{0}^{m–1}a_{1}∂/∂a_{n+1} + (μ + 2ν){(m!/((m–1)!1!))a_{0}^{m–1}a_{2} + (m!/((m–2)!2!))a_{0}^{m–2}a_{1}²}∂/∂a_{n+2} + (μ + 3ν){(m!/((m–1)!1!))a_{0}^{m–1}a_{3} + (m!/((m–2)!1!1!))a_{0}^{m–2}a_{1}a_{2} + (m!/((m–3)!3!))a_{0}^{m–3}a_{1}³}∂/∂a_{n+3} + ...],
the coefficient of a_{0}^{k_{0}}a_{1}^{k_{1}}a_{2}^{k_{2}}... being m!/(k_{0}!k_{1}!k_{2}!...). The operators a_{0}∂/∂a_{1} + a_{1}∂/∂a_{2} + ..., a_{0}∂/∂a_{1} + 2a_{1}∂/∂a_{2} + ... are seen to be (1, 0; 1, 1) and (1, 1; 1, 1) respectively. Also the operator of the Theory of Pure Reciprocants (see Sylvester, Lectures on the New Theory of Reciprocants, Oxford, 1888) is
(4, 1; 2, 1) = (1/2){4a_{0}²∂/∂a_{1} + 10a_{0}a_{1}∂/∂a_{2} + 6(2a_{0}a_{2} + a_{1}²)∂/∂a_{3} + ...}.
It will be noticed that (μ, ν; m, n) = μ(1, 0; m, n) + ν(0, 1; m, n).
The importance of the operator consists in the fact that taking any two operators of the system
(μ, ν; m, n); (μ′, ν′; m′, n′),
the operator equivalent to
(μ, ν; m, n)(μ′, ν′; m′, n′) – (μ′, ν′; m′, n′)(μ, ν; m, n),
known as the “alternant” of the two operators, is also an operator of the same system. We have the theorem
(μ, ν; m, n)(μ′, ν′; m′, n′) – (μ′, ν′; m′, n′)(μ, ν; m, n) = (μ_{1}, ν_{1}; m_{1}, n_{1});
where
μ_{1} = (m′ + m – 1){(1/m′)μ′(μ + n′ν) – (1/m)μ(μ′ + nν′)},
ν_{1} = (n′ – n)ν′ν + ((m – 1)/m′)μ′ν – ((m′ – 1)/m)μν′,
m_{1} = m′ + m – 1, n_{1} = n′ + n,
and we conclude that qua “alternation” the operators of the system form a “group.” It is thus possible to study simultaneously all the theories which depend upon operations of the group.
Symbolic Representation of Symmetric Functions.—Denote the elementary symmetric function a_{s} by α_{1}^{s}/s!, α_{2}^{s}/s!, α_{3}^{s}/s!,... at pleasure; then, taking n equal to ∞, we may write
1 + a_{1}x + a_{2}x² + ... = (1 + ρ_{1}x)(1 + ρ_{2}x)... = e^{α_{1}x} = e^{α_{2}x} = e^{α_{3}x} = ...
where a_{s} = α_{1}^{s}/s! = α_{2}^{s}/s! = α_{3}^{s}/s! = ....
Further, let 1 -1-b i x+ b 2 x 2' +... +bmx m = (1 +Q 1 x) (1 +0 2 x)... (1 +umx); so that 1 +alal+a2a1 +... = (1 +Plat) (1 +P2(71)... = ePlal, 1 + a i Q 2+ a 2 0 2 +... _ (1 +PiQ2) (1 +P2(72)... =e2a2, 1 +aiam.+a2am+... = (1 +Plain) (1 = er,nam; and, by multiplication, II (1 +ala+a2a2+...) = II (1-}-biP+b2P 2 +... +bmP"`), a = e?l a' 1 Â°2 a 2 +.. +om a m .
Denote by brackets () and [ ] symmetric functions of the quantities p and a respectively. Then 1111 + a i[ 1 ]+ a i [ 12 1+a2[ 2 ]+ a 7 [ 13 ] +ala2[ 21 ]+a3[3]+-ï¿½ + a p1 a p2 a P 3 ï¿½ï¿½ .ap rn[Y1 p 2t' 3 ... i'mJ +-ï¿½ . 1 + b l(1) + b (12) + b 2(2) +bi (13) + b 1b2(21) + b 3(3) +... +00 2 0 ..b qm (m qm m -1 qm-1 ...2 Q2 1 s1) -{-... 2 3 m = ealal+Q2a2.. +amam Expanding the right-hand side by the exponential theorem, and then expressing the symmetric functions of al, a2, ...a m, which arise, in terms of b1, b2, ...' b., we obtain by comparison with the middle series the symbolical representation of all symmetric functions in brackets () appertaining to the quantities p i, P2, P3,ï¿½ï¿½ï¿½ To obtain particular theorems the quantities a l, a 2, a 3 , ...a, n are auxiliaries which are at our entire disposal. Thus to obtain Stroh's theory of seminvariants put b1=0-1+a2+ï¿½ï¿½.+0-m [1] =0; we then obtain the expression of non-unitary symmetric functions of the quantities p as functions of differences of the symbols a 2 , a2, a3, ...
Ex. gr. 14(22) with m =2 must be a term in eQial+?2a2= eri (a1-a2>=...-[-a1(a1-a2)4+... and since b2 =at we must have (22) =24(al-a2)4 = 24(a i+ a 2) -6(a? a2+ ala2)+4a2a2 =2a4-2ala3+a2 as is well known.
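The result of the example may be checked numerically. The following is a minimal sketch in Python, not part of the original article; the values assigned to the quantities $\rho$ are arbitrary, and the names `elementary` and `monomial_22` are illustrative only.

```python
from itertools import combinations
from math import prod

def elementary(quantities, k):
    """k-th elementary symmetric function a_k of the given quantities."""
    return sum(prod(combo) for combo in combinations(quantities, k))

def monomial_22(quantities):
    """The function (2^2): the sum of rho_i^2 * rho_j^2 over all pairs i < j."""
    return sum((u * v) ** 2 for u, v in combinations(quantities, 2))

rho = [2, -1, 3, 5, -4]  # arbitrary test values, not taken from the article
a1, a2, a3, a4 = (elementary(rho, k) for k in range(1, 5))

direct = monomial_22(rho)                      # (2^2) computed from its definition
from_formula = 2 * a4 - 2 * a1 * a3 + a2 ** 2  # the symbolic result of the example
print(direct == from_formula)
```

Any other set of values may be substituted for `rho`; the two numbers agree because the relation is an identity among symmetric functions.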
Again, if $\sigma_1, \sigma_2, \sigma_3, \ldots\sigma_m$ be the $m$th roots of $-1$, $b_1 = b_2 = \ldots = b_{m-1} = 0$ and $b_m = 1$, leading to
$$1+(m)+(m^2)+(m^3)+\ldots = e^{\sigma_1\alpha_1+\sigma_2\alpha_2+\ldots+\sigma_m\alpha_m}$$
and
$$(m^s) = \frac{1}{(ms)!}(\sigma_1\alpha_1+\sigma_2\alpha_2+\ldots+\sigma_m\alpha_m)^{sm},$$
and we see further that $(\sigma_1\alpha_1+\sigma_2\alpha_2+\ldots+\sigma_m\alpha_m)^k$ vanishes identically unless $k \equiv 0 \pmod m$. If $m$ be infinite and
$$1+b_1x+b_2x^2+\ldots = (1+\sigma_1x)(1+\sigma_2x)\ldots = e^{\beta_1x} = e^{\beta_2x} = \ldots$$
we have the symbolic identity
$$e^{\sigma_1\alpha_1+\sigma_2\alpha_2+\sigma_3\alpha_3+\ldots} = e^{\rho_1\beta_1+\rho_2\beta_2+\rho_3\beta_3+\ldots},$$
and
$$(\sigma_1\alpha_1+\sigma_2\alpha_2+\sigma_3\alpha_3+\ldots)^p = (\rho_1\beta_1+\rho_2\beta_2+\rho_3\beta_3+\ldots)^p.$$
Instead of the above symbols we may use equivalent differential operators. Thus let
$$\delta_a = a_1\partial_{a_0}+2a_2\partial_{a_1}+3a_3\partial_{a_2}+\ldots$$
and let $a, b, c, \ldots$ be equivalent quantities. Any function of differences of $\delta_a, \delta_b, \delta_c, \ldots$ being formed, the expansion being carried out, an operand $a_0$ or $b_0$ or $c_0 \ldots$ being taken and $b, c, \ldots$ being subsequently put equal to $a$, a non-unitary symmetric function will be produced. Ex. gr.
$$(\delta_a-\delta_b)^2(\delta_a-\delta_c) = (\delta_a^2-2\delta_a\delta_b+\delta_b^2)(\delta_a-\delta_c)$$
$$= \delta_a^3-2\delta_a^2\delta_b+\delta_a\delta_b^2-\delta_a^2\delta_c+2\delta_a\delta_b\delta_c-\delta_b^2\delta_c$$
$$= 6a_3-4a_2b_1+2a_1b_2-2a_2c_1+2a_1b_1c_1-2b_2c_1$$
$$= 2(a_1^3-3a_1a_2+3a_3) = 2(3).$$
The whole theory of these forms is consequently contained implicitly in the operation $\delta$.

Symmetric Functions of Several Systems of Quantities. - It will suffice to consider two systems of quantities as the corresponding theory for three or more systems is obtainable by an obvious enlargement of the nomenclature and notation.

Taking the systems of quantities to be
$$\alpha_1, \alpha_2, \alpha_3, \ldots$$
$$\beta_1, \beta_2, \beta_3, \ldots$$
we start with the fundamental relation
$$(1+\alpha_1x+\beta_1y)(1+\alpha_2x+\beta_2y)(1+\alpha_3x+\beta_3y)\ldots = 1+a_{10}x+a_{01}y+a_{20}x^2+a_{11}xy+a_{02}y^2+\ldots+a_{pq}x^py^q+\ldots$$
As shown by L. Schläfli¹ this equation may be directly formed and exhibited as the resultant of two given equations, and an arbitrary linear non-homogeneous equation in two variables. The right-hand side may be also written
$$1+\Sigma\alpha_1x+\Sigma\beta_1y+\Sigma\alpha_1\alpha_2x^2+\Sigma\alpha_1\beta_2xy+\Sigma\beta_1\beta_2y^2+\ldots$$
The most general symmetric function to be considered is
$$\Sigma\,\alpha_1^{p_1}\beta_1^{q_1}\alpha_2^{p_2}\beta_2^{q_2}\alpha_3^{p_3}\beta_3^{q_3}\ldots$$
conveniently written in the symbolic form $(\overline{p_1q_1}\ \overline{p_2q_2}\ \overline{p_3q_3}\ldots)$. Observe that the summation is in regard to the expressions obtained by permuting the $n$ suffixes $1, 2, 3, \ldots n$. The weight of the function is bipartite and consists of the two numbers $\Sigma p$ and $\Sigma q$; the symbolic expression of the symmetric function is a partition into biparts (multiparts) of the bipartite (multipartite) number $\overline{\Sigma p, \Sigma q}$. Each part of the partition is a bipartite number, and in representing the partition it is convenient to indicate repetitions of parts by power symbols. In this notation the fundamental relation is written
$$(1+\alpha_1x+\beta_1y)(1+\alpha_2x+\beta_2y)(1+\alpha_3x+\beta_3y)\ldots$$
$$= 1+(\overline{10})x+(\overline{01})y+(\overline{10}^2)x^2+(\overline{10}\ \overline{01})xy+(\overline{01}^2)y^2+(\overline{10}^3)x^3+(\overline{10}^2\,\overline{01})x^2y+(\overline{10}\ \overline{01}^2)xy^2+(\overline{01}^3)y^3+\ldots$$
where in general $a_{pq} = (\overline{10}^p\,\overline{01}^q)$.
All symmetric functions are expressible in terms of the quantities $a_{pq}$ in a rational integral form; from this property they are termed elementary functions; further they are said to be single-unitary since each part of the partition denoting $a_{pq}$ involves but a single unit.

The number of partitions of a biweight $\overline{pq}$ into exactly $i$ biparts is given (after Euler) by the coefficient of $a^ix^py^q$ in the expansion of the generating function
$$\frac{1}{(1-ax)(1-ay)(1-ax^2)(1-axy)(1-ay^2)(1-ax^3)(1-ax^2y)(1-axy^2)(1-ay^3)\ldots}$$
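The enumeration furnished by the generating function may be carried out mechanically. The following Python sketch is an illustration only, not the article's own procedure: it multiplies in the factors $1/(1-ax^ry^s)$ one at a time, and the function name `bipartite_partitions` is an assumption made for the example.

```python
def bipartite_partitions(p, q):
    """Return a list counts, where counts[i] is the number of partitions of the
    bipartite number (p, q) into exactly i biparts."""
    max_parts = p + q  # every bipart has weight at least one
    # table[i][u][v] = number of ways of reaching biweight (u, v) with i parts
    table = [[[0] * (q + 1) for _ in range(p + 1)] for _ in range(max_parts + 1)]
    table[0][0][0] = 1
    for r in range(p + 1):
        for s in range(q + 1):
            if (r, s) == (0, 0):
                continue
            # Multiply by 1/(1 - a x^r y^s): any number of parts equal to (r, s).
            for i in range(1, max_parts + 1):
                for u in range(r, p + 1):
                    for v in range(s, q + 1):
                        table[i][u][v] += table[i - 1][u - r][v - s]
    return [table[i][p][q] for i in range(max_parts + 1)]

print(bipartite_partitions(2, 1))  # [0, 1, 2, 1]
```

For the biweight 21 the four partitions are $(\overline{21})$, $(\overline{20}\ \overline{01})$, $(\overline{11}\ \overline{10})$ and $(\overline{10}^2\,\overline{01})$, in agreement with the counts printed.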
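The partitions with one bipart correspond to the sums of powers in the single system or unipartite theory; they are readily expressed in terms of the elementary functions. For write $(\overline{pq}) = s_{pq}$ and take logarithms of both sides of the fundamental relation; we obtain
$$s_{10}x+s_{01}y = \Sigma(\alpha_1x+\beta_1y),$$
$$s_{20}x^2+2s_{11}xy+s_{02}y^2 = \Sigma(\alpha_1x+\beta_1y)^2, \text{ \&c.,}$$
and
$$s_{10}x+s_{01}y-\tfrac12(s_{20}x^2+2s_{11}xy+s_{02}y^2)+\ldots = \log(1+a_{10}x+a_{01}y+\ldots+a_{pq}x^py^q+\ldots).$$
From this formula we obtain by elementary algebra
$$(-)^{p+q-1}\frac{(p+q-1)!}{p!\,q!}s_{pq} = \Sigma(-)^{\Sigma\pi-1}\frac{(\Sigma\pi-1)!}{\pi_1!\,\pi_2!\ldots}a_{p_1q_1}^{\pi_1}a_{p_2q_2}^{\pi_2}\ldots$$
the summation being for all partitions $(\overline{p_1q_1}^{\pi_1}\,\overline{p_2q_2}^{\pi_2}\ldots)$ of the biweight, corresponding to Thomas Waring's formula for the single system. The analogous formula appertaining to $n$ systems of quantities which expresses $s_{pq\ldots}$ in terms of elementary functions can be at once written down.

¹ Vienna Transactions, t. iv. 1852.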
We can verify the relations
$$s_{30} = a_{10}^3-3a_{20}a_{10}+3a_{30},$$
$$s_{21} = a_{10}^2a_{01}-a_{20}a_{01}-a_{11}a_{10}+a_{21}.$$
The formula actually gives the expression of $(\overline{pq})$ by means of separations of $(\overline{10}^p\,\overline{01}^q)$, which is one of the partitions of $(\overline{pq})$. This is the true standpoint from which the theorem should be regarded. It is but a particular case of a general theory of expressibility.
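The two relations just written may be tested on numerical systems of quantities. The sketch below is in Python and is offered as an illustration only; the values of $\alpha$ and $\beta$ are arbitrary, and the helper names `elementary_functions` and `power_sum` are hypothetical.

```python
def elementary_functions(alpha, beta):
    """Coefficients a_pq of x^p y^q in prod(1 + alpha_i*x + beta_i*y),
    returned as a dict {(p, q): value}."""
    coeffs = {(0, 0): 1}
    for al, be in zip(alpha, beta):
        updated = {}
        for (p, q), c in coeffs.items():
            # multiply the running product by (1 + al*x + be*y)
            for (dp, dq), factor in (((0, 0), 1), ((1, 0), al), ((0, 1), be)):
                key = (p + dp, q + dq)
                updated[key] = updated.get(key, 0) + c * factor
        coeffs = updated
    return coeffs

def power_sum(alpha, beta, p, q):
    """The single bipart function s_pq = sum of alpha_i^p * beta_i^q."""
    return sum(al ** p * be ** q for al, be in zip(alpha, beta))

alpha = [2, -3, 5, 7]   # arbitrary test values
beta = [1, 4, -2, 3]
a = elementary_functions(alpha, beta)

s30_formula = a[1, 0] ** 3 - 3 * a[2, 0] * a[1, 0] + 3 * a[3, 0]
s21_formula = (a[1, 0] ** 2 * a[0, 1] - a[2, 0] * a[0, 1]
               - a[1, 1] * a[1, 0] + a[2, 1])

print(s30_formula == power_sum(alpha, beta, 3, 0))
print(s21_formula == power_sum(alpha, beta, 2, 1))
```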
To invert the formula we may write
$$1+a_{10}x+a_{01}y+\ldots+a_{pq}x^py^q+\ldots = \exp\{(s_{10}x+s_{01}y)-\tfrac12(s_{20}x^2+2s_{11}xy+s_{02}y^2)+\ldots\},$$
and thence derive the formula
$$(-)^{p+q-1}a_{pq} = \Sigma\left\{\frac{(p_1+q_1-1)!}{p_1!\,q_1!}\right\}^{\pi_1}\left\{\frac{(p_2+q_2-1)!}{p_2!\,q_2!}\right\}^{\pi_2}\ldots\frac{(-)^{\Sigma\pi-1}}{\pi_1!\,\pi_2!\ldots}s_{p_1q_1}^{\pi_1}s_{p_2q_2}^{\pi_2}\ldots$$
which expresses the elementary functions in terms of the single bipart functions. The similar theorem for $n$ systems of quantities can be at once written down.

It will be shown later that every rational integral symmetric function is similarly expressible.
The Function $h_{pq}$. - As the definition of $h_{pq}$ we take
$$1+h_{10}x+h_{01}y+\ldots+h_{pq}x^py^q+\ldots = \frac{1}{(1-\alpha_1x-\beta_1y)(1-\alpha_2x-\beta_2y)\ldots};$$
and now expanding the right-hand side
$$h_{pq} = \Sigma\frac{(p_1+q_1)!}{p_1!\,q_1!}\,\frac{(p_2+q_2)!}{p_2!\,q_2!}\ldots(\overline{p_1q_1}\ \overline{p_2q_2}\ldots),$$
the summation being for all partitions of the biweight. Further writing
$$1+h_{10}x+h_{01}y+\ldots+h_{pq}x^py^q+\ldots = \frac{1}{1-a_{10}x-a_{01}y+\ldots+(-)^{p+q}a_{pq}x^py^q+\ldots},$$
we find that the effect of changing the signs of both $x$ and $y$ is merely to interchange the symbols $a$ and $h$; hence in any relation connecting the quantities $h_{pq}$ with the quantities $a_{pq}$ we are at liberty to interchange the symbols $a$ and $h$. By the exponential and multinomial theorems we obtain the results
$$(-)^{p+q-1}h_{pq} = \Sigma(-)^{\Sigma\pi-1}\frac{(\Sigma\pi)!}{\pi_1!\,\pi_2!\ldots}a_{p_1q_1}^{\pi_1}a_{p_2q_2}^{\pi_2}\ldots$$
and in this $a$ and $h$ are interchangeable;
$$h_{pq} = \Sigma\left\{\frac{(p_1+q_1-1)!}{p_1!\,q_1!}\right\}^{\pi_1}\left\{\frac{(p_2+q_2-1)!}{p_2!\,q_2!}\right\}^{\pi_2}\ldots\frac{1}{\pi_1!\,\pi_2!\ldots}s_{p_1q_1}^{\pi_1}s_{p_2q_2}^{\pi_2}\ldots$$

Differential Operations. - If, in the identity
$$(1+\alpha_1x+\beta_1y)(1+\alpha_2x+\beta_2y)\ldots(1+\alpha_nx+\beta_ny) = 1+a_{10}x+a_{01}y+a_{20}x^2+a_{11}xy+a_{02}y^2+\ldots,$$
we multiply each side by $(1+\mu x+\nu y)$, the right-hand side becomes
$$1+(a_{10}+\mu)x+(a_{01}+\nu)y+\ldots+(a_{pq}+\mu a_{p-1,q}+\nu a_{p,q-1})x^py^q+\ldots;$$
hence any rational integral function of the coefficients $a_{10}, a_{01}, \ldots$, say $f(a_{10}, a_{01}, \ldots) = f$, is converted into
$$\overline{\exp}(\mu d_{10}+\nu d_{01})f$$
where
$$d_{10} = \Sigma\,a_{p-1,q}\frac{\partial}{\partial a_{pq}},\qquad d_{01} = \Sigma\,a_{p,q-1}\frac{\partial}{\partial a_{pq}}.$$
The rule over exp will serve to denote that $\mu d_{10}+\nu d_{01}$ is to be raised to the various powers symbolically as in Taylor's theorem.
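Writing
$$D_{pq} = \frac{1}{p!\,q!}\overline{d_{10}^{\,p}d_{01}^{\,q}},$$
$$\overline{\exp}(\mu d_{10}+\nu d_{01})f = (1+\mu D_{10}+\nu D_{01}+\ldots+\mu^p\nu^qD_{pq}+\ldots)f;$$
now, since the introduction of the new quantities $\mu, \nu$ results in the addition to the function $(\overline{p_1q_1}\ \overline{p_2q_2}\ \overline{p_3q_3}\ldots)$ of the new terms
$$\mu^{p_1}\nu^{q_1}(\overline{p_2q_2}\ \overline{p_3q_3}\ldots)+\mu^{p_2}\nu^{q_2}(\overline{p_1q_1}\ \overline{p_3q_3}\ldots)+\mu^{p_3}\nu^{q_3}(\overline{p_1q_1}\ \overline{p_2q_2}\ldots)+\ldots,$$
we find
$$D_{p_1q_1}(\overline{p_1q_1}\ \overline{p_2q_2}\ \overline{p_3q_3}\ldots) = (\overline{p_2q_2}\ \overline{p_3q_3}\ldots),$$
and thence
$$D_{p_1q_1}D_{p_2q_2}D_{p_3q_3}\ldots(\overline{p_1q_1}\ \overline{p_2q_2}\ \overline{p_3q_3}\ldots) = 1;$$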
while $D_{rs}f = 0$ unless the part $\overline{rs}$ is involved in $f$. We may then state that $D_{pq}$ is an operation which obliterates one part $\overline{pq}$ when such part is present, but in the contrary case causes the function to vanish. From the above $D_{pq}$ is an operator of order $pq$, but it is convenient for some purposes to obtain its expression in the form of a number of terms, each of which denotes $pq$ successive linear operations: to accomplish this write
$$d_{pq} = \Sigma\,a_{rs}\frac{\partial}{\partial a_{p+r,q+s}}$$
and note the general result
$$\overline{\exp}(m_{10}d_{10}+m_{01}d_{01}+\ldots+m_{pq}d_{pq}+\ldots) = \exp(M_{10}d_{10}+M_{01}d_{01}+\ldots+M_{pq}d_{pq}+\ldots),$$
where the multiplications on the left- and right-hand sides of the equation are symbolic and unsymbolic respectively, provided that $m_{pq}$, $M_{pq}$ are quantities which satisfy the relation
$$\exp(M_{10}\xi+M_{01}\eta+\ldots+M_{pq}\xi^p\eta^q+\ldots) = 1+m_{10}\xi+m_{01}\eta+\ldots+m_{pq}\xi^p\eta^q+\ldots;$$
where $\xi, \eta$ are undetermined algebraic quantities. In the present particular case putting $m_{10} = \mu$, $m_{01} = \nu$ and $m_{pq} = 0$ otherwise,
$$M_{10}\xi+M_{01}\eta+\ldots+M_{pq}\xi^p\eta^q+\ldots = \log(1+\mu\xi+\nu\eta),$$
$$M_{pq} = (-)^{p+q-1}\frac{(p+q-1)!}{p!\,q!}\mu^p\nu^q;$$
and the result is thus
$$\overline{\exp}(\mu d_{10}+\nu d_{01}) = \exp\{\mu d_{10}+\nu d_{01}-\tfrac12(\mu^2d_{20}+2\mu\nu d_{11}+\nu^2d_{02})+\ldots\} = 1+\mu D_{10}+\nu D_{01}+\ldots+\mu^p\nu^qD_{pq}+\ldots;$$
and thence
$$\mu d_{10}+\nu d_{01}-\tfrac12(\mu^2d_{20}+2\mu\nu d_{11}+\nu^2d_{02})+\ldots = \log(1+\mu D_{10}+\nu D_{01}+\ldots+\mu^p\nu^qD_{pq}+\ldots).$$
From these formulae we derive two important relations,
$$(-)^{p+q-1}\frac{(p+q-1)!}{p!\,q!}d_{pq} = \Sigma(-)^{\Sigma\pi-1}\frac{(\Sigma\pi-1)!}{\pi_1!\,\pi_2!\ldots}D_{p_1q_1}^{\pi_1}D_{p_2q_2}^{\pi_2}\ldots,$$
$$(-)^{p+q-1}D_{pq} = \Sigma\left\{\frac{(p_1+q_1-1)!}{p_1!\,q_1!}\right\}^{\pi_1}\left\{\frac{(p_2+q_2-1)!}{p_2!\,q_2!}\right\}^{\pi_2}\ldots\frac{(-)^{\Sigma\pi-1}}{\pi_1!\,\pi_2!\ldots}d_{p_1q_1}^{\pi_1}d_{p_2q_2}^{\pi_2}\ldots,$$
the last written relation having, in regard to each term on the right-hand side, to do with $\Sigma\pi$ successive linear operations. Recalling the formulae above which connect $s_{pq}$ and $a_{pq}$, we see that $d_{pq}$ and $D_{pq}$ are in co-relation with these quantities respectively, and may be said to be operations which correspond to the partitions $(\overline{pq})$, $(\overline{10}^p\,\overline{01}^q)$ respectively. We might conjecture from this observation that every partition is in correspondence with some operation; this is found to be the case, and it has been shown (loc. cit. p. 493) that the operation
$$\frac{1}{\pi_1!\,\pi_2!\ldots}d_{p_1q_1}^{\pi_1}d_{p_2q_2}^{\pi_2}\ldots\quad\text{(multiplication symbolic)}$$
corresponds to the partition
$$(\overline{p_1q_1}^{\pi_1}\,\overline{p_2q_2}^{\pi_2}\ldots).$$
The partitions being taken as denoting symmetric functions we have complete correspondence between the algebras of quantity and operation, and from any algebraic formula we can at once write down an operation formula. This fact is of extreme importance in the theory of algebraic forms, and is easily representable whatever be the number of the systems of quantities.
We may remark the particular result
$$(-)^{p+q-1}\frac{(p+q-1)!}{p!\,q!}d_{pq}s_{pq} = D_{pq}(\overline{pq}) = 1;$$
$d_{pq}$ causes every other single part function to vanish, and must cause any monomial function to vanish which does not comprise one of the partitions of the biweight $\overline{pq}$ amongst its parts.

Since
$$d_{pq} = (-)^{p+q-1}\frac{p!\,q!}{(p+q-1)!}\frac{\partial}{\partial s_{pq}},$$
the solutions of the partial differential equation $d_{pq} = 0$ are the single bipart forms, omitting $s_{pq}$, and we have seen that the solutions of $D_{pq} = 0$ are those monomial functions in which the part $\overline{pq}$ is absent.
One more relation is easily obtained, viz.
$$\frac{\partial}{\partial a_{pq}} = d_{pq}-h_{10}d_{p+1,q}-h_{01}d_{p,q+1}+\ldots+(-)^{r+s}h_{rs}d_{p+r,q+s}+\ldots$$

References For Symmetric Functions.-Albert Girard, Invention nouvelle en l'algèbre (Amsterdam, 1629); Thomas Waring, Meditationes Algebraicae (London, 1782); Lagrange, Mém. de l'acad. de Berlin (1768); Meyer-Hirsch, Sammlung von Aufgaben aus der Theorie der algebraischen Gleichungen (Berlin, 1809); Serret, Cours d'algèbre supérieure, t. iii. (Paris, 1885); Unferdinger, Sitzungsber. d. Acad. d. Wissensch. i. Wien, Bd. lx. (Vienna, 1869); L. Schläfli, "Ueber die Resultante eines Systemes mehrerer algebraischen Gleichungen," Vienna Transactions, t. iv. 1852; MacMahon, "Memoirs on a New Theory of Symmetric Functions," American Journal of Mathematics, Baltimore, Md. 1888-1890; "Memoir on Symmetric Functions of Roots of Systems of Equations," Phil. Trans. 1890.

¹ Phil. Trans., 1890, p. 490.
III. THE THEORY OF BINARY FORMS

A binary form of order $n$ is a homogeneous polynomial of the $n$th degree in two variables. It may be written in the form
$$ax_1^n+bx_1^{n-1}x_2+cx_1^{n-2}x_2^2+\ldots;$$
or in the form
$$ax_1^n+\binom{n}{1}bx_1^{n-1}x_2+\binom{n}{2}cx_1^{n-2}x_2^2+\ldots$$
which Cayley denotes by $(a, b, c, \ldots)(x_1, x_2)^n$, $\binom{n}{1}, \binom{n}{2}, \ldots$ being a notation for the successive binomial coefficients $n$, $\tfrac12n(n-1)$, .... Other forms are
$$ax_1^n+nbx_1^{n-1}x_2+n(n-1)cx_1^{n-2}x_2^2+\ldots,$$
the binomial coefficients $\binom{n}{s}$ being replaced by $s!\binom{n}{s}$, and
$$ax_1^n+\frac{1}{1!}bx_1^{n-1}x_2+\frac{1}{2!}cx_1^{n-2}x_2^2+\ldots,$$
the special convenience of which will appear later. For present purposes the form will be written
$$\bar{a}_0x_1^n+\binom{n}{1}\bar{a}_1x_1^{n-1}x_2+\binom{n}{2}\bar{a}_2x_1^{n-2}x_2^2+\ldots+\bar{a}_nx_2^n,$$
the notation adopted by German writers; the literal coefficients have a rule placed over them to distinguish them from umbral coefficients which are introduced almost at once. The coefficients $\bar{a}_0, \bar{a}_1, \bar{a}_2, \ldots\bar{a}_n$, $n+1$ in number, are arbitrary. If the form, sometimes termed a quantic, be equated to zero the $n+1$ coefficients are equivalent to but $n$, since one can be made unity by division and the equation is to be regarded as one for the determination of the ratio of the variables.
If the variables of the quantic $f(x_1, x_2)$ be subjected to the linear transformation
$$x_1 = \alpha_{11}\xi_1+\alpha_{12}\xi_2,\qquad x_2 = \alpha_{21}\xi_1+\alpha_{22}\xi_2,$$
$\xi_1, \xi_2$ being new variables replacing $x_1, x_2$ and the coefficients $\alpha_{11}, \alpha_{12}, \alpha_{21}, \alpha_{22}$, termed the coefficients of substitution (or of transformation), being constants, we arrive at a transformed quantic
$$f'(\xi_1, \xi_2) = \bar{a}'_0\xi_1^n+\binom{n}{1}\bar{a}'_1\xi_1^{n-1}\xi_2+\binom{n}{2}\bar{a}'_2\xi_1^{n-2}\xi_2^2+\ldots$$
in the new variables which is of the same order as the original quantic; the new coefficients $\bar{a}'_0, \bar{a}'_1, \bar{a}'_2, \ldots\bar{a}'_n$ are linear functions of the original coefficients, and also linear functions of products, of the coefficients of substitution, of the $n$th degree.

By solving the equations of transformation we obtain
$$r\xi_1 = \alpha_{22}x_1-\alpha_{12}x_2,\qquad r\xi_2 = -\alpha_{21}x_1+\alpha_{11}x_2,$$
where
$$r = \begin{vmatrix}\alpha_{11} & \alpha_{12}\\ \alpha_{21} & \alpha_{22}\end{vmatrix} = \alpha_{11}\alpha_{22}-\alpha_{12}\alpha_{21};$$
$r$ is termed the determinant of substitution or modulus of transformation; we assume $x_1, x_2$ to be independents, so that $r$ must differ from zero.
In the theory of forms we seek functions of the coefficients and variables of the original quantic which, save as to a power of the modulus of transformation, are equal to the like functions of the coefficients and variables of the transformed quantic. We may have such a function which does not involve the variables, viz.
$$F(\bar{a}'_0, \bar{a}'_1, \bar{a}'_2, \ldots\bar{a}'_n) = r^{\lambda}F(\bar{a}_0, \bar{a}_1, \bar{a}_2, \ldots\bar{a}_n),$$
the function $F(\bar{a}_0, \bar{a}_1, \bar{a}_2, \ldots\bar{a}_n)$ is then said to be an invariant of the quantic qua linear transformation. If, however, $F$ involve as well the variables, viz.
$$F(\bar{a}'_0, \bar{a}'_1, \bar{a}'_2, \ldots; \xi_1, \xi_2) = r^{\lambda}F(\bar{a}_0, \bar{a}_1, \bar{a}_2, \ldots; x_1, x_2),$$
the function $F(\bar{a}_0, \bar{a}_1, \bar{a}_2, \ldots; x_1, x_2)$ is said to be a covariant of the quantic. The expression "invariantive forms" includes both invariants and covariants, and frequently also other analogous forms which will be met with. Occasionally the word "invariants" includes covariants; when this is so it will be implied by the text. Invariantive forms will be found to be homogeneous functions alike of the coefficients and of the variables. Instead of a single quantic we may have several
$$f(\bar{a}_0, \bar{a}_1, \bar{a}_2, \ldots; x_1, x_2),\quad \phi(\bar{b}_0, \bar{b}_1, \bar{b}_2, \ldots; x_1, x_2),\ \ldots$$
which have different coefficients, the same variables, and are of the same or different degrees in the variables; we may transform them all by the same substitution, so that they become
$$f'(\bar{a}'_0, \bar{a}'_1, \bar{a}'_2, \ldots; \xi_1, \xi_2),\quad \phi'(\bar{b}'_0, \bar{b}'_1, \bar{b}'_2, \ldots; \xi_1, \xi_2),\ \ldots$$
If then we find
$$F(\bar{a}'_0, \bar{a}'_1, \bar{a}'_2, \ldots\bar{b}'_0, \bar{b}'_1, \bar{b}'_2, \ldots, \ldots; \xi_1, \xi_2) = r^{\lambda}F(\bar{a}_0, \bar{a}_1, \bar{a}_2, \ldots\bar{b}_0, \bar{b}_1, \bar{b}_2, \ldots, \ldots; x_1, x_2),$$
the function $F$, on the right which multiplies $r^{\lambda}$, is said to be a simultaneous invariant or covariant of the system of quantics. This notion is fundamental in the present theory because we will find that one of the most valuable artifices for finding invariants of a single quantic is first to find simultaneous invariants of several different quantics, and subsequently to make all the quantics identical. Moreover, instead of having one pair of variables $x_1, x_2$ we may have several pairs $y_1, y_2$; $z_1, z_2$; ... in addition, and transform each pair to a new pair by substitutions, having the same coefficients $\alpha_{11}, \alpha_{12}, \alpha_{21}, \alpha_{22}$ and arrive at functions of the original coefficients and variables (of one or more quantics) which possess the above defined invariant property. A particular quantic of the system may be of the same or different degrees in the pairs of variables which it involves, and these degrees may vary from quantic to quantic of the system. Such quantics have been termed by Cayley multipartite.
Symbolic Form.-Restricting consideration, for the present, to binary forms in a single pair of variables, we must introduce the symbolic form of Aronhold, Clebsch and Gordan; they write the form
$$(a_1x_1+a_2x_2)^n = a_1^nx_1^n+\binom{n}{1}a_1^{n-1}a_2x_1^{n-1}x_2+\ldots+a_2^nx_2^n = a_x^n$$
wherein $a_1, a_2$ are umbrae, such that
$$a_1^n,\ a_1^{n-1}a_2,\ \ldots\ a_1a_2^{n-1},\ a_2^n$$
are symbolical representations of the real coefficients $\bar{a}_0, \bar{a}_1, \ldots\bar{a}_{n-1}, \bar{a}_n$, and in general $a_1^{n-k}a_2^k$ is the symbol for $\bar{a}_k$. If we restrict ourselves to this set of symbols we can uniquely pass from a product of real coefficients to the symbolic representations of such product, but we cannot, uniquely, from the symbols recover the real form. This is clear because we can write
$$\bar{a}_1\bar{a}_2 = a_1^{n-1}a_2\cdot a_1^{n-2}a_2^2 = a_1^{2n-3}a_2^3$$
while the same product of umbrae arises from
$$\bar{a}_0\bar{a}_3 = a_1^n\cdot a_1^{n-3}a_2^3 = a_1^{2n-3}a_2^3.$$
Hence it becomes necessary to have more than one set of umbrae, so that we may have more than one symbolical representation of the same real coefficients. We consider the quantic to have any number of equivalent representations $a_x^n \equiv b_x^n \equiv c_x^n \equiv \ldots$ So that
$$a_1^{n-k}a_2^k \equiv b_1^{n-k}b_2^k \equiv c_1^{n-k}c_2^k = \ldots = \bar{a}_k;$$
and if we wish to denote, by umbrae, a product of coefficients of degree $s$ we employ $s$ sets of umbrae.

Ex. gr. We write
$$\bar{a}_1\bar{a}_2 = a_1^{n-1}a_2\cdot b_1^{n-2}b_2^2,\qquad \bar{a}_3^3 = a_1^{n-3}a_2^3\cdot b_1^{n-3}b_2^3\cdot c_1^{n-3}c_2^3,$$
and so on whenever we require to represent a product of real coefficients symbolically; we then have a one-to-one correspondence between the products of real coefficients and their symbolic forms. If we have a function of degree $s$ in the coefficients, we may select any $s$ sets of umbrae for use, and having made a selection we may when only one quantic is under consideration at any time permute the sets of umbrae in any manner without altering the real significance of the symbolism.

Ex. gr. To express the function $\bar{a}_0\bar{a}_2-\bar{a}_1^2$, which is the discriminant of the binary quadratic
$$\bar{a}_0x_1^2+2\bar{a}_1x_1x_2+\bar{a}_2x_2^2 = a_x^2 = b_x^2,$$
in a symbolic form we have
$$2(\bar{a}_0\bar{a}_2-\bar{a}_1^2) = \bar{a}_0\bar{a}_2+\bar{a}_0\bar{a}_2-2\bar{a}_1\cdot\bar{a}_1 = a_1^2b_2^2+a_2^2b_1^2-2a_1a_2b_1b_2 = (a_1b_2-a_2b_1)^2.$$
Such an expression as $a_1b_2-a_2b_1$, which is
$$\frac{\partial a_x}{\partial x_1}\frac{\partial b_x}{\partial x_2}-\frac{\partial a_x}{\partial x_2}\frac{\partial b_x}{\partial x_1},$$
is usually written $(ab)$ for brevity; in the same notation the determinant, whose rows are $a_1, a_2, a_3$; $b_1, b_2, b_3$; $c_1, c_2, c_3$ respectively, is written $(abc)$ and so on. It should be noticed that the real function denoted by $(ab)^2$ is not the square of a real function denoted by $(ab)$. For a single quantic of the first order $(ab)$ is the symbol of a function of the coefficients which vanishes identically; thus
$$(ab) = a_1b_2-a_2b_1 = \bar{a}_0\bar{a}_1-\bar{a}_1\bar{a}_0 = 0$$
and, indeed, from a remark made above we see that $(ab)$ remains unchanged by interchange of $a$ and $b$; but $(ab) = -(ba)$, and these two facts necessitate $(ab) = 0$.
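To find the effect of linear transformation on the symbolic form of quantic we will disuse the coefficients $\alpha_{11}, \alpha_{12}, \alpha_{21}, \alpha_{22}$, and employ $\lambda_1, \mu_1, \lambda_2, \mu_2$. For the substitution
$$x_1 = \lambda_1\xi_1+\mu_1\xi_2,\qquad x_2 = \lambda_2\xi_1+\mu_2\xi_2,$$
of modulus
$$\begin{vmatrix}\lambda_1 & \mu_1\\ \lambda_2 & \mu_2\end{vmatrix} = (\lambda_1\mu_2-\lambda_2\mu_1) = (\lambda\mu),$$
the quadratic form $\bar{a}_0x_1^2+2\bar{a}_1x_1x_2+\bar{a}_2x_2^2 = a_x^2 = f(x)$, becomes
$$\bar{A}_0\xi_1^2+2\bar{A}_1\xi_1\xi_2+\bar{A}_2\xi_2^2 = A_\xi^2 = \phi(\xi),$$
where
$$\bar{A}_0 = \bar{a}_0\lambda_1^2+2\bar{a}_1\lambda_1\lambda_2+\bar{a}_2\lambda_2^2,$$
$$\bar{A}_1 = \bar{a}_0\lambda_1\mu_1+\bar{a}_1(\lambda_1\mu_2+\lambda_2\mu_1)+\bar{a}_2\lambda_2\mu_2,$$
$$\bar{A}_2 = \bar{a}_0\mu_1^2+2\bar{a}_1\mu_1\mu_2+\bar{a}_2\mu_2^2.$$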
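The transformed coefficients of the quadratic, and the invariant property of its discriminant, may be verified on numbers. The Python sketch below is illustrative and not part of the original text; the numerical coefficients and the function names are arbitrary assumptions. It checks that the form with the new coefficients agrees with the original form after substitution, and that the discriminant is reproduced multiplied by the square of the modulus.

```python
def transform_quadratic(a0, a1, a2, l1, m1, l2, m2):
    """New coefficients of a0*x1^2 + 2*a1*x1*x2 + a2*x2^2 under
    x1 = l1*xi1 + m1*xi2, x2 = l2*xi1 + m2*xi2."""
    A0 = a0 * l1 ** 2 + 2 * a1 * l1 * l2 + a2 * l2 ** 2
    A1 = a0 * l1 * m1 + a1 * (l1 * m2 + l2 * m1) + a2 * l2 * m2
    A2 = a0 * m1 ** 2 + 2 * a1 * m1 * m2 + a2 * m2 ** 2
    return A0, A1, A2

def quadratic(c0, c1, c2, u, v):
    """Value of the quadratic c0*u^2 + 2*c1*u*v + c2*v^2."""
    return c0 * u ** 2 + 2 * c1 * u * v + c2 * v ** 2

a0, a1, a2 = 3, -1, 4             # arbitrary real coefficients
l1, m1, l2, m2 = 2, 1, 5, -3      # arbitrary substitution
modulus = l1 * m2 - l2 * m1       # the determinant (lambda mu)

A0, A1, A2 = transform_quadratic(a0, a1, a2, l1, m1, l2, m2)

xi1, xi2 = 7, -2                  # an arbitrary point in the new variables
x1, x2 = l1 * xi1 + m1 * xi2, l2 * xi1 + m2 * xi2

same_value = quadratic(A0, A1, A2, xi1, xi2) == quadratic(a0, a1, a2, x1, x2)
invariant = A0 * A2 - A1 ** 2 == modulus ** 2 * (a0 * a2 - a1 ** 2)
print(same_value, invariant)
```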
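We pass to the symbolic forms
$$a_x^2 = (a_1x_1+a_2x_2)^2,\qquad A_\xi^2 = (A_1\xi_1+A_2\xi_2)^2$$
by writing for $\bar{a}_0, \bar{a}_1, \bar{a}_2$ the symbols $a_1^2, a_1a_2, a_2^2$, and for $\bar{A}_0, \bar{A}_1, \bar{A}_2$ the symbols $A_1^2, A_1A_2, A_2^2$, and then
$$\bar{A}_0 = a_1^2\lambda_1^2+2a_1a_2\lambda_1\lambda_2+a_2^2\lambda_2^2 = (a_1\lambda_1+a_2\lambda_2)^2 = a_\lambda^2,$$
$$\bar{A}_1 = (a_1\lambda_1+a_2\lambda_2)(a_1\mu_1+a_2\mu_2) = a_\lambda a_\mu,$$
$$\bar{A}_2 = (a_1\mu_1+a_2\mu_2)^2 = a_\mu^2;$$
so that
$$A_\xi^2 = a_\lambda^2\xi_1^2+2a_\lambda a_\mu\xi_1\xi_2+a_\mu^2\xi_2^2 = (a_\lambda\xi_1+a_\mu\xi_2)^2;$$
whence $A_1, A_2$ become $a_\lambda, a_\mu$ respectively and $\phi(\xi) = (a_\lambda\xi_1+a_\mu\xi_2)^2$.

The practical result of the transformation is to change the umbrae $a_1, a_2$ into the umbrae $a_\lambda = a_1\lambda_1+a_2\lambda_2$, $a_\mu = a_1\mu_1+a_2\mu_2$ respectively.

By similarly transforming the binary $n$-ic form $a_x^n$ we find
$$\bar{A}_0 = (a_1\lambda_1+a_2\lambda_2)^n = a_\lambda^n = A_1^n,$$
$$\bar{A}_1 = (a_1\lambda_1+a_2\lambda_2)^{n-1}(a_1\mu_1+a_2\mu_2) = a_\lambda^{n-1}a_\mu = A_1^{n-1}A_2,$$
$$\ldots$$
$$\bar{A}_k = (a_1\lambda_1+a_2\lambda_2)^{n-k}(a_1\mu_1+a_2\mu_2)^k = a_\lambda^{n-k}a_\mu^k = A_1^{n-k}A_2^k,$$
so that the umbrae $A_1, A_2$ are $a_\lambda, a_\mu$ respectively.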
Theorem.-When the binary form
$$a_x^n = (a_1x_1+a_2x_2)^n$$
is transformed to
$$A_\xi^n = (A_1\xi_1+A_2\xi_2)^n$$
by the substitutions
$$x_1 = \lambda_1\xi_1+\mu_1\xi_2,\qquad x_2 = \lambda_2\xi_1+\mu_2\xi_2,$$
the umbrae $A_1, A_2$ are expressed in terms of the umbrae $a_1, a_2$ by the formulae
$$A_1 = \lambda_1a_1+\lambda_2a_2,\qquad A_2 = \mu_1a_1+\mu_2a_2.$$
We gather that $A_1, A_2$ are transformed to $a_1, a_2$ in such wise that the determinant of transformation reads by rows as the original determinant reads by columns, and that the modulus of the transformation is, as before, $(\lambda\mu)$. For this reason the umbrae $A_1, A_2$ are said to be contragredient to $x_1, x_2$. If we solve the equations connecting the original and transformed umbrae we find
$$(\lambda\mu)(-a_2) = \lambda_1(-A_2)+\mu_1A_1,$$
$$(\lambda\mu)\,a_1 = \lambda_2(-A_2)+\mu_2A_1,$$
and we find that, except for the factor $(\lambda\mu)$, $-a_2$ and $+a_1$ are transformed to $-A_2$ and $+A_1$ by the same substitutions as $x_1$ and $x_2$ are transformed to $\xi_1$ and $\xi_2$. For this reason the umbrae $-a_2, a_1$ are said to be cogredient to $x_1$ and $x_2$. We frequently meet with cogredient and contragredient quantities, and we have in general the following definitions:-(1) "If two equally numerous sets of quantities $x, y, z, \ldots$ $x', y', z', \ldots$ are such that whenever one set $x, y, z, \ldots$ is expressed in terms of new quantities $X, Y, Z, \ldots$ the second set $x', y', z', \ldots$ is expressed in terms of other new quantities $X', Y', Z', \ldots$ by the same scheme of linear substitution the two sets are said to be cogredient quantities." (2) "Two sets of quantities $x, y, z, \ldots$; $\xi, \eta, \zeta, \ldots$ are said to be contragredient when the linear substitutions for the first set are
$$x = \lambda_1X+\mu_1Y+\nu_1Z+\ldots,$$
$$y = \lambda_2X+\mu_2Y+\nu_2Z+\ldots,$$
$$z = \lambda_3X+\mu_3Y+\nu_3Z+\ldots,$$
and these are associated with the following formulae appertaining to the second set,
$$\Xi = \lambda_1\xi+\lambda_2\eta+\lambda_3\zeta+\ldots,$$
$$\mathrm{H} = \mu_1\xi+\mu_2\eta+\mu_3\zeta+\ldots,$$
$$\mathrm{Z} = \nu_1\xi+\nu_2\eta+\nu_3\zeta+\ldots,$$
wherein it should be noticed that new quantities are expressed in terms of the old, as regards the latter set, and not vice versa."
Ex. gr. The symbols $\dfrac{d}{dx}, \dfrac{d}{dy}, \dfrac{d}{dz}, \ldots$ are contragredient with the variables $x, y, z, \ldots$ for when
$$(x, y, z, \ldots) = \begin{pmatrix}\lambda_1, & \mu_1, & \nu_1, & \ldots\\ \lambda_2, & \mu_2, & \nu_2, & \ldots\\ \lambda_3, & \mu_3, & \nu_3, & \ldots\\ \ldots\end{pmatrix}(X, Y, Z, \ldots),$$
$$\left(\frac{d}{dX}, \frac{d}{dY}, \frac{d}{dZ}, \ldots\right) = \begin{pmatrix}\lambda_1, & \lambda_2, & \lambda_3, & \ldots\\ \mu_1, & \mu_2, & \mu_3, & \ldots\\ \nu_1, & \nu_2, & \nu_3, & \ldots\\ \ldots\end{pmatrix}\left(\frac{d}{dx}, \frac{d}{dy}, \frac{d}{dz}, \ldots\right).$$
Observe the notation, which is that introduced by Cayley into the theory of matrices which he himself created.

Just as cogrediency leads to a theory of covariants, so contragrediency leads to a theory of contravariants. If $u$, a quantic in $x, y, z, \ldots$, be expressed in terms of new variables $X, Y, Z \ldots$; and if $\xi, \eta, \zeta, \ldots$, be quantities contragredient to $x, y, z, \ldots$; there are found to exist functions of $\xi, \eta, \zeta, \ldots$, and of the coefficients in $u$, which need, at most, be multiplied by powers of the modulus to be made equal to the same functions of $\Xi, \mathrm{H}, \mathrm{Z}, \ldots$ of the transformed coefficients of $u$; such functions are called contravariants of $u$. There also exist functions, which involve both sets of variables as well as the coefficients of $u$, possessing a like property; such have been termed mixed concomitants, and they, like contravariants, may appertain as well to a system of forms as to a single form.
As between the original and transformed quantic we have the umbral relations
$$A_1 = \lambda_1a_1+\lambda_2a_2,\qquad A_2 = \mu_1a_1+\mu_2a_2,$$
and for a second form
$$B_1 = \lambda_1b_1+\lambda_2b_2,\qquad B_2 = \mu_1b_1+\mu_2b_2.$$
The original forms are $a_x^m$, $b_x^n$, and we may regard them either as different forms or as equivalent representations of the same form. In other words, $B, b$ may be regarded as different or alternative symbols to $A, a$. In either case
$$(AB) = A_1B_2-A_2B_1 = (\lambda\mu)(ab);$$
and, from the definition, $(ab)$ possesses the invariant property. We cannot, however, say that it is an invariant unless it is expressible in terms of the real coefficients. Since $(ab) = a_1b_2-a_2b_1$, that this may be the case each form must be linear; and if the forms be different $(ab)$ is an invariant (simultaneous) of the two forms, its real expression being $\bar{a}_0\bar{b}_1-\bar{a}_1\bar{b}_0$. This will be recognized as the resultant of the two linear forms. If the two linear forms be identical, the umbral sets $a_1, a_2$; $b_1, b_2$ are alternative, are ultimately put equal to one another and $(ab)$ vanishes. A single linear form has, in fact, no invariant. When either of the forms is of an order higher than the first $(ab)$, as not being expressible in terms of the actual coefficients of the forms, is not an invariant and has no significance. Introducing now other sets of symbols $C, D, \ldots$; $c, d, \ldots$ we may write
$$(AB)^i(AC)^j(BC)^k\ldots = (\lambda\mu)^{i+j+k+\ldots}(ab)^i(ac)^j(bc)^k\ldots,$$
so that the symbolic product $(ab)^i(ac)^j(bc)^k\ldots$ possesses the invariant property. If the forms be all linear and different, the function is an invariant, viz. the $i$th power of that appertaining to $a_x$ and $b_x$ multiplied by the $j$th power of that appertaining to $a_x$ and $c_x$ multiplied by &c. If any two of the linear forms, say $p_x, q_x$, be supposed identical, any symbolic expression involving the factor $(pq)$ is zero. Notice, therefore, that the symbolic product $(ab)^i(ac)^j(bc)^k\ldots$ may be always viewed as a simultaneous invariant of a number of different linear forms $a_x, b_x, c_x, \ldots$ In order that $(ab)^i(ac)^j(bc)^k\ldots$ may be a simultaneous invariant of a number of different forms $a_x^{n_1}, b_x^{n_2}, c_x^{n_3}, \ldots$, where $n_1, n_2, n_3, \ldots$ may be the same or different, it is necessary that every product of umbrae which arises in the expansion of the symbolic product be of degree $n_1$ in $a_1, a_2$; in the case of $b_1, b_2$ of degree $n_2$; in the case of $c_1, c_2$ of degree $n_3$; and so on. For these only will the symbolic product be replaceable by a linear function of products of real coefficients. Hence the condition is
$$i+j+\ldots = n_1,\qquad i+k+\ldots = n_2,\qquad j+k+\ldots = n_3,\ \ldots$$
If the forms $a_x^n, b_x^n, c_x^n, \ldots$ be identical the symbols are alternative, and provided that the form does not vanish it denotes an invariant of the single form $a_x^n$. There may be a number of forms $a_x^n, b_x^n, c_x^n, \ldots$ and we may suppose such identities between the symbols that on the whole only two, three, or more of the sets of umbrae are not equivalent; we will then obtain invariants of two, three, or more sets of binary forms. The symbolic expression of a covariant is equally simple, because we see at once that since $A_\xi, B_\xi, C_\xi, \ldots$ are equal to $a_x, b_x, c_x, \ldots$ respectively, the linear forms $a_x, b_x, c_x, \ldots$ possess the invariant property, and we may write
$$(AB)^i(AC)^j(BC)^k\ldots A_\xi^{\rho}B_\xi^{\sigma}C_\xi^{\tau}\ldots = (\lambda\mu)^{i+j+k+\ldots}(ab)^i(ac)^j(bc)^k\ldots a_x^{\rho}b_x^{\sigma}c_x^{\tau}\ldots,$$
and assert that the symbolic product
$$(ab)^i(ac)^j(bc)^k\ldots a_x^{\rho}b_x^{\sigma}c_x^{\tau}\ldots$$
possesses the invariant property. It is always an invariant or covariant appertaining to a number of different linear forms, and as before it may vanish if two such linear forms be identical. In general it will be a simultaneous covariant of the different forms $a_x^{n_1}, b_x^{n_2}, c_x^{n_3}, \ldots$ if
$$i+j+\ldots+\rho = n_1,\qquad i+k+\ldots+\sigma = n_2,\qquad j+k+\ldots+\tau = n_3,\ \ldots$$
It will also be a covariant if the symbolic product be factorizable into portions each of which satisfies these conditions. If the forms be identical the sets of symbols are ultimately equated, and the form, provided it does not vanish, is a covariant of the form $a_x^n$. The expression $(ab)^4$ properly appertains to a quartic; for a quadratic it may also be written $(ab)^2(cd)^2$, and would denote the square of the discriminant to a factor près. For the quartic
$$(ab)^4 = (a_1b_2-a_2b_1)^4 = a_1^4b_2^4-4a_1^3a_2b_1b_2^3+6a_1^2a_2^2b_1^2b_2^2-4a_1a_2^3b_1^3b_2+a_2^4b_1^4$$
$$= \bar{a}_0\bar{a}_4-4\bar{a}_1\bar{a}_3+6\bar{a}_2^2-4\bar{a}_1\bar{a}_3+\bar{a}_0\bar{a}_4 = 2(\bar{a}_0\bar{a}_4-4\bar{a}_1\bar{a}_3+3\bar{a}_2^2),$$
one of the well-known invariants of the quartic.
For the cubic $(ab)^2a_xb_x$ is a covariant because each symbol $a, b$ occurs three times; we can first of all find its real expression as a simultaneous covariant of two cubics, and then, by supposing the two cubics to merge into identity, find the expression of the quadratic covariant, of the single cubic, commonly known as the Hessian. By simple multiplication
$$(a_1^3b_1b_2^2-2a_1^2a_2b_1^2b_2+a_1a_2^2b_1^3)x_1^2+(a_1^3b_2^3-a_1^2a_2b_1b_2^2-a_1a_2^2b_1^2b_2+a_2^3b_1^3)x_1x_2+(a_1^2a_2b_2^3-2a_1a_2^2b_1b_2^2+a_2^3b_1^2b_2)x_2^2;$$
and transforming to the real form,
$$(\bar{a}_0\bar{b}_2-2\bar{a}_1\bar{b}_1+\bar{a}_2\bar{b}_0)x_1^2+(\bar{a}_0\bar{b}_3-\bar{a}_1\bar{b}_2-\bar{a}_2\bar{b}_1+\bar{a}_3\bar{b}_0)x_1x_2+(\bar{a}_1\bar{b}_3-2\bar{a}_2\bar{b}_2+\bar{a}_3\bar{b}_1)x_2^2,$$
the simultaneous covariant; and now, putting $\bar{b} = \bar{a}$, we obtain twice the Hessian
$$(\bar{a}_0\bar{a}_2-\bar{a}_1^2)x_1^2+(\bar{a}_0\bar{a}_3-\bar{a}_1\bar{a}_2)x_1x_2+(\bar{a}_1\bar{a}_3-\bar{a}_2^2)x_2^2.$$
It will be shown later that all invariants, single or simultaneous, are expressible in terms of symbolic products. The degree of the covariant in the coefficients is equal to the number of different symbols $a, b, c, \ldots$ that occur in the symbolic expression; the degree in the variables (i.e. the order of the covariant) is $\rho+\sigma+\tau+\ldots$ and the weight¹ of the coefficient of the leading term $x_1^{\rho+\sigma+\tau+\ldots}$ is equal to $i+j+k+\ldots$ It will be apparent that there are four numbers associated with a covariant, viz. the orders of the quantic and covariant, and the degree and weight of the leading coefficient; calling these $n, \epsilon, \theta, w$ respectively we can see that they are not independent integers, but that they are invariably connected by a certain relation $n\theta-2w = \epsilon$. For, if $\phi(\bar{a}_0, \ldots; x_1, x_2)$ be a covariant of order $\epsilon$ appertaining to a quantic of order $n$,
$$\phi(\bar{A}_0, \ldots; \xi_1, \xi_2) = (\lambda\mu)^w\phi(\bar{a}_0, \ldots; \lambda_1\xi_1+\mu_1\xi_2, \lambda_2\xi_1+\mu_2\xi_2);$$
we find that the left- and right-hand sides are of degrees $n\theta$ and $2w+\epsilon$ respectively in $\lambda_1, \mu_1, \lambda_2, \mu_2$, and thence $n\theta = 2w+\epsilon$.
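The passage from a symbolic product to its real expression is purely mechanical, and may be sketched in a few lines of Python. What follows is an illustration under stated assumptions and not a method given in the article: polynomials in the umbrae and variables are held as dictionaries of exponents, and the names `multiply`, `power` and `realise` are hypothetical. The umbral monomial $a_1^{n-k}a_2^k$ is replaced by $\bar{a}_k$, and the two sets of umbrae are then identified.

```python
from collections import defaultdict

# Exponent order in every key: (a1, a2, b1, b2, x1, x2).
AB = {(1, 0, 0, 1, 0, 0): 1, (0, 1, 1, 0, 0, 0): -1}  # (ab) = a1*b2 - a2*b1
AX = {(1, 0, 0, 0, 1, 0): 1, (0, 1, 0, 0, 0, 1): 1}   # a_x = a1*x1 + a2*x2
BX = {(0, 0, 1, 0, 1, 0): 1, (0, 0, 0, 1, 0, 1): 1}   # b_x = b1*x1 + b2*x2

def multiply(p, q):
    """Product of two polynomials stored as {exponent tuple: coefficient}."""
    out = defaultdict(int)
    for e1, c1 in p.items():
        for e2, c2 in q.items():
            out[tuple(i + j for i, j in zip(e1, e2))] += c1 * c2
    return {e: c for e, c in out.items() if c != 0}

def power(p, k):
    """k-th power of a polynomial by repeated multiplication."""
    result = {(0,) * 6: 1}
    for _ in range(k):
        result = multiply(result, p)
    return result

def realise(symbolic, n):
    """Replace a1^(n-k) a2^k by a_k and b1^(n-j) b2^j by a_j (b identified with a).
    Result keys: ((k, j) with k <= j, (x1 exponent, x2 exponent))."""
    out = defaultdict(int)
    for (ea1, ea2, eb1, eb2, ex1, ex2), c in symbolic.items():
        assert ea1 + ea2 == n and eb1 + eb2 == n  # each symbol must occur n times
        out[(tuple(sorted((ea2, eb2))), (ex1, ex2))] += c
    return {key: c for key, c in out.items() if c != 0}

quartic_invariant = realise(power(AB, 4), 4)
cubic_covariant = realise(multiply(multiply(power(AB, 2), AX), BX), 3)
print(quartic_invariant)
print(cubic_covariant)
```

The first dictionary holds the coefficients 2, -8, 6 of $\bar{a}_0\bar{a}_4$, $\bar{a}_1\bar{a}_3$, $\bar{a}_2^2$, and the second holds twice the coefficients of the Hessian, in agreement with the expressions found above.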
Symbolic Identities.- For the purpose of manipulating symbolic expressions it is necessary to be in possession of certain simple identities which connect certain symbolic products. From the three equations
$$a_x = a_1x_1+a_2x_2,\qquad b_x = b_1x_1+b_2x_2,\qquad c_x = c_1x_1+c_2x_2,$$
we find by eliminating $x_1$ and $x_2$ the relation
$$a_x(bc)+b_x(ca)+c_x(ab) = 0.\qquad\text{(I.)}$$
Introduce now new umbrae $d_1, d_2$ and recall that $+d_2, -d_1$ are cogredient with $x_1$ and $x_2$. We may in any relation substitute for any pair of quantities any other cogredient pair so that writing $+d_2, -d_1$ for $x_1$ and $x_2$, and noting that $g_x$ then becomes $(gd)$, the above-written identity becomes
$$(ad)(bc)+(bd)(ca)+(cd)(ab) = 0.\qquad\text{(II.)}$$
Similarly in (I.), writing for $c_1, c_2$ the cogredient pair $-y_2, +y_1$, we obtain
$$a_xb_y-a_yb_x = (ab)(xy).\qquad\text{(III.)}$$
Again in (I.) transposing $c_x(ab)$ to the other side and squaring, we obtain
$$2(ac)(bc)a_xb_x = (bc)^2a_x^2+(ac)^2b_x^2-(ab)^2c_x^2.\qquad\text{(IV.)}$$
and herein writing $d_2, -d_1$ for $x_1, x_2$,
$$2(ac)(bc)(ad)(bd) = (bc)^2(ad)^2+(ac)^2(bd)^2-(ab)^2(cd)^2.\qquad\text{(V.)}$$
As an illustration multiply (IV.) throughout by $a_x^{n-2}b_x^{n-2}c_x^{n-2}$ so that each term may denote a covariant of an $n$-ic.
$$2(ac)(bc)a_x^{n-1}b_x^{n-1}c_x^{n-2} = (bc)^2a_x^nb_x^{n-2}c_x^{n-2}+(ac)^2a_x^{n-2}b_x^nc_x^{n-2}-(ab)^2a_x^{n-2}b_x^{n-2}c_x^n.$$
Each term on the right-hand side may be shown by permutation of $a, b, c$ to be the symbolical representation of the same covariant; they are equivalent symbolic products, and we may accordingly write
$$2(ac)(bc)a_x^{n-1}b_x^{n-1}c_x^{n-2} = (ab)^2a_x^{n-2}b_x^{n-2}c_x^n,$$
a relation which shows that the form on the left is the product of the two covariants $(ab)^2a_x^{n-2}b_x^{n-2}$ and $c_x^n$. The identities are, in particular, of service in reducing symbolic products to standard forms. A symbolical expression may be always so transformed that the power of any determinant factor $(ab)$ is even. For we may in any product interchange $a$ and $b$ without altering its signification; therefore
$$(ab)^{2m+1}\phi_1 = -(ab)^{2m+1}\phi_2,$$
where $\phi_2$ is what $\phi_1$ becomes by the interchange, and hence
$$(ab)^{2m+1}\phi_1 = \tfrac12(ab)^{2m+1}(\phi_1-\phi_2);$$
and identity (I.) will always result in transforming $\phi_1-\phi_2$ so as to make it divisible by $(ab)$.

¹ The weight of a term $\bar{a}_0^{k_0}\bar{a}_1^{k_1}\ldots\bar{a}_n^{k_n}$ is defined as being $k_1+2k_2+\ldots+nk_n$.
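Since the identities (I.) to (V.) hold for any quantities whatever put in place of the umbrae, they admit of a direct numerical test. The following Python sketch is an illustration only; the numerical pairs are arbitrary and the helper names `bracket` and `linear` are assumptions made for the example.

```python
def bracket(p, q):
    """Determinant factor (pq) = p1*q2 - p2*q1."""
    return p[0] * q[1] - p[1] * q[0]

def linear(p, x):
    """Linear form p_x = p1*x1 + p2*x2."""
    return p[0] * x[0] + p[1] * x[1]

# Arbitrary numerical stand-ins for the umbrae and the variables.
a, b, c, d = (3, -2), (5, 7), (-4, 1), (2, 9)
x, y = (6, -5), (1, 8)

identity_1 = (linear(a, x) * bracket(b, c) + linear(b, x) * bracket(c, a)
              + linear(c, x) * bracket(a, b))
identity_2 = (bracket(a, d) * bracket(b, c) + bracket(b, d) * bracket(c, a)
              + bracket(c, d) * bracket(a, b))
identity_3 = (linear(a, x) * linear(b, y) - linear(a, y) * linear(b, x)
              - bracket(a, b) * bracket(x, y))
identity_4 = (2 * bracket(a, c) * bracket(b, c) * linear(a, x) * linear(b, x)
              - bracket(b, c) ** 2 * linear(a, x) ** 2
              - bracket(a, c) ** 2 * linear(b, x) ** 2
              + bracket(a, b) ** 2 * linear(c, x) ** 2)
identity_5 = (2 * bracket(a, c) * bracket(b, c) * bracket(a, d) * bracket(b, d)
              - bracket(b, c) ** 2 * bracket(a, d) ** 2
              - bracket(a, c) ** 2 * bracket(b, d) ** 2
              + bracket(a, b) ** 2 * bracket(c, d) ** 2)

# Each expression is the left-hand side minus the right-hand side of an identity.
print(identity_1, identity_2, identity_3, identity_4, identity_5)
```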
Ex. gr.
$$(ab)(ac)b_xc_x = -(ab)(bc)a_xc_x = \tfrac12(ab)c_x\{(ac)b_x-(bc)a_x\} = \tfrac12(ab)^2c_x^2;$$
so that the covariant of the quadratic on the left is half the product of the quadratic itself and its only invariant. To obtain the corresponding theorem concerning the general form of even order we multiply throughout by $(ab)^{2m-2}c_x^{2m-2}$ and obtain
$$(ab)^{2m-1}(ac)b_xc_x^{2m-1} = \tfrac12(ab)^{2m}c_x^{2m}.$$
Paying attention merely to the determinant factors there is no form with one factor since $(ab)$ vanishes identically. For two factors the standard form is $(ab)^2$; for three factors $(ab)^2(ac)$; for four factors $(ab)^4$ and $(ab)^2(cd)^2$; for five factors $(ab)^4(ac)$ and $(ab)^2(ac)(de)^2$; for six factors $(ab)^6$, $(ab)^2(bc)^2(ca)^2$, and $(ab)^2(cd)^2(ef)^2$. It will be a useful exercise for the reader to interpret the corresponding covariants of the general quantic, to show that some of them are simple powers or products of other covariants of lower degrees and order.

The Polar Process.-
The ï¿½th polar of ax with regard to y is n-ï¿½ a aye i.e. of the symbolic factors of the form are replaced by IA others in which new variables y1, y2 replace the old variables x1, x 2 . The operation of taking the polar results in a symbolic product, and the repetition of the process in regard to new cogredient sets of variables results in symbolic forms. It is therefore an invariant process. All the forms obtained are invariants in regard to linear transformations, in accordance with the same scheme of substitutions, of the several sets of variables.
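An important associated operation is
$$\frac{\partial^2}{\partial x_1\,\partial y_2} - \frac{\partial^2}{\partial x_2\,\partial y_1},$$
which, operating upon any polar, causes it to vanish. Moreover, its operation upon any invariant form produces an invariant form. Every symbolic product, involving several sets of cogredient variables, can be exhibited as a sum of terms, each of which is a polar multiplied by a product of powers of the determinant factors $(xy), (xz), (yz), \ldots$
Transvection.-We have seen that $(ab)$ is a simultaneous invariant of the two different linear forms $a_x, b_x$, and we observe that $(ab)$ is equivalent to
$$\frac{\partial f}{\partial x_1}\frac{\partial\phi}{\partial x_2} - \frac{\partial f}{\partial x_2}\frac{\partial\phi}{\partial x_1},$$
where $f = a_x$, $\phi = b_x$. If $f = a_x^m$, $\phi = b_x^n$ be any two binary forms, we generalize by forming the function
$$\frac{(m-k)!\,(n-k)!}{m!\,n!}\left(\frac{\partial f}{\partial x_1}\frac{\partial\phi}{\partial x_2} - \frac{\partial f}{\partial x_2}\frac{\partial\phi}{\partial x_1}\right)^{k},$$
the $k$th power being understood symbolically, so that powers and products of the differential symbols denote the corresponding higher differential coefficients of $f$ and of $\phi$. This is called the $k$th transvectant of $f$ over $\phi$; it may be conveniently denoted by $(f, \phi)^k$. Since
$$(a_x^m, b_x^n)^k = (ab)^k a_x^{m-k}b_x^{n-k},$$
it is clear that the $k$th transvectant is a simultaneous covariant of the two forms.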
It has been shown by Gordan that every symbolic product is expressible as a sum of transvectants.
If $m > n$ there are $n+1$ transvectants corresponding to the values $0, 1, 2, \ldots n$ of $k$; if $k = 0$ we have the product of the two forms, and for all values of $k > n$ the transvectants vanish. In general we may have any two forms
$$\phi_x^p = (\phi_1x_1 + \phi_2x_2)^p,\qquad \psi_x^q = (\psi_1x_1 + \psi_2x_2)^q,$$
$\phi_1, \phi_2, \psi_1, \psi_2$ being the umbrae, as usual, and for the $k$th transvectant we have
$$(\phi_x^p, \psi_x^q)^k = (\phi\psi)^k\phi_x^{p-k}\psi_x^{q-k},$$
a simultaneous covariant of the two forms. We may suppose $\phi_x^p$, $\psi_x^q$ to be any two covariants appertaining to a system, and the process of transvection supplies a means of proceeding from them to other covariants.
The two forms $a_x^m, b_x^n$, or $\phi_x^p, \psi_x^q$, may be identical; we then have the $k$th transvectant of a form over itself which may, or may not, vanish identically; and, in the latter case, is a covariant of the single form. It is obvious that, when $k$ is uneven, the $k$th transvectant of a form over itself does vanish. We have seen that transvection is equivalent to the performance of partial differential operations upon the two forms, but, practically, we may regard the process as merely substituting $(ab)^k$, $(\phi\psi)^k$ for $a_x^kb_x^k$, $\phi_x^k\psi_x^k$ respectively in the symbolic product subjected to transvection. It is essentially an operation performed upon the product of two forms. If, then, we require the transvectants of the two forms $f + \lambda f'$, $\phi + \mu\phi'$, we take their product
$$f\phi + \lambda f'\phi + \mu f\phi' + \lambda\mu f'\phi',$$
and the $k$th transvectant is simply obtained by operating upon each term separately, viz.
$$(f, \phi)^k + \lambda(f', \phi)^k + \mu(f, \phi')^k + \lambda\mu(f', \phi')^k;$$
and, moreover, if we require to find the $k$th transvectant of one linear system of forms over another we have merely to multiply the two systems, and take the $k$th transvectant of the separate products.
The process of transvection is connected with the operations $\Omega$; for
$$\Omega^k(a_x^mb_y^n) = (ab)^ka_x^{m-k}b_y^{n-k},\qquad\text{or}\qquad \{\Omega^k(a_x^mb_y^n)\}_{y=x} = (f, \phi)^k;$$
so also is the polar process, for since
$$f_y^k = a_x^{m-k}a_y^k,\qquad \phi_y^k = b_x^{n-k}b_y^k,$$
if we take the $k$th transvectant of $f_y^k$ over $\phi_y^k$, regarding $y_1, y_2$ as the variables,
$$(f_y^k, \phi_y^k)^k = (ab)^ka_x^{m-k}b_x^{n-k} = (f, \phi)^k;$$
or the $k$th transvectant of the $k$th polars, in regard to $y$, is equal to the $k$th transvectant of the forms. Moreover, the $k$th transvectant $(ab)^ka_x^{m-k}b_x^{n-k}$ is derivable from the $k$th polar of $a_x^m$, viz. $a_x^{m-k}a_y^k$, by substituting for $y_1, y_2$ the cogredient quantities $b_2, -b_1$, and multiplying by $b_x^{n-k}$.
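The differential definition of the transvectant lends itself to direct computation. The sketch below, which is an illustration and not taken from the article, represents a binary form as a dictionary mapping the exponent pair (i, j) of $x_1^ix_2^j$ to its coefficient, and expands the symbolic $k$th power by the binomial theorem; all function names are hypothetical.

```python
from fractions import Fraction
from math import comb, factorial

def diff(poly, var):
    """Partial derivative with respect to x1 (var=0) or x2 (var=1)."""
    out = {}
    for (i, j), c in poly.items():
        e = (i, j)[var]
        if e:
            key = (i - 1, j) if var == 0 else (i, j - 1)
            out[key] = out.get(key, 0) + c * e
    return out

def nth_diff(poly, n1, n2):
    """Differentiate n1 times in x1 and n2 times in x2."""
    for _ in range(n1):
        poly = diff(poly, 0)
    for _ in range(n2):
        poly = diff(poly, 1)
    return poly

def mul(p, q):
    """Product of two polynomials."""
    out = {}
    for (i, j), c in p.items():
        for (k, l), d in q.items():
            out[(i + k, j + l)] = out.get((i + k, j + l), 0) + c * d
    return {key: v for key, v in out.items() if v != 0}

def add(p, q, scale=1):
    """p + scale*q, with vanishing terms dropped."""
    out = dict(p)
    for key, v in q.items():
        out[key] = out.get(key, 0) + scale * v
    return {key: v for key, v in out.items() if v != 0}

def transvectant(f, m, g, n, k):
    """k-th transvectant of f (order m) over g (order n), normalised so that
    (a_x^m, b_x^n)^k = (ab)^k a_x^(m-k) b_x^(n-k)."""
    if k > min(m, n):
        return {}
    total = {}
    for i in range(k + 1):
        # term of the binomial expansion of (d/dx1 . d/dx2' - d/dx2 . d/dx1')^k
        term = mul(nth_diff(f, k - i, i), nth_diff(g, i, k - i))
        total = add(total, term, (-1) ** i * comb(k, i))
    norm = Fraction(factorial(m - k) * factorial(n - k),
                    factorial(m) * factorial(n))
    return {key: norm * c for key, c in total.items()}
```

For the quadratic $x_1^2 + 4x_1x_2 + 3x_2^2$ (so that $a_0 = 1$, $a_1 = 2$, $a_2 = 3$) the call `transvectant(q, 2, q, 2, 2)` gives the constant $-2 = 2(a_0a_2 - a_1^2)$, and the first transvectant of the form over itself is returned as the empty (zero) polynomial, in agreement with the remark that uneven transvectants of a form over itself vanish.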
First and Second Transvectants.-A few words must be said about the first two transvectants as they are of exceptional interest. Since, if $f = a_x^m$, $\phi = b_x^n$,
$$(f, \phi)^1 = \frac{1}{mn}\left(\frac{\partial f}{\partial x_1}\frac{\partial\phi}{\partial x_2} - \frac{\partial f}{\partial x_2}\frac{\partial\phi}{\partial x_1}\right) = (ab)a_x^{m-1}b_x^{n-1} = J,$$
the first transvectant differs but by a numerical factor from the Jacobian or functional determinant, of the two forms. We can find an expression for the first transvectant of $(f, \phi)^1$ over another form $c_x^p$. For
$$(m+n)(f\phi)_y = nf.\phi_y + mf_y.\phi,$$
and
$$f.\phi_y - f_y.\phi = (a_xb_y - a_yb_x)a_x^{m-1}b_x^{n-1} = (xy)(f, \phi)^1;$$
$$\therefore\ (f\phi)_y = f_y.\phi + \frac{n}{m+n}(xy)(f, \phi)^1.$$
Put $m-1$ for $m$, $n-1$ for $n$, and multiply through by $(ab)$; then
$$\{(f, \phi)^1\}_y = (ab)a_x^{m-2}a_yb_x^{n-1} + \frac{n-1}{m+n-2}(xy)(f, \phi)^2 = (ab)a_x^{m-1}b_x^{n-2}b_y - \frac{m-1}{m+n-2}(xy)(f, \phi)^2.$$
Multiply by $c_x^{p-1}$ and for $y_1, y_2$ write $c_2, -c_1$; then the right-hand side becomes
$$(ab)(bc)a_x^{m-1}b_x^{n-2}c_x^{p-1} + \frac{m-1}{m+n-2}c_x^p(f, \phi)^2,$$
of which the first term, writing $c_x^p = \psi$, is
$$-\tfrac{1}{2}a_x^{m-2}b_x^{n-2}c_x^{p-2}\{(bc)^2a_x^2 + (ab)^2c_x^2 - (ac)^2b_x^2\} = \tfrac{1}{2}(f, \psi)^2.\phi - \tfrac{1}{2}(f, \phi)^2.\psi - \tfrac{1}{2}(\phi, \psi)^2.f;$$
and, if $(f, \phi)^1 = k_x^{m+n-2}$, observing that $\{(f, \phi)^1\}_y = k_x^{m+n-3}k_y$, and that this, on writing $c_2, -c_1$ for $y_1, y_2$, becomes $(kc)k_x^{m+n-3}$, we have on the left
$$(kc)k_x^{m+n-3}c_x^{p-1} = ((f, \phi)^1, \psi)^1;$$
$$\therefore\ ((f, \phi)^1, \psi)^1 = \frac{m-n}{2(m+n-2)}(f, \phi)^2.\psi + \tfrac{1}{2}(f, \psi)^2.\phi - \tfrac{1}{2}(\phi, \psi)^2.f,$$
and thence it appears that the first transvectant of $(f, \phi)^1$ over $\psi$ is always expressible by means of forms of lower degree in the coefficients wherever each of the forms $f, \phi, \psi$ is of higher degree than the first in $x_1, x_2$.
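The second transvectant of a form over itself is called the Hessian of the form. It is
$$(f, f')^2 = (ab)^2a_x^{n-2}b_x^{n-2} = H_x^{2n-4} = H;$$
unsymbolically it is a numerical multiple of the determinant
$$\frac{\partial^2f}{\partial x_1^2}\frac{\partial^2f}{\partial x_2^2} - \left(\frac{\partial^2f}{\partial x_1\,\partial x_2}\right)^2.$$
It is also the first transvectant of the differential coefficients of the form with regard to the variables, viz. $\left(\dfrac{\partial f}{\partial x_1}, \dfrac{\partial f}{\partial x_2}\right)^1$. For the quadratic it is the discriminant $(ab)^2$ and for the cubic the quadratic covariant $(ab)^2a_xb_x$. In general for a form in $n$ variables the Hessian is
$$\begin{vmatrix} \dfrac{\partial^2f}{\partial x_1^2} & \dfrac{\partial^2f}{\partial x_1\,\partial x_2} & \cdots & \dfrac{\partial^2f}{\partial x_1\,\partial x_n} \\ \vdots & \vdots & & \vdots \\ \dfrac{\partial^2f}{\partial x_n\,\partial x_1} & \dfrac{\partial^2f}{\partial x_n\,\partial x_2} & \cdots & \dfrac{\partial^2f}{\partial x_n^2} \end{vmatrix}$$
and there is a remarkable theorem which states that if $H = 0$ and $n = 2$, 3, or 4 the original form can be exhibited as a form in 1, 2, 3 variables respectively.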
The Form $f + \lambda\phi$.-An important method for the formation of covariants is connected with the form $f + \lambda\phi$, where $f$ and $\phi$ are of the same order in the variables and $\lambda$ is an arbitrary constant. If the invariants and covariants of this composite quantic be formed we obtain functions of $\lambda$ such that the coefficients of the various powers of $\lambda$ are simultaneous invariants of $f$ and $\phi$. In particular, when $\phi$ is a covariant of $f$, we obtain in this manner covariants of $f$.
The Partial Differential Equations.-It will be shown later that covariants may be studied by restricting attention to the leading coefficient, viz. that affecting $x_1^{\epsilon}$ where $\epsilon$ is the order of the covariant.
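An important fact, discovered by Cayley, is that these coefficients, and also the complete covariants, satisfy certain partial differential equations which suffice to determine them, and to ascertain many of their properties. These equations can be arrived at in many ways; the method here given is due to Gordan. $\lambda_1, \lambda_2, \mu_1, \mu_2$ being as usual the coefficients of substitution, let
$$\lambda_1\frac{\partial}{\partial\lambda_1} + \lambda_2\frac{\partial}{\partial\lambda_2} = D_{\lambda\lambda},\qquad \lambda_1\frac{\partial}{\partial\mu_1} + \lambda_2\frac{\partial}{\partial\mu_2} = D_{\lambda\mu},$$
$$\mu_1\frac{\partial}{\partial\lambda_1} + \mu_2\frac{\partial}{\partial\lambda_2} = D_{\mu\lambda},\qquad \mu_1\frac{\partial}{\partial\mu_1} + \mu_2\frac{\partial}{\partial\mu_2} = D_{\mu\mu},$$
be linear operators. Then if $j$, $J$ be the original and transformed forms of an invariant, $J = (\lambda\mu)^wj$, $w$ being the weight of the invariant.
Operation upon $J$ results as follows:
$$D_{\lambda\lambda}J = wJ;\quad D_{\lambda\mu}J = 0;\quad D_{\mu\lambda}J = 0;\quad D_{\mu\mu}J = wJ.$$
The first and fourth of these indicate that $(\lambda\mu)^w$ is a homogeneous function of $\lambda_1, \lambda_2$, and of $\mu_1, \mu_2$ separately, and the second and third arise from the fact that $(\lambda\mu)$ is caused to vanish by both $D_{\lambda\mu}$ and $D_{\mu\lambda}$. Since $J = F(A_0, A_1, \ldots A_k, \ldots)$, where $A_k = a_\lambda^{n-k}a_\mu^k$, we find that the results are equivalent to
$$\sum_k(D_{\lambda\lambda}A_k)\frac{\partial J}{\partial A_k} = wJ;\quad \sum_k(D_{\lambda\mu}A_k)\frac{\partial J}{\partial A_k} = 0;\quad \sum_k(D_{\mu\lambda}A_k)\frac{\partial J}{\partial A_k} = 0;\quad \sum_k(D_{\mu\mu}A_k)\frac{\partial J}{\partial A_k} = wJ,$$
according to the well-known law for the changes of independent variables. Now
$$D_{\lambda\lambda}A_k = (n-k)A_k;\quad D_{\lambda\mu}A_k = kA_{k-1};\quad D_{\mu\lambda}A_k = (n-k)A_{k+1};\quad D_{\mu\mu}A_k = kA_k;$$
$$\therefore\ \sum_k(n-k)A_k\frac{\partial J}{\partial A_k} = wJ;\quad \sum_kkA_{k-1}\frac{\partial J}{\partial A_k} = 0;\quad \sum_k(n-k)A_{k+1}\frac{\partial J}{\partial A_k} = 0;\quad \sum_kkA_k\frac{\partial J}{\partial A_k} = wJ;$$
equations which are valid when $\lambda_1, \lambda_2, \mu_1, \mu_2$ have arbitrary values, and therefore when the values are such that $J = j$, $A_k = a_k$. Hence
$$na_0\frac{\partial j}{\partial a_0} + (n-1)a_1\frac{\partial j}{\partial a_1} + (n-2)a_2\frac{\partial j}{\partial a_2} + \ldots = wj,$$
$$a_0\frac{\partial j}{\partial a_1} + 2a_1\frac{\partial j}{\partial a_2} + 3a_2\frac{\partial j}{\partial a_3} + \ldots = 0,$$
$$na_1\frac{\partial j}{\partial a_0} + (n-1)a_2\frac{\partial j}{\partial a_1} + (n-2)a_3\frac{\partial j}{\partial a_2} + \ldots = 0,$$
$$a_1\frac{\partial j}{\partial a_1} + 2a_2\frac{\partial j}{\partial a_2} + 3a_3\frac{\partial j}{\partial a_3} + \ldots = wj,$$
the complete system of equations satisfied by an invariant. The fourth shows that every term of the invariant is of the same weight. Moreover, if we add the first to the fourth we obtain
$$n\sum_ka_k\frac{\partial j}{\partial a_k} = n\theta j = 2wj,$$
where $\theta$ is the degree of the invariant; this shows, as we have before observed, that for an invariant $w = \tfrac{1}{2}n\theta$. The second and third are those upon the solution of which the theory of the invariant may be said to depend. An instantaneous deduction from the relation $w = \tfrac{1}{2}n\theta$ is that forms of uneven orders possess only invariants of even degree in the coefficients. The two operators
$$\Omega = a_0\frac{\partial}{\partial a_1} + 2a_1\frac{\partial}{\partial a_2} + \ldots + na_{n-1}\frac{\partial}{\partial a_n},\qquad O = na_1\frac{\partial}{\partial a_0} + (n-1)a_2\frac{\partial}{\partial a_1} + \ldots + a_n\frac{\partial}{\partial a_{n-1}}$$
have been much studied by Sylvester, Hammond, Hilbert and Elliott (Elliott, Algebra of Quantics, ch. vi.). An important reference is "The Differential Equations satisfied by Concomitants of Quantics," by A. R. Forsyth, Proc. Lond. Math. Soc. vol. xix.
The Evectant Process.-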
If we have a symbolic product, which contains the symbol $a$ only in determinant factors such as $(ab)$, we may write $x_2, -x_1$ for $a_1, a_2$, and thus obtain a product in which $(ab)$ is replaced by $b_x$, $(ac)$ by $c_x$ and so on. In particular, when the product denotes an invariant we may transform each of the symbols $a, b, \ldots$ to $x$ in succession, and take the sum of the resultant products; we thus obtain a covariant which is called the first evectant of the original invariant. The second evectant is obtained by similarly operating upon all the symbols remaining which only occur in determinant factors, and so on for the higher evectants.
Ex. gr. From $(ac)^2(bd)^2(ad)(bc)$ we obtain
$$(bd)^2(bc)c_x^2d_x + (ac)^2(ad)c_xd_x^2 - (bd)^2(ad)a_x^2b_x - (ac)^2(bc)a_xb_x^2 = 4(bd)^2(bc)c_x^2d_x,$$
the first evectant; and thence $4c_x^3d_x^3$ the second evectant; in fact the two evectants are, to numerical factors près, the cubic covariant $Q$, and the square of the original cubic.
If $\theta$ be the degree of an invariant $j$,
$$\theta j = a_0\frac{\partial j}{\partial a_0} + a_1\frac{\partial j}{\partial a_1} + \ldots + a_n\frac{\partial j}{\partial a_n} = a_1^n\frac{\partial j}{\partial a_0} + a_1^{n-1}a_2\frac{\partial j}{\partial a_1} + \ldots + a_2^n\frac{\partial j}{\partial a_n},$$
and, herein transforming from $a$ to $x$, we obtain the first evectant
$$\sum_k(-)^kx_1^kx_2^{n-k}\frac{\partial j}{\partial a_k}.$$
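As a brief worked illustration (not in the original, and assuming the quadratic $f = a_0x_1^2 + 2a_1x_1x_2 + a_2x_2^2$ with its invariant $j = (ab)^2 = 2(a_0a_2 - a_1^2)$), both the annihilators $\Omega$, $O$ of the partial differential equations and the evectant formula may be verified directly; here $n = 2$, $\theta = 2$ and $w = 2$.

```latex
\begin{align*}
\Omega j &= a_0\frac{\partial j}{\partial a_1} + 2a_1\frac{\partial j}{\partial a_2}
          = a_0(-4a_1) + 2a_1(2a_0) = 0,\\
O j      &= 2a_1\frac{\partial j}{\partial a_0} + a_2\frac{\partial j}{\partial a_1}
          = 2a_1(2a_2) + a_2(-4a_1) = 0,\\
w        &= \tfrac{1}{2}n\theta = \tfrac{1}{2}\cdot 2\cdot 2 = 2,\\
\sum_{k=0}^{2}(-)^k x_1^k x_2^{2-k}\frac{\partial j}{\partial a_k}
         &= x_2^2(2a_2) - x_1x_2(-4a_1) + x_1^2(2a_0)
          = 2\,(a_0x_1^2 + 2a_1x_1x_2 + a_2x_2^2) = 2f.
\end{align*}
```

The last line agrees with the symbolic rule: in $(ab)^2$ the transformation of $a$ to $x$ gives $b_x^2$, that of $b$ to $x$ gives $a_x^2$, and the sum is $2f$.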
Combinants.-An important class of invariants, of several binary forms of the same order, was discovered by Sylvester. The invariants in question are invariants quâ linear transformation of the forms themselves as well as quâ linear transformation of the variables.
If the forms be $a_x^n, b_x^n, c_x^n, \ldots$ the Aronhold process, given by the operation $\delta$ as between any two of the forms, causes such an invariant to vanish. Thus it has annihilators of the forms
$$a_0\frac{d}{db_0} + a_1\frac{d}{db_1} + a_2\frac{d}{db_2} + \ldots,\qquad b_0\frac{d}{da_0} + b_1\frac{d}{da_1} + b_2\frac{d}{da_2} + \ldots,$$
and Gordan, in fact, takes the satisfaction of these conditions as defining those invariants which Sylvester termed "combinants." The existence of such forms seems to have been brought to Sylvester's notice by observation of the fact that the resultant of $a_x^n$ and $b_x^n$ must be a factor of the resultant of $\lambda a_x^n + \mu b_x^n$ and $\lambda'a_x^n + \mu'b_x^n$, for a common factor of the first pair must be also a common factor of the second pair; so that the condition for the existence of such common factor must be the same in the two cases. A leading proposition states that, if an invariant of $\lambda a_x^n + \mu b_x^n$ be considered as a form in the variables $\lambda$ and $\mu$, and an invariant of the latter be taken, the result will be a combinant of $a_x^n$ and $b_x^n$. The idea can be generalized so as to have regard to ternary and higher forms each of the same order and of the same number of variables.
For further information see Gordan, Vorlesungen über Invariantentheorie, Bd. ii. § 6 (Leipzig, 1887); E. B. Elliott, Algebra of Quantics, Art. 264 (Oxford, 1895).
Associated Forms.-A system of forms, such that every form appertaining to the binary form is expressible as a rational and integral function of the members of the system, is difficult to obtain. If, however, we specify that all forms are to be rational, but not necessarily integral functions, a new system of forms arises which is easily obtainable. A binary form of order $n$ contains $n$ independent constants, three of which by linear transformation can be given determinate values; the remaining $n-3$ coefficients, together with the determinant of transformation, give us $n-2$ parameters, and in consequence one relation must exist between any $n-1$ invariants of the form, and fixing upon $n-2$ invariants every other invariant is a rational function of its members. Similarly regarding $x_1, x_2$ as additional parameters, we see that every covariant is expressible as a rational function of $n$ fixed covariants. We can so determine these $n$ covariants that every other covariant is expressed in terms of them by a fraction whose denominator is a power of the binary form.
First observe that with $f = a_x^n = b_x^n = \ldots$, $f_1 = a_1a_x^{n-1}$, $f_2 = a_2a_x^{n-1}$, $f_x = f_1x_1 + f_2x_2$, we find
$$(ab) = \frac{(af)b_x - (bf)a_x}{f_x},$$
and that thence every symbolic product is equal to a rational function of covariants in the form of a fraction whose denominator is a power of $f_x$. Making the substitution in any symbolic product the only determinant factors that present themselves in the numerator are of the form $(af), (bf), (cf), \ldots$ and every symbol $a$ finally appears in the form
$$\psi_k = (af)^ka_x^{n-k}.$$
$\psi_k$ has $f$ as a factor, and may be written $f.u_k$; for observing that $\psi_0 = f = f.u_0$; $\psi_1 = 0 = f.u_1$; where $u_0 = 1$, $u_1 = 0$, assume that
$$\psi_k = (af)^ka_x^{n-k} = f.u_k = a_x^n.u_x^{k(n-2)}.$$
Taking the first polar with regard to $y$,
$$(n-k)(af)^ka_x^{n-k-1}a_y + k(n-1)(af)^{k-1}a_x^{n-k}(ab)b_x^{n-2}b_y = k(n-2)a_x^nu_x^{kn-2k-1}u_y + na_x^{n-1}a_yu_x^{k(n-2)},$$
and, writing $f_2$ and $-f_1$ for $y_1$ and $y_2$,
$$(n-k)(af)^{k+1}a_x^{n-k-1} + k(n-1)(ab)(af)^{k-1}(bf)a_x^{n-k}b_x^{n-2} = k(n-2)f.(uf)u_x^{kn-2k-1},$$
the last term on the right disappearing since $(af)a_x^{n-1} = \psi_1 = 0$. Moreover the second term on the left, on interchange of $a$ and $b$, contains the factor $(af)^{k-2}b_x^{k-2} - (bf)^{k-2}a_x^{k-2}$, which, whether $k$ be uneven or even, is divisible by $(af)b_x - (bf)a_x = (ab)f$; that term is accordingly of the form $M.f$, and therefore
$$(n-k)\psi_{k+1} + M.f = k(n-2)f.(uf)u_x^{kn-2k-1};$$
and $\psi_{k+1}$ is seen to be of the form $f.u_{k+1}$. We may write therefore
$$f.u_k = (af)^ka_x^{n-k}.$$
These forms, $n$ in number, are called "associated forms" of $f$ ("Schwesterformen," "formes associées").
Every covariant is rationally expressible by means of the forms $f, u_2, u_3, \ldots u_n$ since, as we have seen, $u_0 = 1$, $u_1 = 0$. It is easy to find the relations
$$u_2 = \tfrac{1}{2}(f, f')^2,\qquad u_3 = ((f, f')^2, f'')^1,\qquad u_4 = \tfrac{1}{2}(f, f')^4.f^2 - \tfrac{3}{4}\{(f, f')^2\}^2,$$
and so on.
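The relations just given may be tested on a particular form. The following worked sketch is an illustration only: it assumes the quartic $f = x_1^4 + x_2^4$ (chosen here for convenience), and uses the fact that $a_x^{n-k}a_y^k$ is the $k$th polar of $f$, so that $\psi_k$ results from that polar on writing $f_2, -f_1$ for $y_1, y_2$.

```latex
\begin{align*}
f &= x_1^4 + x_2^4,\qquad f_1 = x_1^3,\quad f_2 = x_2^3,\qquad
(f,f')^2 = 2x_1^2x_2^2,\quad (f,f')^4 = 2,\\
\psi_2 &= \tfrac{2!}{4!}\bigl(f_2^2\,\partial_1^2 f - 2f_1f_2\,\partial_1\partial_2 f + f_1^2\,\partial_2^2 f\bigr)
        = x_1^2x_2^2\,(x_1^4 + x_2^4), & u_2 &= x_1^2x_2^2 = \tfrac{1}{2}(f,f')^2,\\
\psi_3 &= \tfrac{1!}{4!}\bigl(f_2^3\,\partial_1^3 f - f_1^3\,\partial_2^3 f\bigr)
        = x_1x_2\,(x_2^4 - x_1^4)(x_1^4 + x_2^4), & u_3 &= x_1x_2(x_2^4 - x_1^4) = ((f,f')^2, f'')^1,\\
\psi_4 &= \tfrac{0!}{4!}\bigl(f_2^4\,\partial_1^4 f + f_1^4\,\partial_2^4 f\bigr)
        = x_1^{12} + x_2^{12}, & u_4 &= x_1^8 - x_1^4x_2^4 + x_2^8,\\
\tfrac{1}{2}(f,f')^4 f^2 - \tfrac{3}{4}\{(f,f')^2\}^2
       &= (x_1^4 + x_2^4)^2 - 3x_1^4x_2^4 = x_1^8 - x_1^4x_2^4 + x_2^8 = u_4.
\end{align*}
```

In each case $\psi_k$ is divisible by $f$, as the general argument requires, and the quotients agree with the stated expressions for $u_2$, $u_3$ and $u_4$.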
To exhibit any covariant as a function of $u_0, u_1, u_2, \ldots$ take $a_y^n = (a_1y_1 + a_2y_2)^n$ and transform it by the substitution
$$f_1y_1 + f_2y_2 = \xi,\qquad x_2y_1 - x_1y_2 = \eta,$$
where $f_1 = a_1a_x^{n-1}$, $f_2 = a_2a_x^{n-1}$; thence
$$f.y_1 = x_1\xi + f_2\eta;\qquad f.y_2 = x_2\xi - f_1\eta;\qquad f.a_y = a_x\xi + (af)\eta,$$
so we obtain
$$f^{n-1}.a_y^n = u_0\xi^n + \binom{n}{2}u_2\xi^{n-2}\eta^2 + \binom{n}{3}u_3\xi^{n-3}\eta^3 + \ldots + u_n\eta^n.$$
Now a covariant of $a_x^n = f$ is obtained from the similar covariant of $a_y^n$ by writing therein $x_1, x_2$ for $y_1, y_2$, and, since $y_1, y_2$ have been linearly transformed to $\xi$ and $\eta$, it is merely necessary to form the covariants in respect of the form $(u_1\xi + u_2\eta)^n$, and then division, by the proper power of $f$, gives the covariant in question as a function of $f$, $u_0 = 1$, $u_2, u_3, \ldots u_n$.
Summary of Results.-We will now give a short account of the results to which the foregoing processes lead. Of any form $a_x^n$ there exists a finite number of invariants and covariants, in terms of which all other covariants are rational and integral functions (cf. Gordan, Bd. ii. § 21). This finite number of forms is said to constitute the complete system. Of two or more binary forms there are also complete systems containing a finite number of forms. There are also algebraic systems, as above mentioned, involving fewer covariants which are such that all other covariants are rationally expressible in terms of them; but these smaller systems do not possess the same mathematical interest as those first mentioned.
The Binary Quadratic.-The complete system consists of the form itself, $a_x^2$, and the discriminant, which is the second transvectant of the form upon itself, viz.: $(f, f')^2 = (ab)^2$; or, in real coefficients, $2(a_0a_2 - a_1^2)$. The first transvectant, $(f, f')^1 = (ab)a_xb_x$, vanishes identically. Calling the discriminant $D$, the solution of the quadratic $a_x^2 = 0$ is given by the formula
$$a_x^2 = \frac{1}{a_0}\left(a_0x_1 + a_1x_2 + x_2\sqrt{-\tfrac{1}{2}D}\right)\left(a_0x_1 + a_1x_2 - x_2\sqrt{-\tfrac{1}{2}D}\right).$$
If the form $a_x^2$ be written as the product of its linear factors $p_xq_x$, the discriminant takes the form $-\tfrac{1}{2}(pq)^2$. The vanishing of this invariant is the condition for equal roots. The simultaneous system of two quadratic forms $a_x^2$, $\alpha_x^2$, say $f$ and $\phi$, consists of six forms, viz. the two quadratic forms $f, \phi$; the two discriminants $(f, f')^2$, $(\phi, \phi')^2$, and the first and second transvectants of $f$ upon $\phi$, $(f, \phi)^1$ and $(f, \phi)^2$, which may be written $(a\alpha)a_x\alpha_x$ and $(a\alpha)^2$. These fundamental or ground forms are connected by the relation
$$-2\{(f, \phi)^1\}^2 = f^2(\phi, \phi')^2 - 2f\phi(f, \phi)^2 + \phi^2(f, f')^2.$$
If the covariant $(f, \phi)^1$ vanishes $f$ and $\phi$ are clearly proportional, and if the second transvectant of $(f, \phi)^1$ upon itself vanishes, $f$ and $\phi$ possess a common linear factor; and the condition is both necessary and sufficient. In this case $(f, \phi)^1$ is a perfect square, since its discriminant vanishes. If $(f, \phi)^1$ be not a perfect square, and $r_x, s_x$ be its linear factors, it is possible to express $f$ and $\phi$ in the canonical forms $\lambda_1(r_x)^2 + \lambda_2(s_x)^2$, $\mu_1(r_x)^2 + \mu_2(s_x)^2$ respectively. In fact, if $f$ and $\phi$ have these forms, it is easy to verify that $(f, \phi)^1 = (\lambda\mu)(rs)r_xs_x$. The fundamental system connected with $n$ quadratic forms consists of (i.) the $n$ forms themselves $f_1, f_2, \ldots f_n$, (ii.) the $\binom{n}{2}$ functional determinants $(f_i, f_k)^1$, (iii.) the $\binom{n+1}{2}$ invariants $(f_i, f_k)^2$, (iv.) the $\binom{n}{3}$ forms $(f_i, (f_k, f_m))^2$, each such form remaining unaltered for any permutations of $i, k, m$. Between these forms various relations exist (cf. Gordan, § 134).
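The relation connecting the six ground forms of two quadratics may be checked on the simplest possible pair. The lines below are an illustrative sketch, assuming the particular forms $f = x_1^2$ and $\phi = x_2^2$ (so that $a_0 = 1$, $\alpha_2 = 1$, and all other coefficients vanish); they are not part of the original text.

```latex
\begin{align*}
(f,\phi)^1 &= \tfrac{1}{4}\left(\frac{\partial f}{\partial x_1}\frac{\partial\phi}{\partial x_2}
             - \frac{\partial f}{\partial x_2}\frac{\partial\phi}{\partial x_1}\right) = x_1x_2,\\
(f,\phi)^2 &= a_0\alpha_2 - 2a_1\alpha_1 + a_2\alpha_0 = 1,\qquad
(f,f')^2 = 2(a_0a_2 - a_1^2) = 0,\qquad (\phi,\phi')^2 = 0,\\
-2\{(f,\phi)^1\}^2 &= -2x_1^2x_2^2
  = f^2\cdot 0 - 2\,x_1^2x_2^2\cdot 1 + \phi^2\cdot 0 .
\end{align*}
```

Here $(f, \phi)^1 = x_1x_2$ is not a perfect square, its linear factors are $r_x = x_1$, $s_x = x_2$, and $f$, $\phi$ are already in the canonical forms described above.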
The Binary Cubic.-The complete system consists of
$$f = a_x^3,\qquad (f, f')^2 = (ab)^2a_xb_x = \Delta_x^2,\qquad (f, \Delta) = (ab)^2(ca)b_xc_x^2 = Q_x^3,$$
and
$$(\Delta, \Delta')^2 = (ab)^2(cd)^2(ad)(bc) = R.$$
To prove that this system is complete we have to consider $(f, \Delta)^2$, $(\Delta, \Delta')^1$, $(f, Q)^1$, $(f, Q)^2$, $(f, Q)^3$, $(\Delta, Q)^1$, $(\Delta, Q)^2$, and each of these can be shown either to be zero or to be a rational integral function of $f$, $\Delta$, $Q$ and $R$. These forms are connected by the relation
$$2Q^2 + \Delta^3 + Rf^2 = 0.$$
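A short worked check of the relation between the four ground forms, assuming for illustration the particular cubic $f = x_1^3 + x_2^3$ (a sum of two cubes, not an example from the original), runs as follows.

```latex
\begin{align*}
\Delta &= (f,f')^2 = \tfrac{1}{36}\cdot 2\left(\frac{\partial^2 f}{\partial x_1^2}\frac{\partial^2 f}{\partial x_2^2}
         - \Bigl(\frac{\partial^2 f}{\partial x_1\partial x_2}\Bigr)^2\right)
        = \tfrac{1}{18}(6x_1)(6x_2) = 2x_1x_2,\\
Q &= (f,\Delta)^1 = \tfrac{1}{6}\left(3x_1^2\cdot 2x_1 - 3x_2^2\cdot 2x_2\right) = x_1^3 - x_2^3,\\
R &= (\Delta,\Delta')^2 = 2\,(0\cdot 0 - 1^2) = -2,\\
2Q^2 + \Delta^3 + Rf^2 &= 2(x_1^3 - x_2^3)^2 + 8x_1^3x_2^3 - 2(x_1^3 + x_2^3)^2 = 0 .
\end{align*}
```

The linear factors $x_1$, $x_2$ of the Hessian are here precisely the linear forms whose cubes make up $f$.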
The discriminant of $f$ is equal to the discriminant of $\Delta$, and is therefore $(\Delta, \Delta')^2 = R$; if it vanishes both $f$ and $\Delta$ have two roots equal, $\Delta$ is a rational factor of $f$ and $Q$ is a perfect cube; the cube root being equal, to a numerical factor près, to the square root of $\Delta$. The Hessian $\Delta = \Delta_x^2$ is such that $(f, \Delta)^2 = 0$, and if $f$ is expressible in the form $\lambda(p_x)^3 + \mu(q_x)^3$, that is as the sum of two perfect cubes, we find that $\Delta_x^2$ must be equal to $p_xq_x$, for then
$$\{\lambda(p_x)^3 + \mu(q_x)^3,\ p_xq_x\}^2 = 0.$$
Hence, if $p_x, q_x$ be the linear factors of the Hessian $\Delta_x^2$, the cubic can be put into the form $\lambda(p_x)^3 + \mu(q_x)^3$ and immediately solved. This method of solution fails when the discriminant $R$ vanishes, for then the Hessian has equal roots, as also the cubic $f$. The Hessian in that case is a factor of $f$, and $Q$ is the third power of the linear factor which occurs to the second power in $f$. If, moreover, $\Delta$ vanishes identically $f$ is a perfect cube.
The Binary Quartic.-The fundamental system consists of five forms
$$a_x^4 = f;\quad (f, f')^2 = (ab)^2a_x^2b_x^2 = \Delta_x^4;\quad (f, f')^4 = (ab)^4 = i;$$
$$(f, \Delta)^1 = (a\Delta)a_x^3\Delta_x^3 = (ab)^2(cb)a_x^2b_xc_x^3 = t;\quad (f, \Delta)^4 = (a\Delta)^4 = (ab)^2(bc)^2(ca)^2 = j,$$
viz. two invariants, two quartics and a sextic. They are connected by the relation
$$2t^2 = \tfrac{1}{2}if^2\Delta - \Delta^3 - \tfrac{1}{3}jf^3.$$
The discriminant, whose vanishing is the condition that $f$ may possess two equal roots, has the expression $j^2 - \tfrac{1}{6}i^3$; it is nine times the discriminant of the cubic resolvent $k^3 - \tfrac{1}{2}ik - \tfrac{1}{3}j$, and has also the expression $4(t, t')^6$. The quartic has four equal roots, that is to say, is a perfect fourth power, when the Hessian vanishes identically; and conversely. This can be verified by equating to zero the five coefficients of the Hessian $(ab)^2a_x^2b_x^2$. Gordan has also shown that the vanishing of the Hessian of the binary $n^{ic}$ is the necessary and sufficient condition to ensure the form being a perfect $n$th power. The vanishing of the invariants $i$ and $j$ is the necessary and sufficient condition to ensure the quartic having three equal roots. On the one hand, assuming the quartic to have the form $4x_1^3x_2$, we find $i = j = 0$, and on the other hand, assuming $i = j = 0$, we find that the quartic must have the form $a_0x_1^4 + 4a_1x_1^3x_2$, which proves the proposition. The quartic will have two pairs of equal roots, that is, will be a perfect square, if it and its Hessian merely differ by a numerical factor. For it is easy to establish the formula
$$(yx)^2\Delta_x^4 = 2f.f_y^2 - 2(f_y^1)^2$$
connecting the Hessian with the quartic and its first and second polars; now $\alpha$, a root of $f$, is also a root of $\Delta_x^4$, and consequently the first polar
$$f_y^1 = y_1\frac{\partial f}{\partial x_1} + y_2\frac{\partial f}{\partial x_2}$$
must also vanish for the root $\alpha$, and thence $\dfrac{\partial f}{\partial x_1}$ and $\dfrac{\partial f}{\partial x_2}$ must also vanish for the same root; which proves that $\alpha$ is a double root of $f$, and $f$ therefore a perfect square. When $f = 6x_1^2x_2^2$ it will be found that $\Delta = -f$. The simplest form to which the quartic is in general reducible is $f = x_1^4 + 6mx_1^2x_2^2 + x_2^4$, involving one parameter $m$; then
$$\Delta_x^4 = 2m(x_1^4 + x_2^4) + 2(1 - 3m^2)x_1^2x_2^2;\quad i = 2(1 + 3m^2);\quad j = 6m(1 - m^2);\quad t = (1 - 9m^2)(x_1^2 - x_2^2)(x_1^2 + x_2^2)x_1x_2.$$
The sextic covariant $t$ is seen to be factorizable into three quadratic factors $\phi = x_1x_2$, $\psi = x_1^2 + x_2^2$, $\chi = x_1^2 - x_2^2$, which are such that the three mutual second transvectants vanish identically; they are for this reason termed conjugate quadratic factors. It is on a consideration of these factors of $t$ that Cayley bases his solution of the quartic equation. For, since
$$-2t^2 = \Delta^3 - \tfrac{1}{2}if^2\Delta - \tfrac{1}{3}j(-f)^3,$$
he compares the right-hand side with the cubic resolvent $k^3 - \tfrac{1}{2}i\lambda^2k - \tfrac{1}{3}j\lambda^3$ of $f = 0$, and notices that they become identical on substituting $\Delta$ for $k$, and $-f$ for $\lambda$; hence, if $k_1, k_2, k_3$ be the roots of the resolvent,
$$-2t^2 = (\Delta + k_1f)(\Delta + k_2f)(\Delta + k_3f);$$
and now, if all the roots of $f$ be different, so also are those of the resolvent, since the latter, and $f$, have practically the same discriminant; consequently each of the three factors of $-2t^2$ must be a perfect square, and taking the square root
$$t = \frac{1}{\sqrt{-2}}\,\phi.\chi.\psi;$$
and it can be shown that $\phi, \chi, \psi$ are the three conjugate quadratic factors of $t$ above mentioned. We have
$$\Delta + k_1f = \phi^2,\qquad \Delta + k_2f = \chi^2,\qquad \Delta + k_3f = \psi^2,$$
and Cayley shows that a root of the quartic can be expressed in a determinant form of three rows, involving $1, k_1, \phi$; $1, k_2, \chi$; $1, k_3, \psi$, the remaining roots being obtained by varying the signs which occur in the radicals. The transformation to the normal form reduces the quartic to a quadratic. The new variables $y_1, y_2$ are the linear factors of $\phi$. If $\phi = r_x.s_x$, the normal form of $a_x^4$ can be shown to be given by
$$(rs)^4.a_x^4 = (ar)^4s_x^4 + 6(ar)^2(as)^2r_x^2s_x^2 + (as)^4r_x^4;$$
$\phi$ is any one of the conjugate quadratic factors of $t$, so that, in determining $r_x, s_x$ from $\sqrt{\Delta + k_1f} = 0$, $k_1$ is any root of the resolvent. The transformation to the normal form, by the solution of a cubic and a quadratic, therefore, supplies a solution of the quartic. If $(\lambda\mu)$ is the modulus of the transformation by which $a_x^4$ is reduced to the normal form, $i$ becomes $(\lambda\mu)^4i$, and $j$, $(\lambda\mu)^6j$; hence $\dfrac{i^3}{j^2}$ is absolutely unaltered by transformation, and is termed the absolute invariant. Since therefore
$$\frac{i^3}{j^2} = \frac{2}{9}\,\frac{(1 + 3m^2)^3}{m^2(1 - m^2)^2},$$
we have a cubic equation for determining $m^2$ as a function of the absolute invariant.
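The statements about the invariants of the canonical quartic are easily checked by exact arithmetic. The sketch below is illustrative only. It assumes the usual unsymbolic expressions $i = 2(a_0a_4 - 4a_1a_3 + 3a_2^2)$ and $j = 6(a_0a_2a_4 + 2a_1a_2a_3 - a_0a_3^2 - a_1^2a_4 - a_2^3)$ for the quartic $a_0x_1^4 + 4a_1x_1^3x_2 + 6a_2x_1^2x_2^2 + 4a_3x_1x_2^3 + a_4x_2^4$; these are not quoted in the text, but they reproduce the canonical values $2(1+3m^2)$ and $6m(1-m^2)$. The function names are hypothetical.

```python
from fractions import Fraction

def quartic_invariants(a0, a1, a2, a3, a4):
    """Invariants i and j of the quartic with binomial coefficients
    (assumed unsymbolic expressions, normalised to match the canonical values)."""
    i = 2 * (a0 * a4 - 4 * a1 * a3 + 3 * a2 ** 2)
    j = 6 * (a0 * a2 * a4 + 2 * a1 * a2 * a3
             - a0 * a3 ** 2 - a1 ** 2 * a4 - a2 ** 3)
    return i, j

def canonical_invariants(m):
    """i and j for the canonical form x1^4 + 6 m x1^2 x2^2 + x2^4."""
    return quartic_invariants(1, 0, m, 0, 1)

def discriminant(i, j):
    """The discriminant j^2 - i^3/6 of the quartic."""
    return Fraction(j) ** 2 - Fraction(i) ** 3 / 6

def absolute_invariant(m):
    """i^3 / j^2 computed from the invariants of the canonical form."""
    i, j = canonical_invariants(m)
    return Fraction(i) ** 3 / Fraction(j) ** 2

def absolute_invariant_closed_form(m):
    """The closed expression (2/9)(1+3m^2)^3 / (m^2 (1-m^2)^2)."""
    m = Fraction(m)
    return Fraction(2, 9) * (1 + 3 * m * m) ** 3 / (m * m * (1 - m * m) ** 2)
```

With $m = \tfrac{1}{3}$ the canonical form is the perfect square $(x_1^2 + x_2^2)^2$ and the discriminant vanishes, while for the form $4x_1^3x_2$ both $i$ and $j$ vanish, as stated above.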
Remark.-Hermite has shown (Crelle, Bd. lii.) that the substitution $z = -\dfrac{\Delta}{f}$ reduces
$$\frac{x_2\,dx_1 - x_1\,dx_2}{\sqrt{f}}\qquad\text{to the form}\qquad \frac{dz}{2\sqrt{2}\,\sqrt{z^3 - \tfrac{1}{2}iz - \tfrac{1}{3}j}}.$$
The Binary Quintic.-The complete system consists of 23 forms, of which the simplest are $f = a_x^5$; the Hessian $H = (f, f')^2 = (ab)^2a_x^3b_x^3$; the quadratic covariant $i = (f, f')^4 = (ab)^4a_xb_x$; and the nonic covariant $T = (f, (f', f'')^2)^1 = (f, H)^1 = (aH)a_x^4H_x^5 = (ab)^2(ca)a_x^2b_x^3c_x^4$; the remaining 19 are expressible as transvectants of compounds of these four.
There are four invariants: $(i, i')^2$; $(i^3, H)^6$; $(f^2, i^5)^{10}$; $(f.T, i^7)^{14}$; four linear forms: $(f, i^2)^4$; $(f, i^3)^5$; $(i^4, T)^8$; $(i^5, T)^9$; three quadratic forms: $i$; $(H, i^2)^4$; $(H, i^3)^5$; three cubic forms: $(f, i)^2$; $(f, i^2)^3$; $(i^3, T)^6$; two quartic forms: $(H, i)^2$; $(H, i^2)^3$; three quintic forms: $f$; $(f, i)^1$; $(i^2, T)^4$; two sextic forms: $H$; $(H, i)^1$; one septic form: $(i, T)^2$; one nonic form: $T$.
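The order of each member of the system follows from the rule that the $k$th transvectant of forms of orders $m$ and $n$ is of order $m + n - 2k$, and its degree is the sum of the degrees of the two forms. The sketch below tabulates the 23 forms on this principle; it is an illustration only, the names are hypothetical, and the third and fourth invariants are entered as $(f^2, i^5)^{10}$ and $(f.T, i^7)^{14}$, a reading of a damaged passage which the bookkeeping confirms to be at least consistent in order and degree.

```python
from collections import Counter

# Order in the variables and degree in the coefficients of the four simplest forms.
ORDER = {"f": 5, "H": 6, "i": 2, "T": 9}
DEGREE = {"f": 1, "H": 2, "i": 2, "T": 3}

def compound(powers):
    """Order and degree of a product such as i^3 or f.T, given as {name: power}."""
    order = sum(ORDER[s] * p for s, p in powers.items())
    degree = sum(DEGREE[s] * p for s, p in powers.items())
    return order, degree

def form(left, right=None, k=0):
    """(order, degree) of a ground form, or of the k-th transvectant (left, right)^k."""
    lo, ld = compound(left)
    if right is None:
        return lo, ld
    ro, rd = compound(right)
    return lo + ro - 2 * k, ld + rd

QUINTIC_SYSTEM = [
    # invariants
    form({"i": 1}, {"i": 1}, 2), form({"i": 3}, {"H": 1}, 6),
    form({"f": 2}, {"i": 5}, 10), form({"f": 1, "T": 1}, {"i": 7}, 14),
    # linear forms
    form({"f": 1}, {"i": 2}, 4), form({"f": 1}, {"i": 3}, 5),
    form({"i": 4}, {"T": 1}, 8), form({"i": 5}, {"T": 1}, 9),
    # quadratic forms
    form({"i": 1}), form({"H": 1}, {"i": 2}, 4), form({"H": 1}, {"i": 3}, 5),
    # cubic forms
    form({"f": 1}, {"i": 1}, 2), form({"f": 1}, {"i": 2}, 3),
    form({"i": 3}, {"T": 1}, 6),
    # quartic forms
    form({"H": 1}, {"i": 1}, 2), form({"H": 1}, {"i": 2}, 3),
    # quintic forms
    form({"f": 1}), form({"f": 1}, {"i": 1}, 1), form({"i": 2}, {"T": 1}, 4),
    # sextic forms
    form({"H": 1}), form({"H": 1}, {"i": 1}, 1),
    # septic and nonic forms
    form({"i": 1}, {"T": 1}, 2), form({"T": 1}),
]

def census(system):
    """Number of forms of each order."""
    return dict(Counter(order for order, _ in system))

def invariant_degrees(system):
    """Degrees of the forms of order zero."""
    return sorted(degree for order, degree in system if order == 0)
```

The census reproduces the enumeration of the text (four invariants, four linear forms, and so on up to the single nonic), and the invariants come out of degrees 4, 8, 12 and 18, all but the last being of even degree in accordance with the relation $w = \tfrac{1}{2}n\theta$ as it applies to ordinary invariants.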
We will write the cubic covariant $(f, i)^2 = j$, and then remark that the result, $(f, j)^3 = 0$, can be readily established. The form $j$ is completely defined by the relation $(f, j)^3 = 0$ as no other covariant possesses this property.
Certain covariants of the quintic involve the same determinant factors as appeared in the system of the quartic; these are $f$, $H$, $i$, $T$ and $j$, and are of special importance. Further, it is convenient to have before us two other quadratic covariants, viz. $\tau = (j, j)^2j_xj_x$; $\theta = (i\tau)i_x\tau_x$; four other linear covariants, viz. $\alpha = -(ji)^2j_x$; $\beta = (i\alpha)i_x$; $\gamma = (\tau\alpha)\tau_x$; $\delta = (\tau\beta)\tau_x$. Further, in the case of invariants, we write $A = (i, i')^2$ and take three new forms $B = (i, \tau)^2$; $C = (\tau, \tau')^2$; $R = (\beta\gamma)$. Hermite expresses the quintic in a forme-type in which the constants are invariants and the variables linear covariants. If $\alpha, \beta$ be the linear forms, above defined, he raises the identity $a_x(\alpha\beta) = \alpha_x(a\beta) - \beta_x(a\alpha)$ to the fifth power (and in general to the power $n$) obtaining
$$(\alpha\beta)^5f = (a\beta)^5\alpha_x^5 - 5(a\beta)^4(a\alpha)\alpha_x^4\beta_x + \ldots - (a\alpha)^5\beta_x^5,$$
and then expresses the coefficients, on the right, in terms of the fundamental invariants. On this principle the covariant $j$ is expressible in the form
$$R^2j = \delta^3 + \tfrac{1}{2}B\delta^2\alpha + \tfrac{1}{4}AC\delta\alpha^2 + C(3AB - 4C)\alpha^3,$$
when $\delta, \alpha$ are the above defined linear forms.
Hence, solving the cubic,
$$R^2j = (\delta - m_1\alpha)(\delta - m_2\alpha)(\delta - m_3\alpha),$$
wherein $m_1, m_2, m_3$ are invariants.
Sylvester showed that the quintic might, in general, be expressed as the sum of three fifth powers, viz. in the canonical form
$$f = k_1(p_x)^5 + k_2(q_x)^5 + k_3(r_x)^5.$$
Now, evidently, the third transvectant of $f$, expressed in this form, with the cubic $p_xq_xr_x$ is zero, and hence from a property of the covariant $j$ we must have $j = p_xq_xr_x$; showing that the linear forms involved are the linear factors of $j$. We may therefore write
$$f = k_1(\delta - m_1\alpha)^5 + k_2(\delta - m_2\alpha)^5 + k_3(\delta - m_3\alpha)^5;$$
and we have merely to determine the constants $k_1, k_2, k_3$. To determine them notice that $R = (\alpha\delta)$ and then
$$(f, \alpha^5)^5 = -R^5(k_1 + k_2 + k_3),$$
$$(f, \alpha^4\delta)^5 = -5R^5(m_1k_1 + m_2k_2 + m_3k_3),$$
$$(f, \alpha^3\delta^2)^5 = -10R^5(m_1^2k_1 + m_2^2k_2 + m_3^2k_3),$$
three equations for determining $k_1, k_2, k_3$. This canonical form depends upon $j$ having three unequal linear factors. When $C$ vanishes $j$ has the form $j = p_x^2q_x$, and $(f, j)^3 = (ap)^2(aq)a_x^2 = 0$. Hence, from the identity $a_x(pq) = p_x(aq) - q_x(ap)$, we obtain
$$(pq)^5f = (aq)^5p_x^5 - 5(ap)(aq)^4p_x^4q_x - (ap)^5q_x^5,$$
the required canonical form. Now, when $C = 0$, clearly (see ante) $R^2j = \delta^2\rho$ where $\rho = \delta + \tfrac{1}{2}B\alpha$; and Gordan then proves the relation
$$6R^4.f = B\delta^5 + 5B\delta^4\rho - 4A^2\rho^5,$$
which is Bring's form of quintic at which we can always arrive, by linear transformation, whenever the invariant $C$ vanishes.
Remark.-The invariant $C$ is a numerical multiple of the resultant of the covariants $i$ and $j$, and if $C = 0$, $\rho$ is the common factor of $i$ and $j$. The discriminant is the resultant of $\dfrac{\partial f}{\partial x_1}$ and $\dfrac{\partial f}{\partial x_2}$ and of degree 8 in the coefficients; since it is a rational and integral function of the fundamental invariants it is expressible as a linear function of $A^2$ and $B$; it is independent of $C$, and is therefore unaltered when $C$ vanishes; we may therefore take $f$ in the canonical form
$$6R^4f = B\delta^5 + 5B\delta^4\rho - 4A^2\rho^5.$$
The Binary Sextic.-The complete system consists of 26 forms, of which the simplest are $f = a_x^6$; the Hessian $H = (ab)^2a_x^4b_x^4$; the quartic $i = (ab)^4a_x^2b_x^2$; the covariants $l = (ai)^4a_x^2$; $T = (ab)^2(cb)a_x^4b_x^3c_x^5$; and the invariants $A = (ab)^6$; $B = (ii')^4$. There are 5 invariants: $(a, b)^6$, $(i, i')^4$, $(l, l')^2$, $(f, l^3)^6$, $((f, i), l^4)^8$; 6 of order 2: $l$, $(i, l)^2$, $(f, l^2)^4$, $(i, l^2)^3$, $(f, l^3)^5$, $((f, i), l^3)^6$; 5 of order 4: $i$, $(f, l)^2$, $(i, l)$, $(f, l^2)^3$, $((f, i), l^2)^4$; 5 of order 6: $f$, $p = (ai)^2a_x^4i_x^2$, $(f, l)$, $((f, i), l)^2$, $(p, l)$; 3 of order 8: $H$, $(f, i)$, $(H, l)$; 1 of order 10: $(H, i)$; 1 of order 12: $T$.
For a further discussion of the binary sextic see Gordan, loc. cit., Clebsch, loc. cit. The complete systems of the quintic and sextic were first obtained by Gordan in 1868 (Journ. f. Math. lxix. 323-354). August von Gall in 1880 obtained the complete system of the binary octavic (Math. Ann. xvii. 31-52, 139-152, 456); and, in 1888, that of the binary septimic, which proved to be much more complicated (Math. Ann. xxxi. 318-336). Single binary forms of higher and finite order have not been studied with complete success, but the system of the binary form of infinite order has been completely determined by Sylvester, Cayley, MacMahon and Stroh, each of whom contributed to the theory.
As regards simultaneous binary forms, the system of two quadratics, and of any number of quadratics, is alluded to above and has long been known. The system of the quadratic and cubic, consisting of 15 forms, and that of two cubics, consisting of 26 forms, were obtained by Salmon and Clebsch; that of the cubic and quartic we owe to Sigmund Gundelfinger (Programm Stuttgart, 1869, 1-43); that of the quadratic and quintic to Winter (Programm Darmstadt, 1880); that of the quadratic and sextic to von Gall (Programm Lemgo, 1873); that of two quartics to Gordan (Math. Ann. ii. 227-281, 1870); and to Eugenio Bertini (Batt. Giorn. xiv. 1-14, 1876; also Math. Ann. xi. 30-41, 1877). The system of four forms, of which two are linear and two quadratic, has been investigated by Perrin (S. M. F. Bull. xv. 45-61, 1887).
Ternary and Higher Forms.-The ternary form of order $n$ is represented symbolically by
$$(a_1x_1 + a_2x_2 + a_3x_3)^n = a_x^n;$$
and, as usual, $b, c, d, \ldots$ are alternative symbols, so that
$$a_x^n = b_x^n = c_x^n = d_x^n = \ldots$$
To form an invariant or covariant we have merely to form a product of factors of two kinds, viz. determinant factors $(abc), (abd), (bce)$, etc., and other factors $a_x, b_x, c_x, \ldots$ in such manner, that each of the symbols $a, b, c, \ldots$ occurs $n$ times. Such a symbolic product, if it does not vanish identically, denotes an invariant or a covariant, according as factors $a_x, b_x, c_x, \ldots$ do not or do appear. To obtain the real form we multiply out, and, in the result, substitute for the products of symbols the real coefficients which they denote.
For example, take the ternary quadratic $(a_1x_1 + a_2x_2 + a_3x_3)^2 = a_x^2$, or in real form $ax_1^2 + bx_2^2 + cx_3^2 + 2fx_2x_3 + 2gx_3x_1 + 2hx_1x_2$. We can see that $(abc)a_xb_xc_x$ is not a covariant, because it vanishes identically, the interchange of $a$ and $b$ changing its sign instead of leaving it unchanged; but $(abc)^2$ is an invariant. If $a_x^2, b_x^2, c_x^2$ be different forms we obtain, after development of the squared determinant and conversion to the real form (employing single and double dashes to distinguish the real coefficients of $b_x^2$ and $c_x^2$),
$$a(b'c'' + b''c' - 2f'f'') + b(c'a'' + c''a' - 2g'g'') + c(a'b'' + a''b' - 2h'h'')$$
$$+\ 2f(g'h'' + g''h' - a'f'' - a''f') + 2g(h'f'' + h''f' - b'g'' - b''g') + 2h(f'g'' + f''g' - c'h'' - c''h');$$
a simultaneous invariant of the three forms, and now suppressing the dashes we obtain
$$6(abc + 2fgh - af^2 - bg^2 - ch^2),$$
the expression in brackets being the well-known invariant of $a_x^2$, the vanishing of which expresses the condition that the form may break up into two linear factors, or, geometrically, that the conic may represent two right lines. The complete system consists of the form itself and this invariant.
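The passage from the simultaneous invariant of three ternary quadratics to the invariant of a single one can be verified numerically. The sketch below is an illustration, not part of the original; each form is given by its six real coefficients $(a, b, c, f, g, h)$, and the function names are hypothetical.

```python
def simultaneous_invariant(p, q, r):
    """(abc)^2 in real coefficients for three ternary quadratics,
    each given as the tuple (a, b, c, f, g, h)."""
    a, b, c, f, g, h = p
    a1, b1, c1, f1, g1, h1 = q   # singly dashed coefficients
    a2, b2, c2, f2, g2, h2 = r   # doubly dashed coefficients
    return (a * (b1 * c2 + b2 * c1 - 2 * f1 * f2)
            + b * (c1 * a2 + c2 * a1 - 2 * g1 * g2)
            + c * (a1 * b2 + a2 * b1 - 2 * h1 * h2)
            + 2 * f * (g1 * h2 + g2 * h1 - a1 * f2 - a2 * f1)
            + 2 * g * (h1 * f2 + h2 * f1 - b1 * g2 - b2 * g1)
            + 2 * h * (f1 * g2 + f2 * g1 - c1 * h2 - c2 * h1))

def conic_invariant(p):
    """abc + 2fgh - af^2 - bg^2 - ch^2 for a single ternary quadratic."""
    a, b, c, f, g, h = p
    return a * b * c + 2 * f * g * h - a * f * f - b * g * g - c * h * h
```

When the three forms coincide the first function returns six times the second; and for the form $2x_1x_2$, which is already a product of two linear factors, the invariant vanishes.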
The ternary cubic has been investigated by Cayley, Aronhold, Hermite, Brioschi and Gordan. The principal reference is to Gordan (Math. Ann. i. 90-128, 1869, and vi. 436`512, 1873). The complete covariant and contravariant system includes no fewer than 34 forms; from its complexity it is desirable to consider the cubic in a simple canonical form; that chosen by Cayley was ax 3 +by 3 + cz 3 + 6dxyz (Amer. J. Math. iv. 1-16, 1881). Another form, associated with the theory of elliptic functions, has been considered by Dingeldey (Math. Ann. xxxi. 157-176, 1888), viz. xy 2 -4z 3 +g2x 2 y+g3x 3, and also the special form axz 2 -4by 3 of the cuspidal cubic. An investigation, by non-symbolic methods, is due to F. C. J. Mertens (Wien. Ber. xcv. 942-993, 1887). Hesse showed independently that the general ternary cubic can be reduced, by linear transformation, to the form x3+y3+z3+ 6mxyz, a form which involves 9 independent constants, as should be the case; it must, however, be remarked that the counting of constants is not a sure guide to the existence of a conjectured canonical form. Thus the ternary quartic is not, in general, expressible as a sum of five 4th powers as the counting of constants might have led one to expect, a theorem due to Sylvester. Hesse's canonical form shows at once that there cannot be more than two independent invariants; for if there were three we could, by elimination of the modulus of transformation, obtain two functions of the coefficients equal to functions of m, and thus, by elimination of m, obtain a relation between the coefficients, showing them not to be independent, which is contrary to the hypothesis.
The simplest invariant is $S=(abc)(abd)(acd)(bcd)$ of degree 4, which for the canonical form of Hesse is $m(1-m^3)$; its vanishing indicates that the form is expressible as a sum of three cubes. The Hessian is symbolically $(abc)^2a_xb_xc_x=H_x^3$, and for the canonical form $(1+2m^3)xyz-m^2(x^3+y^3+z^3)$. By the process of Aronhold we can form the invariant $S$ for the cubic $a_x^3+\lambda H_x^3$, and then the coefficient of $\lambda$ is the second invariant $T$. Its symbolic expression, to a numerical factor près, is $(Hbc)(Hbd)(Hcd)(bcd)$, and it is clearly of degree 6.
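The stated Hessian of the canonical form can be checked directly from second derivatives. The sketch below is an illustration only and rests on an assumption about normalization: it takes the Hessian as the plain determinant of second partial derivatives of $x^3+y^3+z^3+6mxyz$, which differs from the symbolic $(abc)^2a_xb_xc_x$ by a numerical factor (216 in this normalization). Exact rational arithmetic is used so that the comparison is not subject to rounding.

```python
from fractions import Fraction

def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def hessian_determinant(m, x, y, z):
    """Determinant of second derivatives of x^3 + y^3 + z^3 + 6 m x y z."""
    rows = [[6 * x, 6 * m * z, 6 * m * y],
            [6 * m * z, 6 * y, 6 * m * x],
            [6 * m * y, 6 * m * x, 6 * z]]
    return det3(rows)

def stated_hessian(m, x, y, z):
    """(1 + 2 m^3) x y z - m^2 (x^3 + y^3 + z^3), as given in the text."""
    return (1 + 2 * m ** 3) * x * y * z - m ** 2 * (x ** 3 + y ** 3 + z ** 3)

# arbitrary rational sample points (m, x, y, z)
samples = [(Fraction(1, 2), Fraction(3), Fraction(-2), Fraction(5, 7)),
           (Fraction(-3), Fraction(1, 4), Fraction(2), Fraction(-1))]
checks = [hessian_determinant(*s) == 216 * stated_hessian(*s) for s in samples]
```

Every entry of `checks` is true, so the two expressions agree at the sample points up to the constant factor assumed above.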
One more covariant is requisite to make an algebraically complete set. This is of degree 8 in the coefficients, and degree 6 in the variables, and, for the canonical form, has the expression $-9m^6(x^3+y^3+z^3)^2-(2m+5m^4+20m^7)(x^3+y^3+z^3)xyz-(15m^2+78m^5-12m^8)\ldots$

Passing on to the ternary quartic we find that the number of ground forms is apparently very great. Gordan (Math. Ann. xvii. 217-233), limiting himself to a particular case of the form, has determined 54 ground forms, and G. Maisano (Batt. G. xix. 198-237, 1881) has determined all up to and including the 5th degree in the coefficients.
The system of two ternary quadratics consists of 20 forms; it has been investigated by Gordan (Clebsch-Lindemann's Vorlesungen i. 288, also Math. Ann. xix. 529-552); Perrin (S. M. F. Bull. xviii. 1-80, 1890); Rosanes (Math. Ann. vi. 264); and Gerbaldi (Annali (2), xvii. 161-196).
Ciamberlini has found a system of 127 forms appertaining to three ternary quadratics (Batt. G. xxiv. 141-157).
A. R. Forsyth has discussed the algebraically complete sets of ground forms of ternary and quaternary forms (see Amer. J. xii. 1-60, 115-160, and Camb. Phil. Trans. xiv. 409-466, 1889). He proves, by means of the six linear partial differential equations satisfied by the concomitants, that, if any concomitant be expanded in powers of $x_1, x_2, x_3$, the point variables, and of $u_1, u_2, u_3$, the contragredient line variables, it is completely determinate if its leading coefficient be known. For the unipartite ternary quantic of order $n$ he finds that the fundamental system contains $\tfrac12(n+4)(n-1)$ individuals. He successively considers the systems of two and three simultaneous ternary quadratics. In Part III. of the Memoir he discusses bi-ternary quantics, and in particular those which are lineo-linear, quadrato-linear, cubo-linear, quadrato-quadratic, cubo-cubic, and the system of two lineo-linear quantics. He shows that the system of the bi-ternary $n^{ic}m^{ic}$ comprises $\tfrac14(n+1)(n+2)(m+1)(m+2)-3$ individuals.
Bibliographical references to ternary forms are given by Forsyth (Amer. J. xii. p. 16) and by Cayley (Amer. J. iv., 1881). Clebsch, in 1872, in papers in Abh. d. K. Akad. d. U. zu Göttingen, t. xvii. and Math. Ann. t. v., established the important result that in the case of a form in $n$ variables, the concomitants of the form, or of a system of such forms, involve in the aggregate $n-1$ classes of variables. For instance, those of a ternary form involve two classes which may be geometrically interpreted as point and line co-ordinates in a plane; those of a quaternary form involve three classes which may be geometrically interpreted as point, line and plane co-ordinates in space.
IV. Enumerating Generating Functions
Professor Michael Roberts (Quart. Math. J. iv.) was the first to remark that the study of covariants may be reduced to the study of their leading coefficients, and that from any relations connecting the latter are immediately derivable the relations connecting the former. It has been shown above that a covariant, in general, satisfies four partial differential equations. Two of these show that the leading coefficient of any covariant is an isobaric and homogeneous function of the coefficients of the form; the remaining two may be regarded as operators which cause the vanishing of the covariant. These may be written, for the binary $n^{ic}$,
$$\sum k\,a_{k-1}\frac{\partial}{\partial a_k}-x_2\frac{\partial}{\partial x_1}=0;\qquad \sum (n-k)\,a_{k+1}\frac{\partial}{\partial a_k}-x_1\frac{\partial}{\partial x_2}=0;$$
or in the form
$$\Omega-x_2\frac{\partial}{\partial x_1}=0,\qquad O-x_1\frac{\partial}{\partial x_2}=0;$$
where
$$\Omega=a_0\frac{\partial}{\partial a_1}+2a_1\frac{\partial}{\partial a_2}+\ldots+na_{n-1}\frac{\partial}{\partial a_n},\qquad O=na_1\frac{\partial}{\partial a_0}+(n-1)a_2\frac{\partial}{\partial a_1}+\ldots+a_n\frac{\partial}{\partial a_{n-1}}.$$
Let a covariant of degree $\epsilon$ in the variables, and of degree $\theta$ in the coefficients (the weight of the leading coefficient being $w$ and $n\theta-2w=\epsilon$), be $C_0x_1^{\epsilon}+\epsilon C_1x_1^{\epsilon-1}x_2+\ldots$
Operating with $\Omega-x_2\frac{\partial}{\partial x_1}$ we find $\Omega C_0=0$; that is to say, $C_0$ satisfies one of the two partial differential equations satisfied by an invariant. It is for this reason called a seminvariant, and every seminvariant is the leading coefficient of a covariant. The whole theory of invariants of a binary form depends upon the solutions of the equation $\Omega=0$. Before discussing these it is best to transform the binary form by substituting $1!\,a_1, 2!\,a_2, 3!\,a_3, \ldots n!\,a_n$, for $a_1, a_2, a_3, \ldots a_n$ respectively; it then becomes
$$a_0x_1^n+na_1x_1^{n-1}x_2+n(n-1)a_2x_1^{n-2}x_2^2+\ldots+n!\,a_nx_2^n,$$
and $\Omega$ takes the simpler form
$$a_0\frac{\partial}{\partial a_1}+a_1\frac{\partial}{\partial a_2}+a_2\frac{\partial}{\partial a_3}+\ldots+a_{n-1}\frac{\partial}{\partial a_n}.$$
One advantage we have obtained is that, if we now write $a_0=0$, and substitute $a_{s-1}$ for $a_s$, when $s>0$, we obtain
$$a_0\frac{\partial}{\partial a_1}+a_1\frac{\partial}{\partial a_2}+a_2\frac{\partial}{\partial a_3}+\ldots+a_{n-2}\frac{\partial}{\partial a_{n-1}},$$
which is the form of $\Omega$ for a binary $(n-1)^{ic}$. Hence by merely diminishing each suffix in a seminvariant by unity, we obtain another seminvariant of the same degree, and of weight $w-\theta$, appertaining to the $(n-1)^{ic}$. Also, if we increase each suffix in a seminvariant, we obtain terms, free from $a_0$, of some seminvariant of degree $\theta$ and weight $w+\theta$. Ex. gr. from the invariant $a_2^2-2a_1a_3+2a_0a_4$ of the quartic the diminishing process yields $a_1^2-2a_0a_2$, the leading coefficient of the Hessian of the cubic, and the increasing process leads to $a_3^2-2a_2a_4+2a_1a_5$, which only requires the additional term $-2a_0a_6$ to become a seminvariant of the sextic.

A more important advantage, springing from the new form of $\Omega$, arises from the fact that if $x^n-a_1x^{n-1}+a_2x^{n-2}-\ldots(-)^na_n=(x-\alpha_1)(x-\alpha_2)\ldots(x-\alpha_n)$, the sums of powers $\sum\alpha^2, \sum\alpha^3, \sum\alpha^4, \ldots \sum\alpha^n$ all satisfy the equation $\Omega=0$. Hence, excluding $a_0$, we may, in partition notation, write down the fundamental solutions of the equation, viz. $(2), (3), (4), \ldots (n)$, and say that with $a_0$, we have an algebraically complete system.
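The suffix-diminishing process and the simplified operator $\Omega=\sum a_{k-1}\,\partial/\partial a_k$ lend themselves to a mechanical check. The following is a minimal sketch under an assumed representation (not the article's): a polynomial in $a_0,\ldots,a_n$ is a dictionary mapping exponent tuples $(e_0,\ldots,e_n)$ to integer coefficients, and the helper names `omega` and `diminish` are hypothetical.

```python
def omega(poly):
    """Apply Omega = sum_k a_{k-1} d/da_k to a polynomial in a_0..a_n."""
    out = {}
    for exps, coeff in poly.items():
        for k in range(1, len(exps)):
            if exps[k] == 0:
                continue
            new = list(exps)
            new[k] -= 1          # differentiate with respect to a_k
            new[k - 1] += 1      # multiply by a_{k-1}
            key = tuple(new)
            out[key] = out.get(key, 0) + coeff * exps[k]
    return {key: val for key, val in out.items() if val != 0}

def diminish(poly):
    """Put a_0 = 0, then lower every remaining suffix by unity."""
    return {exps[1:]: coeff for exps, coeff in poly.items() if exps[0] == 0}

# a_2^2 - 2 a_1 a_3 + 2 a_0 a_4, the invariant of the quartic
quartic_invariant = {(0, 0, 2, 0, 0): 1, (0, 1, 0, 1, 0): -2, (1, 0, 0, 0, 1): 2}
# a_1^2 - 2 a_0 a_2, obtained by the diminishing process (cubic)
cubic_seminvariant = diminish(quartic_invariant)
# a_3^2 - 2 a_2 a_4 + 2 a_1 a_5 - 2 a_0 a_6, for the sextic
sextic_seminvariant = {(0, 0, 0, 2, 0, 0, 0): 1, (0, 0, 1, 0, 1, 0, 0): -2,
                       (0, 1, 0, 0, 0, 1, 0): 2, (1, 0, 0, 0, 0, 0, 1): -2}
```

Each of the three polynomials is annihilated by `omega`, that is, `omega(...)` returns an empty dictionary, whereas changing the sign of the term $2a_0a_4$ leaves a non-zero remainder.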
Every symmetric function denoted by partitions, not involving the figure unity (say a non-unitary symmetric function), which remains unchanged by any increase of $n$, is also a seminvariant, and we may take if we please another fundamental system, viz. $a_0, (2), (3), (2^2), (32), \ldots (2^{\frac12 n})$ or $(32^{\frac12(n-3)})$.
Observe that, if we subject any symmetric function $(p_1p_2p_3\ldots)$ to the diminishing process, it becomes $a_0^{p_1-p_2}(p_2p_3\ldots)$.

Next consider the solutions of $\Omega=0$ which are of degree $\theta$ and weight $w$. The general term in a solution involves the product $a_0^{\pi_0}a_1^{\pi_1}a_2^{\pi_2}\ldots a_n^{\pi_n}$ wherein $\sum\pi=\theta$, $\sum s\pi_s=w$; the number of such products that may appear depends upon the number of partitions of $w$ into $\theta$ or fewer parts limited not to exceed $n$ in magnitude. Let this number be denoted by $(w;\theta,n)$. In order to obtain the seminvariants we would write down the $(w;\theta,n)$ terms each associated with a literal coefficient; if we now operate with $\Omega$ we obtain a linear function of $(w-1;\theta,n)$ products, for the vanishing of which the literal coefficients must satisfy $(w-1;\theta,n)$ linear equations; hence $(w;\theta,n)-(w-1;\theta,n)$ of these coefficients may be assumed arbitrarily, and the number of linearly independent solutions of $\Omega=0$, of the given degree and weight, is precisely $(w;\theta,n)-(w-1;\theta,n)$. This theory is due to Cayley; its validity depends upon showing that the $(w-1;\theta,n)$ linear equations satisfied by the literal coefficients are independent; this has only recently been established by E. B. Elliott. These seminvariants are said to form an asyzygetic system. It is shown in the article on Combinatorial Analysis that $(w;\theta,n)$ is the coefficient of $a^{\theta}z^w$ in the ascending expansion of the fraction
$$\frac{1}{(1-a)(1-az)(1-az^2)\ldots(1-az^n)}.$$
Hence $(w;\theta,n)-(w-1;\theta,n)$ is given by the coefficient of $a^{\theta}z^w$ in the fraction
$$\frac{1-z}{(1-a)(1-az)(1-az^2)\ldots(1-az^n)},$$
the enumerating generating function of asyzygetic seminvariants. We may, by a well-known theorem, write the result as a coefficient of $z^w$ in the expansion of
$$\frac{(1-z^{n+1})(1-z^{n+2})\ldots(1-z^{n+\theta})}{(1-z^2)(1-z^3)\ldots(1-z^{\theta})};$$
and since this expression is unaltered by the interchange of $n$ and $\theta$ we prove Hermite's Law of Reciprocity, which states that the asyzygetic forms of degree $\theta$ for the $n^{ic}$ are equinumerous with those of degree $n$ for the $\theta^{ic}$.

The degree of the covariant in the variables is $\epsilon=n\theta-2w$; consequently we are only concerned with positive terms in the developments and $(w;\theta,n)-(w-1;\theta,n)$ will be negative unless $n\theta\geq 2w$. It is convenient to enumerate the seminvariants of degree $\theta$ and order $\epsilon=n\theta-2w$ by a generating function; so, in the first written generating function for seminvariants, write $1/z^2$ for $z$ and $az^n$ for $a$; we obtain
$$\frac{1-z^{-2}}{(1-az^n)(1-az^{n-2})(1-az^{n-4})\ldots(1-az^{-n+4})(1-az^{-n+2})(1-az^{-n})}$$
in which we have to take the coefficient of $a^{\theta}z^{n\theta-2w}$, the expansion being in ascending powers of $a$. As we have to do only with that part of the expansion which involves positive powers of $z$, we must try to isolate that portion, say $A_n(z)$. For $n=2$ we can prove that the complete function may be written
$$A_2(z)-\frac{1}{z^2}A_2\!\left(\frac1z\right),\qquad\text{where}\qquad A_2(z)=\frac{1}{(1-az^2)(1-a^2)};$$
and this is the reduced generating function which tells us, by its denominator factors, that the complete system of the quadratic is composed of the form itself of degree-order 1, 2 shown by $az^2$, and of the Hessian of degree-order 2, 0 shown by $a^2$.
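The numbers $(w;\theta,n)$ and Hermite's Law of Reciprocity can be tabulated by direct counting. The following is a minimal sketch; the function names are illustrative, and the recursion (split according to whether a part equal to $n$ occurs) is simply one convenient way of counting restricted partitions, not a method taken from the article.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def count(w, theta, n):
    """(w; theta, n): partitions of w into at most theta parts, none exceeding n."""
    if w == 0:
        return 1
    if w < 0 or theta == 0 or n == 0:
        return 0
    # either no part equals n, or one part equal to n is removed
    return count(w, theta, n - 1) + count(w - n, theta - 1, n)

def asyzygetic(w, theta, n):
    """Number of linearly independent seminvariants of degree theta, weight w."""
    return count(w, theta, n) - count(w - 1, theta, n)

# the cubic (n = 3): one invariant of degree 4 (weight 6, order 3*4 - 2*6 = 0)
cubic_invariants = asyzygetic(6, 4, 3)

# Hermite's reciprocity: interchange of n and theta leaves the number unaltered
reciprocity_holds = all(asyzygetic(w, t, n) == asyzygetic(w, n, t)
                        for n in range(1, 7)
                        for t in range(1, 7)
                        for w in range(0, n * t // 2 + 1))
```

With these definitions `cubic_invariants` equals 1, in agreement with the single quartic invariant of the cubic, and `reciprocity_holds` is true over the range tried.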
Again, for the cubic, we can find
$$A_3(z)=\frac{1-a^6z^6}{(1-az^3)(1-a^2z^2)(1-a^3z^3)(1-a^4)}$$
where the ground forms are indicated by the denominator factors, viz.: these are the cubic itself of degree-order 1, 3; the Hessian of degree-order 2, 2; the cubi-covariant G of degree-order 3, 3, and the quartic invariant of degree-order 4, 0. Further, the numerator factor establishes that these are not all algebraically independent, but are connected by a syzygy of degree-order 6, 6.

Similarly for the quartic
$$A_4(z)=\frac{1-a^6z^{12}}{(1-az^4)(1-a^2)(1-a^2z^4)(1-a^3)(1-a^3z^6)}$$
establishing the 5 ground forms and the syzygy which connects them.
The process is not applicable with complete success to quintic and higher ordered binary forms. This arises from the circumstance that the simple syzygies between the ground forms are not all independent, but are connected by second syzygies, and these again by third syzygies, and so on; this introduces new difficulties which have not been completely overcome. As regards invariants, a little further progress has been made by Cayley, who established the two generating functions for the quintic
$$\frac{1-a^{36}}{(1-a^4)(1-a^8)(1-a^{12})(1-a^{18})}$$
and for the sextic
$$\frac{1-a^{30}}{(1-a^2)(1-a^4)(1-a^6)(1-a^{10})(1-a^{15})}.$$
Accounts of further attempts in this direction will be found in Cayley's Memoirs on Quantics (Collected Papers), in the papers of Sylvester and Franklin (Amer. J. i.-iv.), and in Elliott's Algebra of Quantics, chap. viii.
Perpetuants.-Many difficulties, connected with binary forms of finite order, disappear altogether when we come to consider the form of infinite order. In this case the ground forms, called also perpetuants, have been enumerated and actual representative seminvariant forms established. Putting $n$ equal to $\infty$, in a generating function obtained above, we find that the function, which enumerates the asyzygetic seminvariants of degree $\theta$, is
$$\frac{1}{(1-z^2)(1-z^3)(1-z^4)\ldots(1-z^{\theta})}$$
that is to say, of the weight $w$, we have one form corresponding to each non-unitary partition of $w$ into the parts $2, 3, 4, \ldots\theta$. The extraordinary advantage of the transformation of $\Omega$ to association with non-unitary symmetric functions is now apparent; for we may take, as representative forms, the symmetric functions which are symbolically denoted by the partitions referred to. Ex. gr., of degree 3 weight 8, we have the two forms $(3^22)$, $a_0(2^4)$. If we wish merely to enumerate those whose partitions contain the figure $\theta$, and do not therefore contain any power of $a_0$ as a factor, we have the generator
$$\frac{z^{\theta}}{(1-z^2)(1-z^3)(1-z^4)\ldots(1-z^{\theta})}.$$
If $\theta=2$, every form is obviously a ground form or perpetuant, and the series of forms is denoted by $(2), (2^2), (2^3), \ldots (2^{\kappa+1})\ldots$ Similarly, if $\theta=3$, every form $(3^{\kappa+1}2^{\lambda})$ is a perpetuant. For these two cases the perpetuants are enumerated by
$$\frac{z^2}{1-z^2}\quad\text{and}\quad\frac{z^3}{(1-z^2)(1-z^3)}$$
respectively.

When $\theta=4$ it is clear that no form, whose partition contains a part 3, can be reduced; but every form, whose partition is composed of the parts 4 and 2, is by elementary algebra reducible by means of perpetuants of degree 2. These latter forms are enumerated by $\dfrac{z^4}{(1-z^2)(1-z^4)}$; hence the generator of quartic perpetuants must be
$$\frac{z^4}{(1-z^2)(1-z^3)(1-z^4)}-\frac{z^4}{(1-z^2)(1-z^4)}=\frac{z^7}{(1-z^2)(1-z^3)(1-z^4)},$$
and the general form of perpetuants is $(4^{\kappa+1}3^{\lambda+1}2^{\mu})$.

When $\theta=5$, the reducible forms are connected by syzygies which there is some difficulty in enumerating. Sylvester, Cayley and MacMahon succeeded, by a laborious process, in establishing the generators for $\theta=5$, and $\theta=6$, viz.:
$$\frac{z^{15}}{(1-z^2)(1-z^3)(1-z^4)(1-z^5)},\qquad\frac{z^{31}}{(1-z^2)(1-z^3)(1-z^4)(1-z^5)(1-z^6)};$$
but the true method of procedure is that of Stroh which we are about to explain.
Method of Stroh.-In the section on "Symmetric Function," it was noted that Stroh considers $(\sigma_1\alpha_1+\sigma_2\alpha_2+\ldots+\sigma_{\theta}\alpha_{\theta})^w$, where $\sigma_1+\sigma_2+\ldots+\sigma_{\theta}=0$ and $\alpha_1^s=\alpha_2^s=\ldots=\alpha_{\theta}^s=a_s$ symbolically, to be the fundamental form of seminvariant of degree $\theta$ and weight $w$; he observes that every form of this degree and weight is a linear function of such symbolic expressions. We may write
$$(1+\sigma_1\xi)(1+\sigma_2\xi)\ldots(1+\sigma_{\theta}\xi)=1+A_2\xi^2+A_3\xi^3+\ldots+A_{\theta}\xi^{\theta}.$$
If we expand the symbolic expression by the multinomial theorem, and remember that any symbolic product $\alpha_1^{\pi_1}\alpha_2^{\pi_2}\alpha_3^{\pi_3}\ldots$ retains the same value, however the suffixes be permuted, we shall obtain a sum of terms, such as
$$\frac{w!}{\pi_1!\,\pi_2!\,\pi_3!\ldots}\,\alpha_1^{\pi_1}\alpha_2^{\pi_2}\alpha_3^{\pi_3}\ldots\sum\sigma_1^{\pi_1}\sigma_2^{\pi_2}\sigma_3^{\pi_3}\ldots,$$
which in real form is $w!\,a_{\pi_1}a_{\pi_2}a_{\pi_3}\ldots\sum\sigma_1^{\pi_1}\sigma_2^{\pi_2}\sigma_3^{\pi_3}\ldots$; and, if we express $\sum\sigma_1^{\pi_1}\sigma_2^{\pi_2}\sigma_3^{\pi_3}\ldots$ in terms of $A_2, A_3, \ldots$, and arrange the whole as a linear function of products of $A_2, A_3, \ldots$, each coefficient will be a seminvariant, and the aggregate of the coefficients will give us the complete asyzygetic system of the given degree and weight.

When the proper degree is less than $\theta$, a factor consisting of a power of $a_0$ must be of course understood.

Ex. gr.
$$\frac{1}{2!}(\sigma_1\alpha_1+\sigma_2\alpha_2+\sigma_3\alpha_3+\sigma_4\alpha_4)^2=\frac{1}{2!}\sum\sigma_1^2\alpha_1^2+\sum\sigma_1\sigma_2\alpha_1\alpha_2=a_2(-2A_2)+a_1^2A_2=(a_1^2-2a_2)A_2=(2)A_2=a_0^2(2)A_2.$$
In general the coefficient of any product $A_{\pi_1}A_{\pi_2}A_{\pi_3}\ldots$ will be a seminvariant which, when expressed by partitions, will have as leading partition (preceding in dictionary order all others) the partition $(\pi_1\pi_2\pi_3\ldots)$. Now the symbolic expression of the seminvariant can be expanded by the binomial theorem so as to be exhibited as a sum of products of seminvariants, of lower degrees, if $\sigma_1\alpha_1+\sigma_2\alpha_2+\ldots+\sigma_{\theta}\alpha_{\theta}$ can be broken up into any two portions
$$(\sigma_1\alpha_1+\sigma_2\alpha_2+\ldots+\sigma_s\alpha_s)+(\sigma_{s+1}\alpha_{s+1}+\sigma_{s+2}\alpha_{s+2}+\ldots+\sigma_{\theta}\alpha_{\theta}),$$
such that $\sigma_1+\sigma_2+\ldots+\sigma_s=0$, for then $\sigma_{s+1}+\sigma_{s+2}+\ldots+\sigma_{\theta}=0$; and each portion raised to any power denotes a seminvariant.
Stroh assumes that every reducible seminvariant can in this way be reduced. The existence of such a relation, as $\sigma_1+\sigma_2+\ldots+\sigma_s=0$, necessitates the vanishing of a certain function of the coefficients $A_2, A_3, \ldots A_{\theta}$, and as a consequence one product of these coefficients can be eliminated from the expanding form and no seminvariant, which appears as a coefficient to such a product (which may be the whole or only a part of the complete product, with which the seminvariant is associated), will be capable of reduction.

Ex. gr. for $\theta=2$, $(\sigma_1\alpha_1+\sigma_2\alpha_2)^w$; either $\sigma_1$ or $\sigma_2$ will vanish if $\sigma_1\sigma_2=A_2=0$; but every term, in the development, is of the form $(222\ldots)A_2^{\kappa}$ and therefore vanishes; so that none are left to undergo reduction. Therefore every form of degree 2, except of course that one whose weight is zero, is a perpetuant. The generating function is $\dfrac{z^2}{1-z^2}$.

For $\theta=3$, $(\sigma_1\alpha_1+\sigma_2\alpha_2+\sigma_3\alpha_3)^w$; the condition is clearly $\sigma_1\sigma_2\sigma_3=A_3=0$, and since every seminvariant, of proper degree 3, is associated, as coefficient, with a product containing $A_3$, all such are perpetuants. The general form is $(3^{\kappa+1}2^{\lambda})$ and the generating function $\dfrac{z^3}{(1-z^2)(1-z^3)}$.

For $\theta=4$, $(\sigma_1\alpha_1+\sigma_2\alpha_2+\sigma_3\alpha_3+\sigma_4\alpha_4)^w$; the condition is
$$\sigma_1\sigma_2\sigma_3\sigma_4(\sigma_1+\sigma_2)(\sigma_1+\sigma_3)(\sigma_1+\sigma_4)=A_4A_3=0.$$
Hence every product of $A_2, A_3, A_4$, which contains the product $A_4A_3$ disappears before reduction; this means that every seminvariant, whose partition contains the parts 4, 3, is a perpetuant. The general form of perpetuant is $(4^{\kappa+1}3^{\lambda+1}2^{\mu})$ and the generating function
$$\frac{z^7}{(1-z^2)(1-z^3)(1-z^4)}.$$
In general when $\theta$ is even and $=2\phi$, the condition is
$$\sigma_1\sigma_2\ldots\sigma_{2\phi}\,\Pi(\sigma_1+\sigma_2)\,\Pi(\sigma_1+\sigma_2+\sigma_3)\ldots\Pi(\sigma_1+\sigma_2+\ldots+\sigma_{\phi})=0;$$
and we can determine the lowest weight of a perpetuant; the degree in the quantities $\sigma$ is
$$\binom{2\phi}{1}+\binom{2\phi}{2}+\ldots+\binom{2\phi}{\phi-1}+\frac12\binom{2\phi}{\phi}=2^{2\phi-1}-1=2^{\theta-1}-1.$$
Again, if $\theta$ is uneven $=2\phi+1$, the condition is
$$\sigma_1\sigma_2\ldots\sigma_{2\phi+1}\,\Pi(\sigma_1+\sigma_2)\,\Pi(\sigma_1+\sigma_2+\sigma_3)\ldots\Pi(\sigma_1+\sigma_2+\ldots+\sigma_{\phi})=0;$$
and the degree, in the quantities $\sigma$, is
$$\binom{2\phi+1}{1}+\binom{2\phi+1}{2}+\ldots+\binom{2\phi+1}{\phi}=2^{2\phi}-1=2^{\theta-1}-1.$$
Hence the lowest weight of a perpetuant is $2^{\theta-1}-1$, when $\theta$ is $>2$. The generating function is thus
$$\frac{z^{2^{\theta-1}-1}}{(1-z^2)(1-z^3)(1-z^4)\ldots(1-z^{\theta})}.$$
The actual form of a perpetuant of degree $\theta$ has been shown by MacMahon to be
$$\left(\theta^{\kappa_{\theta}+1},\ (\theta-1)^{\kappa_{\theta-1}+1},\ (\theta-2)^{\kappa_{\theta-2}+2},\ \ldots,\ 3^{\kappa_3+2^{\theta-4}},\ 2^{\kappa_2}\right),$$
$\kappa_{\theta}, \kappa_{\theta-1}, \ldots \kappa_2$ being given any zero or positive integer values.
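The agreement between the degree of the reducibility condition, the lowest weight $2^{\theta-1}-1$, and the weight of the simplest partition of MacMahon's form can be tabulated. The sketch below is illustrative only; it assumes that the exponent added to the part $\theta-j$ is $2^{j-1}$ for $j\geq 1$, which is the reading consistent with the exponents $+1$ on $\theta-1$ and $+2^{\theta-4}$ on the part 3, and the function names are hypothetical.

```python
from math import comb

def condition_degree(theta):
    """Degree in the sigmas of the reducibility condition, as summed in the text."""
    phi, uneven = divmod(theta, 2)
    if uneven:
        return sum(comb(theta, k) for k in range(1, phi + 1))
    # theta even: the phi-fold sums pair off with their complements, hence the half
    return sum(comb(theta, k) for k in range(1, phi)) + comb(theta, phi) // 2

def simplest_perpetuant_weight(theta):
    """Weight of (theta, theta-1, (theta-2)^2, ..., 3^(2^(theta-4))), all kappas zero."""
    weight = theta
    for j in range(1, theta - 2):            # parts theta-1 down to 3
        weight += (theta - j) * 2 ** (j - 1)
    return weight

table = [(theta, condition_degree(theta), simplest_perpetuant_weight(theta))
         for theta in range(3, 9)]
```

For $\theta=3,4,5,6$ the table gives the weights 3, 7, 15, 31, matching the exponents of $z$ in the numerators of the generating functions quoted for those degrees.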
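Forms.-Taking the two forms to be
$$a_0x_1^p+pa_1x_1^{p-1}x_2+p(p-1)a_2x_1^{p-2}x_2^2+\ldots+p!\,a_px_2^p,$$
$$b_0x_1^q+qb_1x_1^{q-1}x_2+q(q-1)b_2x_1^{q-2}x_2^2+\ldots+q!\,b_qx_2^q,$$
every leading coefficient of a simultaneous covariant vanishes by the operation of
$$\Omega_a+\Omega_b=a_0\frac{\partial}{\partial a_1}+a_1\frac{\partial}{\partial a_2}+\ldots+a_{p-1}\frac{\partial}{\partial a_p}+b_0\frac{\partial}{\partial b_1}+b_1\frac{\partial}{\partial b_2}+\ldots+b_{q-1}\frac{\partial}{\partial b_q}.$$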
Observe that we may employ the principle of suffix diminution to obtain from any seminvariant one appertaining to a $(p-1)^{ic}$ and a $(q-1)^{ic}$, and that suffix augmentation produces a portion of a higher seminvariant, the degree in each case remaining unaltered. Remark, too, that we are in association with non-unitary symmetric functions of two systems of quantities which will be denoted by partitions in brackets $(\ )_a$, $(\ )_b$ respectively. Solving the equation
$$(\Omega_a+\Omega_b)u=0$$
by the ordinary theory of linear partial differential equations, we obtain $p+q+1$ independent solutions, of which $p$ appertain to $\Omega_au=0$, $q$ to $\Omega_bu=0$; the remaining one is $J_{ab}=a_0b_1-a_1b_0$, the leading coefficient of the Jacobian of the two forms. This constitutes an algebraically complete system, and, in terms of its members, all seminvariants can be rationally expressed. A similar theorem holds in the case of any number of binary forms, the mixed seminvariants being derived from the Jacobians of the several pairs of forms. If the seminvariant be of degree $\theta, \theta'$ in the coefficients, the forms of orders $p, q$ respectively, and the weight $w$, the degree of the covariant in the variables will be $p\theta+q\theta'-2w=\epsilon$, an easy generalization of the theorem connected with a single form. The general term of a seminvariant of degree $\theta, \theta'$ and weight $w$ will be
$$a_0^{\rho_0}a_1^{\rho_1}a_2^{\rho_2}\ldots a_p^{\rho_p}\,b_0^{\sigma_0}b_1^{\sigma_1}b_2^{\sigma_2}\ldots b_q^{\sigma_q},$$
where $\sum\rho_s=\theta$, $\sum\sigma_s=\theta'$ and $\sum s\rho_s+\sum s\sigma_s=w$.
The number of such terms is the number of partitions of $w$ into $\theta+\theta'$ parts, the part magnitudes, in the two portions, being limited not to exceed $p$ and $q$ respectively. Denote this number by $(w;\theta,p;\theta',q)$. The number of linearly independent seminvariants of the given type will then be denoted by $(w;\theta,p;\theta',q)-(w-1;\theta,p;\theta',q)$; and will be given by the coefficient of $a^{\theta}b^{\theta'}z^w$ in
$$\frac{1-z}{(1-a)(1-az)(1-az^2)\ldots(1-az^p)\,(1-b)(1-bz)(1-bz^2)\ldots(1-bz^q)},$$
that is, by the coefficient of $z^w$ in
$$\frac{(1-z^{p+1})(1-z^{p+2})\ldots(1-z^{p+\theta})\,(1-z^{q+1})(1-z^{q+2})\ldots(1-z^{q+\theta'})}{(1-z)(1-z^2)(1-z^3)\ldots(1-z^{\theta})\,(1-z^2)(1-z^3)\ldots(1-z^{\theta'})};$$
which preserves its expression when $\theta$ and $p$ and $\theta'$ and $q$ are separately or simultaneously interchanged.
Taking the first generating function, and writing $az^p$, $bz^q$, $1/z^2$ for $a$, $b$ and $z$ respectively, we obtain the coefficient of $a^{\theta}b^{\theta'}z^{p\theta+q\theta'-2w}$, that is of $a^{\theta}b^{\theta'}z^{\epsilon}$, in
$$\frac{1-z^{-2}}{(1-az^p)(1-az^{p-2})\ldots(1-az^{-p+2})(1-az^{-p})\,(1-bz^q)(1-bz^{q-2})\ldots(1-bz^{-q+2})(1-bz^{-q})},$$
the unreduced generating function which enumerates the covariants of degrees $\theta, \theta'$ in the coefficients and order $\epsilon$ in the variables. Thus, for two linear forms, $p=q=1$, we find
$$\frac{1-z^{-2}}{(1-az)(1-az^{-1})(1-bz)(1-bz^{-1})},$$
the positive part of which is
$$\frac{1}{(1-az)(1-bz)(1-ab)},$$
establishing the ground forms of degrees-order $(1,0;1)$, $(0,1;1)$, $(1,1;0)$, viz: the linear forms themselves and their Jacobian $J_{ab}$. Similarly, for a linear and a quadratic, $p=1$, $q=2$, and the reduced form is found to be
$$\frac{1-a^2b^2z^2}{(1-az)(1-bz^2)(1-abz)(1-b^2)(1-a^2b)},$$
where the denominator factors indicate the forms themselves, their Jacobian, the invariant of the quadratic and their resultant; connected, as shown by the numerator, by a syzygy of degrees-order $(2,2;2)$.
The complete theory of the perpetuants appertaining to two or more forms of infinite order has not yet been established. For two forms the seminvariants of degrees 1, 1 are enumerated by $\dfrac{1}{1-z}$, and the only one which is reducible is $a_0b_0$ of weight zero; hence the perpetuants of degrees 1, 1 are enumerated by
$$\frac{1}{1-z}-1=\frac{z}{1-z},$$
and the series is evidently
$$a_0b_1-a_1b_0,\qquad a_0b_2-a_1b_1+a_2b_0,\qquad a_0b_3-a_1b_2+a_2b_1-a_3b_0,\ \ldots$$
one for each of the weights 1, 2, 3, ... ad infin. For the degrees 1, 2, the asyzygetic forms are enumerated by $\dfrac{1}{(1-z)(1-z^2)}$, and the actual forms for the first three weights are
$$a_0b_0^2,\quad (a_0b_1-a_1b_0)b_0,\quad (a_0b_2-a_1b_1+a_2b_0)b_0,\quad a_0(b_1^2-2b_0b_2),$$
$$(a_0b_3-a_1b_2+a_2b_1-a_3b_0)b_0,\quad a_0(b_1b_2-3b_0b_3)-a_1(b_1^2-2b_0b_2);$$
amongst these forms are included all the asyzygetic forms of degrees 1, 1, multiplied by $b_0$, and also all the perpetuants of the second binary form multiplied by $a_0$; hence we have to subtract from the generating function $\dfrac{1}{1-z}$ and $\dfrac{z^2}{1-z^2}$, and obtain the generating function of perpetuants of degrees 1, 2:
$$\frac{1}{(1-z)(1-z^2)}-\frac{1}{1-z}-\frac{z^2}{1-z^2}=\frac{z^3}{(1-z)(1-z^2)}.$$
The first perpetuant is the last seminvariant written, viz.: $a_0(b_1b_2-3b_0b_3)-a_1(b_1^2-2b_0b_2)$, or, in partition notation, $a_0(21)_b-(1)_a(2)_b$; and, in this form, it is at once seen to satisfy the partial differential equation. It is important to notice that the expression
$$(\theta)_a(\theta'1^s)_b-(\theta1)_a(\theta'1^{s-1})_b+(\theta1^2)_a(\theta'1^{s-2})_b-\ldots\pm(\theta1^s)_a(\theta')_b$$
denotes a seminvariant, if $\theta, \theta'$, be neither of them unity, for, after operation, the terms destroy one another in pairs: when $\theta=0$, $(\theta)_a$ must be taken to denote $a_0$ and so for $\theta'$. In general it is a seminvariant of degrees $\theta, \theta'$, and weight $\theta+\theta'+s$; to this there is an exception, viz., when $\theta=0$, or when $\theta'=0$, the corresponding partial degrees are 1 and 1. When $\theta=\theta'=0$, we have the general perpetuant of degrees 1, 1. There is a still more general form of seminvariant; we may have instead of $\theta, \theta'$ any collections of non-unitary integers not exceeding $\theta, \theta'$ in magnitude respectively,
$$(2^{\lambda_2}3^{\lambda_3}\ldots\theta^{\lambda_{\theta}})_a(1^s2^{\mu_2}3^{\mu_3}\ldots\theta'^{\mu_{\theta'}})_b-(1\,2^{\lambda_2}3^{\lambda_3}\ldots\theta^{\lambda_{\theta}})_a(1^{s-1}2^{\mu_2}3^{\mu_3}\ldots\theta'^{\mu_{\theta'}})_b+(1^22^{\lambda_2}3^{\lambda_3}\ldots\theta^{\lambda_{\theta}})_a(1^{s-2}2^{\mu_2}3^{\mu_3}\ldots\theta'^{\mu_{\theta'}})_b-\ldots(-)^s(1^s2^{\lambda_2}3^{\lambda_3}\ldots\theta^{\lambda_{\theta}})_a(2^{\mu_2}3^{\mu_3}\ldots\theta'^{\mu_{\theta'}})_b,$$
is a seminvariant; and since these forms are clearly enumerated by
$$\frac{1}{(1-z)(1-z^2)\ldots(1-z^{\theta})\,(1-z^2)(1-z^3)\ldots(1-z^{\theta'})},$$
an expression which also enumerates the asyzygetic seminvariants, we may regard the form, written, as denoting the general form of asyzygetic seminvariant; a very important conclusion. For the case in hand, from the simplest perpetuant of degrees 1, 2, we derive the perpetuants of weight $w$,
$$a_0(21^{w-2})_b-a_1(21^{w-3})_b+a_2(21^{w-4})_b-\ldots\pm a_{w-2}(2)_b,$$
$$a_0(2^21^{w-4})_b-a_1(2^21^{w-5})_b+a_2(2^21^{w-6})_b-\ldots\pm a_{w-4}(2^2)_b,$$
$$a_0(2^31^{w-6})_b-a_1(2^31^{w-7})_b+a_2(2^31^{w-8})_b-\ldots\pm a_{w-6}(2^3)_b,$$
a series of $\tfrac12(w-2)$ or of $\tfrac12(w-1)$ forms according as $w$ is even or uneven. Their number for any weight $w$ is the number of ways of composing $w-3$ with the parts 1, 2, and thus the generating function is verified.
We cannot, by this method, easily discuss the perpetuants of degrees 2, 2, because a syzygy presents itself as early as weight 2. It is better now to proceed by the method of Stroh.
We have the symbolic expression of a seminvariant,
$$(\sigma_1\alpha_1+\sigma_2\alpha_2+\ldots+\sigma_{\theta}\alpha_{\theta}+\tau_1\beta_1+\tau_2\beta_2+\ldots+\tau_{\theta'}\beta_{\theta'})^w,$$
where $\alpha_1^s=\alpha_2^s=\ldots=a_s$, $\beta_1^s=\beta_2^s=\ldots=b_s$; and $\sigma_1+\sigma_2+\ldots+\sigma_{\theta}+\tau_1+\tau_2+\ldots+\tau_{\theta'}=0$.

Proceeding as we did in the case of the single binary form we find that for a given total degree $\theta+\theta'$, the condition which expresses reducibility is of total degree $2^{\theta+\theta'-1}-1$ in the coefficients $\sigma$ and $\tau$; combining this with the knowledge of the generating function of asyzygetic forms of degrees $\theta, \theta'$, we find that the perpetuants of these degrees are enumerated by
$$\frac{z^{2^{\theta+\theta'-1}-1}}{(1-z)(1-z^2)(1-z^3)\ldots(1-z^{\theta})\,(1-z^2)(1-z^3)\ldots(1-z^{\theta'})};$$
and this is true for $\theta+\theta'=2$ as well as for other values of $\theta+\theta'$ (compare the case of the single binary form).

Observe that, if there be more than two binary forms, the weight of the simplest perpetuant of degrees $\theta, \theta', \theta'', \ldots$ is $2^{\theta+\theta'+\theta''+\ldots-1}-1$, as can be seen by reasoning of a similar kind.

To obtain information concerning the actual forms of the perpetuants, write
$$(1+\sigma_1x)(1+\sigma_2x)\ldots(1+\sigma_{\theta}x)=1+A_1x+A_2x^2+\ldots+A_{\theta}x^{\theta},$$
$$(1+\tau_1x)(1+\tau_2x)\ldots(1+\tau_{\theta'}x)=1+B_1x+B_2x^2+\ldots+B_{\theta'}x^{\theta'},$$
where $A_1+B_1=0$.

For the case $\theta=1$, $\theta'=1$, the condition is $\sigma_1\tau_1=A_1B_1=0$, which since $A_1+B_1=0$, is really a condition of weight unity. For $w=1$ the form is $A_1a_1+B_1b_1$, which we may write $a_0b_1-a_1b_0=a_0(1)_b-(1)_ab_0$; the remaining perpetuants, enumerated by $\dfrac{z}{1-z}$, have been set forth above.

For the case $\theta=1$, $\theta'=2$, the condition is $\sigma_1\tau_1\tau_2=A_1B_2=0$; and the simplest perpetuant, derived directly from the product $A_1B_2$, is $(1)_a(2)_b-(21)_b$; the remainder of those enumerated by $\dfrac{z^3}{(1-z)(1-z^2)}$ may be represented by the form
$$(1^{\lambda_1+1})_a(2^{\mu_2+1})_b-(1^{\lambda_1})_a(2^{\mu_2+1}1)_b+\ldots\pm(2^{\mu_2+1}1^{\lambda_1+1})_b;$$
$\lambda_1$ and $\mu_2$ each assuming all integer (including zero) values.

For the case $\theta=\theta'=2$, the condition is
$$\sigma_1\sigma_2\tau_1\tau_2(\sigma_1+\sigma_2)(\sigma_1+\tau_1)(\sigma_1+\tau_2)=A_2^2B_1B_2+A_1A_2B_2^2=0.$$
To represent the simplest perpetuant, of weight 7, we may take as base either $A_2^2B_1B_2$ or $A_1A_2B_2^2$, and since $A_1+B_1=0$ the former is equivalent to $A_2^2A_1B_2$ and the latter to $A_2B_1B_2^2$; so that we have, apparently, a choice of four products. $A_2^2B_1B_2$ gives $(2^2)_a(21)_b-(2^21)_a(2)_b$, and $A_1A_2^2B_2$, $(2^21)_a(2)_b-(2^2)_a(21)_b$; these two merely differ in sign; and similarly $A_2B_1B_2^2$ yields $(2)_a(2^21)_b-(21)_a(2^2)_b$, and that due to $A_1A_2B_2^2$ merely differs from it in sign. We will choose from the forms in such manner that the product of letters $A$ is either a power of $A_1$, or does not contain $A_1$; this rule leaves us with $A_2^2B_1B_2$ and $A_2B_1B_2^2$; of these forms we will choose that one which in letters $B$ is earliest in ascending dictionary order; this is $A_2^2B_1B_2$, and our earliest perpetuant is $(2^2)_a(21)_b-(2^21)_a(2)_b$, and thence the general form enumerated by the generating function $\dfrac{z^7}{(1-z)(1-z^2)^2}$ is
$$(2^{\lambda_2+2})_a(2^{\mu_2+1}1^{\mu_1+1})_b-(2^{\lambda_2+2}1)_a(2^{\mu_2+1}1^{\mu_1})_b+\ldots\pm(2^{\lambda_2+2}1^{\mu_1+1})_a(2^{\mu_2+1})_b.$$
For the case $\theta=1$, $\theta'=3$, the condition is
$$\sigma_1\tau_1\tau_2\tau_3(\sigma_1+\tau_1)(\sigma_1+\tau_2)(\sigma_1+\tau_3)=A_1B_3^2+A_1^2B_2B_3=0.$$
By the rules adopted we take $A_1^2B_2B_3$, which gives $(1^2)_a(32)_b-(1)_a(321)_b+a_0(321^2)_b$, the simplest perpetuant of weight 7; and thence the general form enumerated by the generating function $\dfrac{z^7}{(1-z)(1-z^2)(1-z^3)}$, viz:
$$(1^{\lambda_1+2})_a(3^{\mu_3+1}2^{\mu_2+1})_b-\ldots\pm a_0(3^{\mu_3+1}2^{\mu_2+1}1^{\lambda_1+2})_b.$$
For the case $\theta=2$, $\theta'=3$, the condition is
$$\sigma_1\sigma_2\tau_1\tau_2\tau_3(\sigma_1+\sigma_2)(\sigma_1+\tau_1)(\sigma_1+\tau_2)(\sigma_1+\tau_3)(\sigma_2+\tau_1)(\sigma_2+\tau_2)(\sigma_2+\tau_3)\times(\tau_1+\tau_2)(\tau_1+\tau_3)(\tau_2+\tau_3)=0.$$
The calculation results in
$$-A_2^4B_3B_2B_1^2+2A_2^3B_3B_2^2B_1^2-A_2^2B_3B_2^3B_1^2+A_2^4B_3^2B_1-2A_2^3B_3^2B_2B_1-A_2^2B_3^2B_2B_1^3+A_2^2B_3^2B_2^2B_1+A_2B_3^2B_2^2B_1^3+A_2^2B_3^3B_1^2-2A_2B_3^3B_2B_1^2+A_2B_3^4B_1=0.$$
By the rules we select the product $A_2^4B_3B_2B_1^2$, giving the simplest perpetuant of weight 15, viz: $(2^4)_a(321^2)_b-(2^41)_a(321)_b+(2^41^2)_a(32)_b$; and thence the general form
$$(2^{\lambda_2+4})_a(3^{\mu_3+1}2^{\mu_2+1}1^{\mu_1+2})_b-\ldots\pm(2^{\lambda_2+4}1^{\mu_1+2})_a(3^{\mu_3+1}2^{\mu_2+1})_b,$$
due to the generating function $\dfrac{z^{15}}{(1-z)(1-z^2)^2(1-z^3)}$.

For the case $\theta=1$, $\theta'=4$, the condition is
$$\sigma_1\tau_1\tau_2\tau_3\tau_4(\sigma_1+\tau_1)(\sigma_1+\tau_2)(\sigma_1+\tau_3)(\sigma_1+\tau_4)\,\Pi(\tau_s+\tau_t)=0;$$
the calculation gives
$$A_1B_4(A_1^2B_2+A_1B_3+B_4)(-B_3^2-A_1B_2B_3-A_1^2B_4)=0.$$
Selecting the product $A_1^4B_4B_3B_2^2$, we find the simplest perpetuant
$$(1^4)_a(432^2)_b-(1^3)_a(432^21)_b+(1^2)_a(432^21^2)_b-(1)_a(432^21^3)_b+a_0(432^21^4)_b,$$
and thence the general form
$$(1^{\lambda_1+4})_a(4^{\mu_4+1}3^{\mu_3+1}2^{\mu_2+2})_b-\ldots\pm a_0(4^{\mu_4+1}3^{\mu_3+1}2^{\mu_2+2}1^{\lambda_1+4})_b,$$
due to the generating function $\dfrac{z^{15}}{(1-z)(1-z^2)(1-z^3)(1-z^4)}$. The series may be continued, but the calculations soon become very laborious.
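The two conditions of degree 15 can be checked numerically by choosing integer values for the quantities $\sigma$, $\tau$ whose sum is zero. The sketch below is illustrative and rests on one observation that should be read as an assumption of the sketch: with five roots in all, each condition is the product of the roots multiplied by the product of all pairwise sums. The function names and sample values are hypothetical.

```python
from itertools import combinations
from math import prod

def elementary(values, k):
    """k-th elementary symmetric function of the given numbers."""
    return sum(prod(c) for c in combinations(values, k))

def condition(sigmas, taus):
    """Product of all the roots times the product of all pairwise sums."""
    roots = list(sigmas) + list(taus)
    return prod(roots) * prod(x + y for x, y in combinations(roots, 2))

def expansion_2_3(sigmas, taus):
    """The eleven-term expression for degrees 2, 3 in terms of A2, B1, B2, B3."""
    A2 = elementary(sigmas, 2)
    B1, B2, B3 = (elementary(taus, k) for k in (1, 2, 3))
    return (-A2**4*B3*B2*B1**2 + 2*A2**3*B3*B2**2*B1**2 - A2**2*B3*B2**3*B1**2
            + A2**4*B3**2*B1 - 2*A2**3*B3**2*B2*B1 - A2**2*B3**2*B2*B1**3
            + A2**2*B3**2*B2**2*B1 + A2*B3**2*B2**2*B1**3 + A2**2*B3**3*B1**2
            - 2*A2*B3**3*B2*B1**2 + A2*B3**4*B1)

def expansion_1_4(sigmas, taus):
    """The factored expression for degrees 1, 4 in terms of A1, B2, B3, B4."""
    A1 = sigmas[0]
    B2, B3, B4 = (elementary(taus, k) for k in (2, 3, 4))
    return A1*B4*(A1**2*B2 + A1*B3 + B4)*(-B3**2 - A1*B2*B3 - A1**2*B4)

case_2_3 = ((1, 2), (1, 1, -5))        # sigmas, taus; all five sum to zero
case_1_4 = ((3,), (1, 2, 4, -10))      # sigma, taus; all five sum to zero
```

For both sample cases `condition` agrees with the corresponding expansion; the values are chosen so that no pairwise sum vanishes, which keeps the comparison from being trivially zero.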
V. Restricted Substitutions
We may regard the factors of a binary $n^{ic}$ equated to zero as denoting $n$ straight lines through the origin, the co-ordinates being Cartesian and the axes inclined at any angle. Taking the variables to be $x, y$ and effecting the linear transformation $x=\lambda_1X+\mu_1Y$, $y=\lambda_2X+\mu_2Y$, so that
$$\frac{y}{x}=\frac{\lambda_2X+\mu_2Y}{\lambda_1X+\mu_1Y},$$
it is seen that the two lines, on which lie $(x, y)$, $(X, Y)$, have a definite projective correspondence. The linear transformation replaces points on lines through the origin by corresponding points on projectively corresponding lines through the origin; it therefore replaces a pencil of lines by another pencil, which corresponds projectively, and harmonic and other properties of pencils which are unaltered by linear transformation we may expect to find indicated in the invariant system. Or, instead of looking upon a linear substitution as replacing a pencil of lines by a projectively corresponding pencil retaining the same axes of co-ordinates, we may look upon the substitution as changing the axes of co-ordinates retaining the same pencil. Then a binary $n^{ic}$, equated to zero, represents $n$ straight lines through the origin, and the $x, y$ of any line through the origin are given constant multiples of the sines of the angles which that line makes with two fixed lines, the axes of co-ordinates. As new axes of co-ordinates we may take any other pair of lines through the origin, and for the $X, Y$ corresponding to $x, y$ any new constant multiples of the sines of the angles which the line makes with the new axes. The substitution for $x, y$ in terms of $X, Y$ is the most general linear substitution in virtue of the four degrees of arbitrariness introduced, viz. two by the choice of axes, two by the choice of multiples.

If now the $n^{ic}$ denote a given pencil of lines, an invariant is the criterion of the pencil possessing some particular property which is independent alike of the axes and of the multiples, and a covariant expresses that the pencil of lines which it denotes is a fixed pencil whatever be the axes or the multiples.
Besides the invariants and covariants, hitherto studied, there are others which appertain to particular cases of the general linear substitution. Thus what have been called seminvariants are not all of them invariants for the general substitution, but are invariants for the particular substitution $x_1=\lambda_1\xi_1+\mu_1\xi_2$, $x_2=\mu_2\xi_2$. Again, in plane geometry, the most general equations of substitution which change from old axes inclined at $\omega$ to new axes inclined at $\omega'=\beta-\alpha$, and inclined at angles $\alpha, \beta$ to the old axis of $x$, without change of origin, are
$$x=\frac{\sin(\omega-\alpha)X+\sin(\omega-\beta)Y}{\sin\omega},\qquad y=\frac{\sin\alpha\,X+\sin\beta\,Y}{\sin\omega},$$
a transformation of modulus $\dfrac{\sin\omega'}{\sin\omega}$.

The theory of invariants originated in the discussion, by George Boole, of this system so important in geometry. Of the quadratic $ax^2+2bxy+cy^2$, he discovered the two invariants $ac-b^2$, $a-2b\cos\omega+c$, and it may be verified that, if the transformed of the quadratic be $AX^2+2BXY+CY^2$,
$$AC-B^2=\left(\frac{\sin\omega'}{\sin\omega}\right)^2(ac-b^2),\qquad A-2B\cos\omega'+C=\left(\frac{\sin\omega'}{\sin\omega}\right)^2(a-2b\cos\omega+c).$$
The fundamental fact that he discovered was the invariance of $x^2+2\cos\omega\,xy+y^2$, viz.
$$x^2+2\cos\omega\,xy+y^2=X^2+2\cos\omega'\,XY+Y^2,$$
from which it appears that the Boolian invariants of $ax^2+2bxy+cy^2$ are nothing more than the full invariants of the simultaneous quadratics $ax^2+2bxy+cy^2$, $x^2+2\cos\omega\,xy+y^2$, the word invariant including here covariant. In general the Boolian system, of the general $n^{ic}$, is coincident with the simultaneous system of the $n^{ic}$ and the quadratic $x^2+2\cos\omega\,xy+y^2$.

Orthogonal System.-In particular, if we consider the transformation from one pair of rectangular axes to another pair of rectangular axes we obtain an orthogonal system which we will now briefly inquire into. We have $\cos\omega'=\cos\omega=0$ and the substitution
$$x_1=\cos\theta\,X_1-\sin\theta\,X_2,\qquad x_2=\sin\theta\,X_1+\cos\theta\,X_2,$$
with modulus unity. This is called the direct orthogonal substitution, because the sense of rotation from the axis of $X_1$ to the axis of $X_2$ is the same as that from that of $x_1$ to that of $x_2$.
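Boole's two invariants of the quadratic, discussed above for oblique axes, may be verified numerically. The following is a minimal sketch with arbitrarily chosen sample angles and coefficients; the function names are illustrative, and floating-point arithmetic is used, so the comparison is made to within a small tolerance rather than exactly.

```python
import math

def oblique_substitution(omega, alpha, beta):
    """Coefficients (p, q, r, s) of x = pX + qY, y = rX + sY for the change of axes."""
    s_omega = math.sin(omega)
    return (math.sin(omega - alpha) / s_omega, math.sin(omega - beta) / s_omega,
            math.sin(alpha) / s_omega, math.sin(beta) / s_omega)

def transform_quadratic(a, b, c, p, q, r, s):
    """Coefficients (A, B, C) of AX^2 + 2BXY + CY^2 obtained from ax^2 + 2bxy + cy^2."""
    A = a * p * p + 2 * b * p * r + c * r * r
    B = a * p * q + b * (p * s + q * r) + c * r * s
    C = a * q * q + 2 * b * q * s + c * s * s
    return A, B, C

omega, alpha, beta = 1.1, 0.3, 1.6      # sample angles in radians
a, b, c = 2.0, -0.7, 1.5                # sample quadratic
p, q, r, s = oblique_substitution(omega, alpha, beta)
A, B, C = transform_quadratic(a, b, c, p, q, r, s)

omega_new = beta - alpha                # inclination of the new axes
modulus = p * s - q * r                 # should equal sin(omega') / sin(omega)
ratio = (math.sin(omega_new) / math.sin(omega)) ** 2

first_ok = math.isclose(A * C - B * B, ratio * (a * c - b * b), rel_tol=1e-9)
second_ok = math.isclose(A - 2 * B * math.cos(omega_new) + C,
                         ratio * (a - 2 * b * math.cos(omega) + c), rel_tol=1e-9)
```

Both `first_ok` and `second_ok` come out true, and applying `transform_quadratic` to the quadratic with coefficients $(1, \cos\omega, 1)$ returns $(1, \cos\omega', 1)$ to within rounding, which is the fundamental invariance noted by Boole.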
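If the senses of rotation be opposite we have the skew orthogonal substitution

$$x_1 = \cos\theta\,X_1 + \sin\theta\,X_2, \qquad x_2 = \sin\theta\,X_1 - \cos\theta\,X_2,$$

of modulus $-1$. In both cases $\partial/\partial x_1$ and $\partial/\partial x_2$ are cogredient with $x_1$ and $x_2$; for, in the case of direct substitution,

$$\frac{\partial}{\partial x_1} = \cos\theta\,\frac{\partial}{\partial X_1} - \sin\theta\,\frac{\partial}{\partial X_2}, \qquad \frac{\partial}{\partial x_2} = \sin\theta\,\frac{\partial}{\partial X_1} + \cos\theta\,\frac{\partial}{\partial X_2},$$

and for skew substitution

$$\frac{\partial}{\partial x_1} = \cos\theta\,\frac{\partial}{\partial X_1} + \sin\theta\,\frac{\partial}{\partial X_2}, \qquad \frac{\partial}{\partial x_2} = \sin\theta\,\frac{\partial}{\partial X_1} - \cos\theta\,\frac{\partial}{\partial X_2}.$$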
Hence, in both cases, contragrediency and cogrediency are identical, and contravariants are included in covariants. Consider the binary $n^{\text{ic}}$, $(a_1x_1+a_2x_2)^n = a_x^n$, and the direct substitution

$$x_1 = \lambda X_1 - \mu X_2, \qquad x_2 = \mu X_1 + \lambda X_2,$$

where $\lambda^2+\mu^2 = 1$; $\lambda$, $\mu$ replacing $\cos\theta$, $\sin\theta$ respectively. In the notation $a_x = a_1x_1+a_2x_2$, observe that $a_a = a_1^2+a_2^2$, $a_b = a_1b_1+a_2b_2$.

Suppose that $a_x^n = b_x^n = c_x^n = \ldots$ is transformed into $A_X^n = B_X^n = C_X^n = \ldots$;
then of course $(AB) = (ab)$, the fundamental fact which appertains to the theory of the general linear substitution; now here we have additional and equally fundamental facts; for since

$$A_1 = \lambda a_1 + \mu a_2, \qquad A_2 = -\mu a_1 + \lambda a_2,$$

$$A_A = A_1^2 + A_2^2 = (\lambda^2+\mu^2)(a_1^2+a_2^2) = a_a;$$

$$A_B = A_1B_1 + A_2B_2 = (\lambda^2+\mu^2)(a_1b_1+a_2b_2) = a_b;$$

$$(XA) = X_1A_2 - X_2A_1 = (\lambda x_1+\mu x_2)(-\mu a_1+\lambda a_2) - (-\mu x_1+\lambda x_2)(\lambda a_1+\mu a_2) = (\lambda^2+\mu^2)(x_1a_2-x_2a_1) = (xa);$$

showing that, in the present theory, $a_a$, $a_b$, and $(xa)$ possess the invariant property. Since $x_1^2+x_2^2 = x_x$ we have six types of symbolic factors which may be used to form invariants and covariants, viz.

$$(ab),\quad a_a,\quad a_b,\quad (xa),\quad a_x,\quad x_x.$$

The general form of covariant is therefore

$$(ab)^{h_1}(ac)^{h_2}(bc)^{h_3}\ldots\, a_a^{i_1}b_b^{i_2}c_c^{i_3}\ldots\, a_b^{j_1}a_c^{j_2}b_c^{j_3}\ldots \times (xa)^{k_1}(xb)^{k_2}(xc)^{k_3}\ldots\, a_x^{l_1}b_x^{l_2}c_x^{l_3}\ldots\, x_x^{m}$$

$$= (AB)^{h_1}(AC)^{h_2}(BC)^{h_3}\ldots\, A_A^{i_1}B_B^{i_2}C_C^{i_3}\ldots\, A_B^{j_1}A_C^{j_2}B_C^{j_3}\ldots \times (XA)^{k_1}(XB)^{k_2}(XC)^{k_3}\ldots\, A_X^{l_1}B_X^{l_2}C_X^{l_3}\ldots\, X_X^{m}.$$

If this be of order $\epsilon$ and appertain to an $n^{\text{ic}}$,

$$\Sigma k + \Sigma l + 2m = \epsilon,$$

$$h_1+h_2+\ldots+2i_1+j_1+j_2+\ldots+k_1+l_1 = n,$$

$$h_1+h_3+\ldots+2i_2+j_1+j_3+\ldots+k_2+l_2 = n,$$

$$h_2+h_3+\ldots+2i_3+j_2+j_3+\ldots+k_3+l_3 = n;$$

viz., the symbols $a, b, c, \ldots$ must each occur $n$ times. It may denote a simultaneous orthogonal invariant of forms of orders $n_1, n_2, n_3, \ldots$; the symbols must then present themselves $n_1, n_2, n_3, \ldots$ times respectively. The number of different symbols $a, b, c, \ldots$ denotes the degree $\theta$ of the covariant in the coefficients. The coefficients of the covariants are homogeneous, but not in general isobaric functions, of the coefficients of the original form or forms. Of the above general form of covariant there are important transformations due to the symbolic identities:

$$(ab)^2 = a_ab_b - a_b^2; \qquad (xa)^2 = a_ax_x - a_x^2;$$

as a consequence any even power of a determinant factor may be expressed in terms of the other symbolic factors, and any uneven power may be expressed as the product of its first power and a function of the other symbolic factors.
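The invariance of the six types of symbolic factor under the direct substitution, and the two squaring identities, can be spot-checked with exact rational arithmetic. The sketch below is illustrative only: it replaces the umbral symbols $a$, $b$ and the variables $x$ by ordinary number pairs (legitimate here because the relations are polynomial identities in the components), takes $\lambda = 3/5$, $\mu = 4/5$ as an assumed rational rotation, and uses helper names that are not part of the original.

```python
from fractions import Fraction

# A rational point on the unit circle stands in for (cos theta, sin theta).
lam, mu = Fraction(3, 5), Fraction(4, 5)

def bracket(u, v):
    """Determinant factor (uv) = u1 v2 - u2 v1."""
    return u[0]*v[1] - u[1]*v[0]

def inner(u, v):
    """Factor u_v = u1 v1 + u2 v2 (inner(a, x) is a_x, inner(a, a) is a_a)."""
    return u[0]*v[0] + u[1]*v[1]

def direct(u):
    """Image of a pair under the direct substitution:
    U1 = lam u1 + mu u2, U2 = -mu u1 + lam u2 (same rule for symbols and variables)."""
    return (lam*u[0] + mu*u[1], -mu*u[0] + lam*u[1])

# Sample pairs (assumed values) playing the part of a, b and x.
a, b, x = (2, -1), (3, 5), (7, 4)
A, B, X = direct(a), direct(b), direct(x)

# The six factor types (ab), a_a, a_b, (xa), a_x, x_x before and after.
factors_old = [bracket(a, b), inner(a, a), inner(a, b),
               bracket(x, a), inner(a, x), inner(x, x)]
factors_new = [bracket(A, B), inner(A, A), inner(A, B),
               bracket(X, A), inner(A, X), inner(X, X)]
invariant_ok = factors_old == factors_new

# Symbolic identities (ab)^2 = a_a b_b - a_b^2 and (xa)^2 = a_a x_x - a_x^2.
id1_ok = bracket(a, b)**2 == inner(a, a)*inner(b, b) - inner(a, b)**2
id2_ok = bracket(x, a)**2 == inner(a, a)*inner(x, x) - inner(a, x)**2

print(invariant_ok, id1_ok, id2_ok)
```

Using `Fraction` keeps every comparison exact, so the equalities are tested without rounding tolerance.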
Hence in the above general form of covariant we may suppose the exponents $h_1, h_2, h_3, \ldots, k_1, k_2, k_3, \ldots$ of the determinant factors to be, each of them, either zero or unity. Or, if we please, we may leave the determinant factors untouched and consider the exponents $j_1, j_2, j_3, \ldots, l_1, l_2, l_3, \ldots$ to be, each of them, either zero or unity. Or, lastly, we may leave the exponents $h, k, j, l$ untouched and consider the product $a_a^{i_1}b_b^{i_2}c_c^{i_3}\ldots x_x^{m}$ to be reduced either to the form $g_g^{i}$, where $g$ is a symbol of the series $a, b, c, \ldots$, or to a power of $x_x$. To assist us in handling the symbolic products we have not only the identity

$$(ab)c_x + (bc)a_x + (ca)b_x = 0,$$

but also

$$(ab)x_x + (bx)a_x + (xa)b_x = 0,$$

$$(ab)a_c + (bc)a_a + (ca)a_b = 0,$$

and many others which may be derived from these in the manner which will be familiar to students of the works of Aronhold, Clebsch and Gordan. Previous to continuing the general discussion it is useful to have before us the orthogonal invariants and covariants of the binary linear and quadratic forms.
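The three-term identities quoted above are all instances of the single relation $(ab)c_d + (bc)a_d + (ca)b_d = 0$ among any four pairs, and may be tested on random integer data. This is a sketch under stated assumptions: the symbols are replaced by ordinary integer pairs, the seed and sample range are arbitrary, and the function names are introduced here for illustration.

```python
import random

def bracket(u, v):
    """Determinant factor (uv) = u1 v2 - u2 v1."""
    return u[0]*v[1] - u[1]*v[0]

def inner(u, v):
    """Factor u_v = u1 v1 + u2 v2."""
    return u[0]*v[0] + u[1]*v[1]

def three_term_residuals(a, b, c, x):
    """Left-hand sides of the three identities; each should vanish."""
    r1 = bracket(a, b)*inner(c, x) + bracket(b, c)*inner(a, x) + bracket(c, a)*inner(b, x)
    r2 = bracket(a, b)*inner(x, x) + bracket(b, x)*inner(a, x) + bracket(x, a)*inner(b, x)
    r3 = bracket(a, b)*inner(a, c) + bracket(b, c)*inner(a, a) + bracket(c, a)*inner(a, b)
    return r1, r2, r3

rng = random.Random(1911)  # fixed seed so that the run is reproducible

def pair():
    """A random integer pair standing in for a symbol or for the variables."""
    return (rng.randint(-9, 9), rng.randint(-9, 9))

all_vanish = all(three_term_residuals(pair(), pair(), pair(), pair()) == (0, 0, 0)
                 for _ in range(200))

# Reduction of an uneven power: (ab)^3 = (ab) (a_a b_b - a_b^2).
a, b = (2, -1), (3, 5)
cube_ok = bracket(a, b)**3 == bracket(a, b)*(inner(a, a)*inner(b, b) - inner(a, b)**2)

print(all_vanish, cube_ok)
```

The last check illustrates how an uneven power of a determinant factor reduces to its first power multiplied by a function of the other factors.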
For the linear forms $a_0x_1+a_1x_2 = a_x = b_x$ there are four fundamental forms

(i.) $a_x = a_0x_1+a_1x_2$, of degree-order (1, 1),
(ii.) $x_x = x_1^2+x_2^2$, of degree-order (0, 2),
(iii.) $(xa) = a_1x_1-a_0x_2$, of degree-order (1, 1),
(iv.) $a_b = a_0^2+a_1^2$, of degree-order (2, 0),

(iii.) and (iv.) being the linear covariant and the quadrinvariant respectively. Every other concomitant is a rational integral function of these four forms. The linear covariant, obviously the Jacobian of $a_x$ and $x_x$, is the line perpendicular to $a_x$, and the vanishing of the quadrinvariant $a_b$ is the condition that $a_x$ passes through one of the circular points at infinity. In general any pencil of lines, connected with the line $a_x$ by descriptive or metrical properties, has for its equation a rational integral function of the four forms equated to zero.
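The geometrical statements about the linear form may be illustrated by a small computation. The sketch assumes a sample line $3x_1 - 2x_2 = 0$ and a sample point, both chosen arbitrarily, and the function names are hypothetical; it checks that the linear covariant is perpendicular to the line, that it agrees with the Jacobian of $a_x$ and $x_x$ up to the numerical factor $-2$, and that the quadrinvariant vanishes for a line through the circular point $(1, i)$.

```python
def linear_covariant(a0, a1):
    """Coefficients of (xa) = a1 x1 - a0 x2 for the line a0 x1 + a1 x2."""
    return (a1, -a0)

def quadrinvariant(a0, a1):
    """a_b = a0^2 + a1^2."""
    return a0*a0 + a1*a1

a0, a1 = 3, -2                      # assumed sample line 3 x1 - 2 x2 = 0
c0, c1 = linear_covariant(a0, a1)

# The normals (a0, a1) and (c0, c1) have vanishing inner product,
# so the two lines are perpendicular.
perpendicular = a0*c0 + a1*c1 == 0

# Jacobian of a_x and x_x at a sample point:
# d(a_x)/dx1 * d(x_x)/dx2 - d(a_x)/dx2 * d(x_x)/dx1.
x1, x2 = 5, 7
jacobian = a0*(2*x2) - a1*(2*x1)
jacobian_ok = jacobian == -2*(c0*x1 + c1*x2)   # equal to (xa) up to the factor -2

# A line through the circular point (1, i): x1 + i x2 = 0, i.e. a0 = 1, a1 = i.
i = 1j
passes_through = 1*1 + i*i == 0     # a_x vanishes at (x1, x2) = (1, i)
quadrinvariant_vanishes = quadrinvariant(1, i) == 0

print(perpendicular, jacobian_ok, passes_through, quadrinvariant_vanishes)
```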
For the quadratic $a_0x_1^2+2a_1x_1x_2+a_2x_2^2$, we have

(i.) $a_x^2 = a_0x_1^2+2a_1x_1x_2+a_2x_2^2$,
(ii.) $x_x = x_1^2+x_2^2$,
(iii.) $(ab)^2 = 2(a_0a_2-a_1^2)$,
(iv.) $a_a = a_0+a_2$,
(v.) $(xa)a_x = a_1x_1^2+(a_2-a_0)x_1x_2-a_1x_2^2$.

This is the fundamental system; we may, if we choose, replace $(ab)^2$ by $a_b^2 = a_0^2+2a_1^2+a_2^2$, since the identity $a_ab_b-a_b^2 = (ab)^2$ shows the syzygetic relation

$$(a_0+a_2)^2-(a_0^2+2a_1^2+a_2^2) = 2(a_0a_2-a_1^2).$$
There is no linear covariant, since it is impossible to form a symbolic product which will contain $x$ once and at the same time appertain to a quadratic. (v.) is the Jacobian; geometrically it denotes the bisectors of the angles between the lines $a_x^2$, or, as we may say, the common harmonic conjugates of the lines $a_x^2$ and the lines $x_x$. The linear invariant $a_a$ is such that, when equated to zero, it determines the lines $a_x^2$ as harmonically conjugate to the lines $x_x$; or, in other words, it is the condition that $a_x^2$ may denote lines at right angles.
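The fundamental system of the quadratic can be checked in the same spirit. The sketch below assumes the rational rotation $\lambda = 3/5$, $\mu = 4/5$, a sample quadratic and a sample point, all chosen only for illustration, with hypothetical function names; it tests the invariance of $(ab)^2$, $a_a$ and $a_b^2$, the covariance of the Jacobian (v.), the syzygetic relation, and the right-angle condition $a_a = 0$ on the pair of lines $x_1^2 - x_2^2$.

```python
from fractions import Fraction

lam, mu = Fraction(3, 5), Fraction(4, 5)   # rational rotation, lam^2 + mu^2 = 1

def rotate_quadratic(a0, a1, a2):
    """Coefficients after x1 = lam X1 - mu X2, x2 = mu X1 + lam X2."""
    A0 = a0*lam*lam + 2*a1*lam*mu + a2*mu*mu
    A1 = -a0*lam*mu + a1*(lam*lam - mu*mu) + a2*lam*mu
    A2 = a0*mu*mu - 2*a1*lam*mu + a2*lam*lam
    return A0, A1, A2

def invariants(a0, a1, a2):
    """(ab)^2, a_a and a_b^2 expressed through the real coefficients."""
    return (2*(a0*a2 - a1*a1), a0 + a2, a0*a0 + 2*a1*a1 + a2*a2)

def jacobian(q, x):
    """Covariant (v.): a1 x1^2 + (a2 - a0) x1 x2 - a1 x2^2."""
    a0, a1, a2 = q
    return a1*x[0]**2 + (a2 - a0)*x[0]*x[1] - a1*x[1]**2

q = (2, -3, 5)                   # assumed sample quadratic 2 x1^2 - 6 x1 x2 + 5 x2^2
Q = rotate_quadratic(*q)
x = (7, 4)
X = (lam*x[0] + mu*x[1], -mu*x[0] + lam*x[1])   # the same point in the new variables

invariants_ok = invariants(*q) == invariants(*Q)
covariant_ok = jacobian(q, x) == jacobian(Q, X)

# Syzygy: a_a^2 - a_b^2 = (ab)^2.
d2, aa, ab2 = invariants(*q)
syzygy_ok = aa**2 - ab2 == d2

# a_a = 0 as the condition of perpendicularity: x1^2 - x2^2 = (x1 - x2)(x1 + x2).
right_angle_ok = invariants(1, 0, -1)[1] == 0

print(invariants_ok, covariant_ok, syzygy_ok, right_angle_ok)
```

For the sample quadratic the three invariants take the values 2, 7 and 47, so that the syzygy reads $49 - 47 = 2$.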
Salmon, Lessons Introductory to the Modern Higher Algebra (Dublin, 1885); E. B. Elliott, Algebra of Quantics (Oxford, 1895); F. Brioschi, Teoria dei Covarianti (Rome, 1861); W. Fiedler, Die Elemente der neueren Geometrie und der Algebra der binären Formen (Leipzig, 1862); A. Clebsch, Theorie der binären algebraischen Formen (Leipzig, 1872); Vorlesungen über Geometrie (Leipzig, 1875); Faà de Bruno, Théorie des formes binaires (Turin, 1876); P. Gordan, Vorlesungen über Invariantentheorie, Bd. i. “Determinanten” (Leipzig, 1885); Bd. ii. “Binäre Formen” (Leipzig, 1887); G. Rubini, Teoria delle forme in generale, e specialmente delle binarie (Lecce, 1886); E. Study, Methoden zur Theorie der ternären Formen (Leipzig, 1889); Lie, Theorie der Transformationsgruppen (Leipzig, 1888-1890); Franz Meyer, Bericht über den gegenwärtigen Stand der Invariantentheorie; Jahresbericht der Deutschen Mathematiker-Vereinigung, Bd. i. (Berlin, 1892); Encyklopädie der mathematischen Wissenschaften, Bd. i., Heft 3, 4, by Heinrich Burkhardt and Franz Meyer (Leipzig, 1899);
J. H. Grace and A. Young, The Algebra of Invariants (Cambridge, 1903).
- The elementary theory is given in the article Determinant.