On a Class of Differential Equations, Including Those Which Occur in Dynamical Problems.--Part II
Author(s)
W. F. Donkin
Year
1855
Volume
145
Pages
61 pages
Language
en
Journal
Philosophical Transactions of the Royal Society of London
Full Text (OCR)
XV. On a Class of Differential Equations, including those which occur in Dynamical Problems.—Part II. By W. F. Donkin, M.A., F.R.S., F.R.A.S., Savilian Professor of Astronomy in the University of Oxford.
Received February 17,—Read March 22, 1855.
The following paper forms the continuation and conclusion of one on the same subject presented to the Royal Society last year, and printed in the Philosophical Transactions for 1854. I have however put it, as far as possible, in such a form as to be independently intelligible.
The fourth Section (the first of this Part) contains a recapitulation of some of the most important results of the former Part, in the form of seven theorems, here enunciated without demonstration.
In the fifth Section the method of the variation of elements is treated under that aspect which belongs to it in connexion with the general subject. It is applied, by way of example, to deduce the expressions for the variations of the elliptic elements of a planet's orbit from the results of art. 30 (Part I.), on undisturbed elliptic motion; this example was chosen, partly because the resulting expressions are required in a future section, and partly for the sake of incidentally calling attention to a fallacy which has been, perhaps, often committed, and certainly seldom noticed. The same method, under a slightly different and possibly new point of view, is applied, as a second example, to the problem of the motion of a free simple pendulum, omitting the effect of the earth's rotation. I believe the methods of this paper might be advantageously employed in the treatment of that general form of the problem of a free pendulum which has been considered by Professor Hansen in his Prize Essay. I was unwilling, however, to attempt what might have turned out to be merely an unconscious plagiarism, without having seen the Essay in question, of which I only succeeded in obtaining a copy on the day of writing this preface. As I now perceive that the investigation would be quite independent, I hope to enter upon it at some future time.
The sixth Section contains some general theorems concerning the transformation of systems of differential equations of the form considered in this paper, by the substitution of new variables. The most important case consists in the transformation from fixed to moving axes of coordinates, in dynamical problems. Some of the results are, I think, interesting, and perhaps new.
The seventh and last Section contains an application of the preceding theorems, in connexion with the variation of elements, to the transformation of the differential
equations of the planetary theory. This investigation, if interesting at all, will probably be so to the mathematician rather than to the astronomer. I think, however, that if the theories of physical astronomy were more frequently treated rigorously and symmetrically, apart from any approximate integrations; and if, when the latter are introduced, more care were taken to give a clear and exact view of the nature of the reasoning employed, it might be possible to draw the attention and secure the cooperation of a class of mathematicians who now may well be excused, if, after a slight trial, they turn from the subject in disgust, and prefer to expatiate in those beautiful fields of speculation which are offered to them by other branches of modern geometry and analysis.
The contents of the two last Sections are more or less closely connected with the subjects of various memoirs by other writers, especially Professor Hansen and the Rev. B. Bronwin. I cannot pretend to that degree of acquaintance with them which would enable me to give an exact statement of the amount of novelty to be found in my own researches. I believe it is enough to justify me in offering them to the Society; beyond this I make no claim.
Oxford, Feb. 15, 1855.
Section IV.
49. The following theorems were demonstrated in the former part of this essay, and are recapitulated here for convenience of reference. (As before, total differentiation with respect to the independent variable $t$ will, in general, be denoted by accents, which will be used for no other purpose.)
Theorem I.—If $X$ be a function of $n$ variables $x_1, x_2, \ldots x_n$, and if $y_1, y_2, \ldots y_n$ be $n$ other variables connected with the former by the $n$ equations
$$\frac{dX}{dx_1} = y_1, \frac{dX}{dx_2} = y_2, \ldots \frac{dX}{dx_n} = y_n, \ldots \ldots \ldots \ldots (50.)$$
then will the values of $x_1, x_2, \ldots x_n$, expressed by means of these equations in terms of $y_1, \ldots y_n$, be of the form
$$x_1 = \frac{dY}{dy_1}, x_2 = \frac{dY}{dy_2}, \ldots x_n = \frac{dY}{dy_n}; \ldots \ldots \ldots \ldots (51.)$$
and if $p$ be any other quantity explicitly contained in $X$, then also
$$\frac{dX}{dp} + \frac{dY}{dp} = 0 \ldots \ldots \ldots \ldots \ldots \ldots \ldots \ldots (52.)$$
(the differentiation with respect to $p$ being in each case performed only so far as $p$ appears explicitly in the function).
The value of $Y$ is given by the equation
$$Y = -(X) + (x_1)y_1 + (x_2)y_2 + \ldots + (x_n)y_n, \ldots \ldots \ldots \ldots (53.)$$
where the brackets indicate that $x_1 \ldots x_n$ are supposed to be expressed in terms of $y_1 \ldots y_n$ (arts. 2, 3.).
Theorem II.—Suppose the function X to contain explicitly, besides the n variables \(x_1 \ldots x_n\), another variable \(t\), and also n constants \(a_1, a_2, \ldots a_n\); and in addition to the equations (50.), let the following be assumed:
\[
\frac{dX}{da_1} = b_1, \ldots \frac{dX}{da_n} = b_n,
\]
where \(b_1, \ldots b_n\) are n other constants; so that, by virtue of the \(2n\) equations (50.), (54.), the \(2n\) variables \(x_1 \ldots x_n, y_1 \ldots y_n\), may be considered as functions of the \(2n\) constants \(a_1, \ldots a_n, b_1, \ldots b_n\), and \(t\). Then if from the equations (50.), (54.), and their total differential coefficients with respect to \(t\), the \(2n\) constants be eliminated, there will result the following \(2n\) simultaneous differential equations of the first order; viz.
\[
x'_i = \frac{dZ}{dy_i}, \quad y'_i = -\frac{dZ}{dx_i},
\]
where \(Z\) is a function of \(x_1 \ldots x_n, y_1 \ldots y_n\) (which will in general also contain \(t\) explicitly), and is given by the equation
\[
Z = -\left(\frac{dX}{dt}\right).
\]
In this equation \(\frac{dX}{dt}\) represents the partial differential coefficient of X taken with respect to \(t\) so far as \(t\) appears explicitly in the original expression for X in terms of \(x_1 \ldots x_n, a_1 \ldots a_n\) and \(t\); and the brackets indicate that \(a_1, \ldots a_n\) are afterwards to be expressed in terms of the variables by means of the equations (50.), (arts. 5, 6.)
Theorem III.—Let the supposition that the \(2n\) variables \(x_1 \ldots x_n, y_1 \ldots y_n\) are expressed in terms of the \(2n\) constants and \(t\), be called Hypothesis I.; and the converse supposition that \(a_1 \ldots a_n, b_1 \ldots b_n\) are expressed in terms of the \(2n\) variables and \(t\), Hypothesis II.; then will the following relations subsist:
\[
\begin{align*}
\frac{dx_i}{da_j} &= -\frac{db_j}{dy_i}, \quad \frac{dx_i}{db_j} = \frac{da_j}{dy_i}, \\
\frac{dy_i}{da_j} &= \frac{db_j}{dx_i}, \quad \frac{dy_i}{db_j} = -\frac{da_j}{dx_i}.
\end{align*}
\]
(In each of these equations the first member refers to Hyp. I., and the second to Hyp. II.; and since there is no connexion between the indices of the variables and those of the constants, the case of \(i=j\) has no peculiarity.)
Theorem IV.—Let the symbol \([p, q]\) be an abbreviation for the expression
\[
\Sigma_i \left( \frac{dp}{dy_i} \frac{dq}{dx_i} - \frac{dp}{dx_i} \frac{dq}{dy_i} \right)
\]
(where \(p, q\) are any functions of the \(2n\) variables, which may also contain any other quantities explicitly; and the differentiations are performed only so far as \(x_1, \&c., y_1, \&c.\) appear explicitly in \(p, q\)); then if \(a_1, \ldots a_n, b_1, \ldots b_n\) be expressed (Hyp. II.) in terms of the \(2n\) variables and \(t\), the following equations subsist identically:
\[
[a_i, b_i] = -[b_i, a_i] = 1, \quad [a_i, b_j] = [a_i, a_j] = [b_i, b_j] = 0.
\]
(i being different from \( j \)); and obviously in all cases
\[
[p, q] = -[q, p], \quad \text{and} \quad [p, p] = 0 \quad (\text{art. 9.})
\]
**Theorem V.**—If \( u, v \) be either (1) any two functions whatever of the \( 2n \) constants \( a_1, \&c., b_1, \&c. \), or (2) any two functions whatever of the \( 2n \) variables \( x_1, \&c., y_1, \&c. \) (which may in either case also contain \( t \) explicitly)*, then
\[
\Sigma_i \left\{ \frac{du}{dy_i} \frac{dv}{dx_i} - \frac{du}{dx_i} \frac{dv}{dy_i} \right\} = \Sigma_i \left\{ \frac{du}{da_i} \frac{dv}{db_i} - \frac{du}{db_i} \frac{dv}{da_i} \right\}. \quad \ldots \ldots \ldots \ldots \quad (59.)
\]
(When \( u, v \) represent functions of the constants, the differential coefficients in the first member of this equation refer to Hyp. II.; and, when functions of the variables, those in the second member refer to Hyp. I.) (art. 10.)
**Theorem VI.**—Let \( x_1, \ldots x_n, y_1, \ldots y_n \) be \( 2n \) variables concerning which no supposition is made except that they are connected by \( n \) equations of the form
\[
a_i = \varphi_i(x_1, x_2, \ldots x_n, y_1, y_2, \ldots y_n) \quad \ldots \ldots \ldots \ldots \quad (a.)
\]
(where the functions on the right are only subject to the condition that the \( n \) equations (a.) shall be algebraically sufficient to determine \( y_1, \ldots y_n \) in terms of \( x_1, \ldots x_n, a_1, \&c. \), and may contain explicitly any other quantities besides \( x_1, \&c., y_1, \&c. \)).
Then, if by means of the equations (a.) the \( n \) variables \( y_1, y_2, \ldots y_n \) be expressed as functions of \( x_1, x_2, \&c., a_1, \&c. \); in order that the \( \frac{n(n-1)}{2} \) conditions
\[
\frac{dy_i}{dx_j} = \frac{dy_j}{dx_i}
\]
may subsist identically, it is necessary and sufficient that each of the \( \frac{n(n-1)}{2} \) expressions \([a_i, a_j]\) vanish identically.
**Theorem VII.**—Let \( Z \) be any function whatever of \( 2n \) variables \( x_1, \ldots x_n, y_1, \ldots y_n \), and \( t \). If of the system of \( 2n \) simultaneous differential equations of the first order
\[
x'_i = \frac{dZ}{dy_i}, \quad y'_i = -\frac{dZ}{dx_i} \quad \ldots \ldots \ldots \ldots \quad (I.)
\]
there be given \( n \) integrals involving \( n \) arbitrary constants \( a_1, a_2, \ldots a_n \), so that each of these constants may be expressed as a function of the variables \( x_1, \&c., y_1, \&c. \) (with or without \( t \)); then if the \( \frac{n(n-1)}{2} \) conditions \([a_i, a_j] = 0\) subsist identically, the remaining \( n \) integrals may be found, as follows. By means of the \( n \) given integrals let the \( n \) variables \( y_1 \ldots y_n \) be expressed in terms of \( x_1, \&c., a_1, \&c. \); and let \((Z)\) be what \( Z \) becomes when \( y_1 \ldots y_n \) are thus expressed. These values of \( y_1, y_2 \ldots y_n \) and \(- (Z)\), will be the partial differential coefficients with respect to \( x_1, x_2, \ldots x_n \) and \( t \), of one and the same function; call this function \( X \), then, since its partial differential coefficients are
* It was inadvertently stated in art. 10, that \( u, v \) must not contain \( t \) explicitly. But it is evident that no such limitation is implied in the demonstration of the theorem. The preceding theorem is obviously a particular case of this; namely, the case in which \( u = a_j, v = b_j \).
all given (by the equations \( \frac{dX}{dx_i} = y_i, \frac{dX}{dt} = -(Z) \)), X may be found by simple integration, and is therefore to be considered a given function of \( x_1, \ldots x_n, a_1, \ldots a_n \) and \( t \). The remaining \( n \) integrals are then given by the \( n \) equations
\[
\frac{dX}{da_i} = b_i,
\]
\( b_1, \ldots b_n \) being \( n \) new arbitrary constants.
[On the relation between this theorem and the theories of Sir W. R. Hamilton and Jacobi, see arts. 15–20.]
50. Other results established in the former part will be referred to as occasion may require. To the theorems enunciated in the preceding article, the following may now be added.
Returning to the equations (50.), (54.), (55.), we may observe, that if, in the first members of (55.), \( x_i, y_i \) be supposed expressed in terms of \( a_1, \&c., b_1, \&c. \) and \( t \), then \( \frac{dx_i}{dt}, \frac{dy_i}{dt} \) may be written instead of \( x'_i, y'_i \); since on this hypothesis the total differential coefficients of \( x_i, y_i \) are obtained by differentiating with respect to \( t \) as it appears explicitly. We have therefore
\[
\frac{dx_i}{dt} = \frac{dZ}{dy_i}, \quad \frac{dy_i}{dt} = -\frac{dZ}{dx_i},
\]
where the first members refer to Hyp. I., and the second to Hyp. II. But since the equations (50.), (54.) involve \( a, b \), exactly in the same way as they involve \( x, y \), it is obvious that the same reasoning which leads to the equations just written, would lead, mutatis mutandis, to the following, which may be considered as an addition to the system of equations (57.) (Theorem III.):
\[
\frac{da_i}{dt} = \frac{dZ}{db_i}, \quad \frac{db_i}{dt} = -\frac{dZ}{da_i}. \quad \ldots \ldots \ldots \ldots \quad (60.)
\]
In these equations, \( a_i, b_i \) in the first members are supposed to be expressed in terms of the variables (Hyp. II.), whilst in the second members \( x_i, \&c., y_i, \&c. \) are supposed to be expressed in terms of the constants and \( t \) (Hyp. I.). As before, \( Z = -\frac{dX}{dt} \), but in (60.) \( Z \) is differently expressed, being what the \( Z \) of (55.) becomes when \( x_i, \&c., y_i, \&c. \) are expressed according to Hyp. I.
It is to be remembered that all consequences deduced from the form of the system (50.), (54.) belong to the system of equations, obtained as in Theorem VII., which express the solution of the differential equations (I.). Such a solution will be called, as before, a normal solution; and the system of equations obtained by expressing \( a_i, \&c., b_i, \&c. \) in terms of the variables and \( t \), will be called a system of normal integrals. (See art. 20, and the note to art. 29.)
51. Let \( a_i, \&c., b_i, \&c. \) be called, as before, elements. If then \( c \) be any function of the elements, when the latter are expressed in terms of the variables and \( t \)
(Hyp. II.), \( c \) becomes also a function of the same; and we have
\[
\frac{dc}{dt} = \sum_i \left( \frac{dc}{da_i} \frac{da_i}{dt} + \frac{dc}{db_i} \frac{db_i}{dt} \right) = \sum_i \left( \frac{dc}{da_i} \frac{dZ}{db_i} - \frac{dc}{db_i} \frac{dZ}{da_i} \right)
\]
(see the last article). But, by Theorem V., this becomes
\[
\frac{dc}{dt} = [c, Z].
\]
It is worth observing that both this equation and (60.) might have been obtained indirectly as follows. Since \( c \) is constant, we have \( c' = 0 \); that is, \( \frac{dc}{dt} + [Z, c] = 0 \) (see (32.), art. 22.); this gives (62.), since \([Z, c] = -[c, Z]\), and again, by Theorem V., is changed into (61.); and if, in the latter, we put successively \( c = a_j, c = b_j \), we obtain the system (60.).
Section V.—On the Variation of Elements.
52. The following general problem includes, I believe, all the cases which occur in practice. Let \( P_1, \ldots, P_n, Q_1, \ldots, Q_n \) be any functions whatever of the \( 2n \) variables \( x_1, \ldots, x_n, y_1, \ldots, y_n \) and \( t \). It is required to express the \( 2n \) integrals of the system of \( 2n \) simultaneous differential equations of the first order
\[
x'_i = P_i, \quad y'_i = Q_i
\]
in the same form as the integrals (supposed given) of the canonical system
\[
x'_i = \frac{dZ}{dy_i}, \quad y'_i = -\frac{dZ}{dx_i}
\]
by substituting functions of \( t \) for the constant elements of the latter system.
Suppose a normal solution (see end of art. 50.) of the system (I.) to be employed. The elements \( a_i, b_i \) represent the same functions of \( x_i, \&c., y_i, \&c. \) and \( t \) as before, but are now variable; consequently we have
\[
a'_i = \frac{da_i}{dt} + \sum_j \left[ \frac{da_i}{dx_j} x'_j + \frac{da_i}{dy_j} y'_j \right] = \frac{da_i}{dt} + \sum_j \left[ P_j \frac{da_i}{dx_j} + Q_j \frac{da_i}{dy_j} \right],
\]
with a similar expression for \( b'_i \). But, by equations (57.) and (60.), these are immediately transformed into the following:
\[
a'_i = \frac{dZ}{db_i} + \sum_j \left[ Q_j \frac{dx_j}{db_i} - P_j \frac{dy_j}{db_i} \right]
\]
\[
b'_i = -\frac{dZ}{da_i} - \sum_j \left[ Q_j \frac{dx_j}{da_i} - P_j \frac{dy_j}{da_i} \right]
\]
where \( Z, Q_j, P_j, x_j, y_j \) in the second members are supposed to be expressed (Hyp. I.) in terms of the elements and \( t \). Thus the system (63.) is transformed into a system involving the new variables \( a_i, b_i \), instead of the original variables \( x_i, y_i \).
53. If, instead of employing a set of normal integrals of the pattern system (I.), we take any complete set of integrals \( c_1, c_2, \ldots, c_{2n} \), then \( c_1, \&c. \) may be considered as
functions of \(a_1, \&c.\), and again, through them, of the variables. We have then
\[
c'_i = \frac{dc_i}{da_1}a'_1 + \ldots + \frac{dc_i}{db_1}b'_1 + \ldots;
\]
and if in this equation the values of \(a'_1, \&c.\) be introduced from the formula (E.) of the last article, the following expression results:
\[
c'_i = \{Z, c_i\} + \Sigma_j(Q_j\{x_j, c_i\} - P_j\{y_j, c_i\})
\]
(in which the symbol \(\{p, q\}\) is used to denote
\[
\Sigma_k\left\{\frac{dp}{db_k} \frac{dq}{da_k} - \frac{dp}{da_k} \frac{dq}{db_k}\right\},
\]
so that by (59.) (Theorem V.) we have \(\{p, q\} = -[p, q]\); but in \(\{p, q\}\) \(p\) and \(q\) are considered as functions of \(a_1, \&c., b_1, \&c.\), whilst in \([p, q]\) they are considered as functions of \(x_1, \&c., y_1, \&c.\). Now, considering \(p, q\) as functions of \(c_1, \&c.\), and through these, of \(a_1, \&c.\), we have (by reasoning exactly similar to that employed in deducing equation (24.), art. 9.)
\[
\{p, q\} = \Sigma\left(\{c_\alpha, c_\beta\}\left(\frac{dp}{dc_\alpha} \frac{dq}{dc_\beta} - \frac{dp}{dc_\beta} \frac{dq}{dc_\alpha}\right)\right)
\]
(the summation referring to all binary combinations of the indices \(\alpha, \beta\)). Hence we have, putting \(q = c_i\),
\[
\{p, c_i\} = \Sigma_\alpha\left(\{c_\alpha, c_i\}\frac{dp}{dc_\alpha}\right), \ldots \ldots \ldots \ldots \quad (64.)
\]
and consequently the above expression for \(c'_i\) becomes
\[
c'_i = \{Z, c_i\} + \Sigma_\alpha\Sigma_j\left(\{c_\alpha, c_i\}\left(Q_j\frac{dx_j}{dc_\alpha} - P_j\frac{dy_j}{dc_\alpha}\right)\right), \ldots \ldots \ldots \quad (F.)
\]
an equation which is easily seen to become identical with (E.), art. 52, when \(c_1, \ldots c_{2n}\) represent \(a_1, \ldots a_n, b_1, \ldots b_n\).
54. The simplest case is that in which the system of equations (63.), whose integrals are sought, are of the canonical form; that is, where
\[
P_i = \frac{dW}{dy_i}, \quad Q_i = -\frac{dW}{dx_i},
\]
\(W\) being a given function of the variables (with or without \(t\)). In this case the formula (E.) becomes
\[
\begin{align*}
a'_i &= \frac{dZ}{db_i} - \frac{dW}{db_i} \\
b'_i &= -\frac{dZ}{da_i} + \frac{dW}{da_i}
\end{align*}
\]
whilst (F.) is easily found to be reducible, by the help of (64.), to either of the following forms:
\[
c'_i = \{Z, c_i\} - \{W, c_i\} \ldots \ldots \ldots \ldots \quad (66.)
\]
\[
c'_i = \Sigma_\alpha\left(\{c_\alpha, c_i\}\left(\frac{dZ}{dc_\alpha} - \frac{dW}{dc_\alpha}\right)\right). \ldots \ldots \ldots \ldots \quad (67.)
\]
If we put \( W = Z + \Omega \), so that \( \Omega \) may be called the "disturbing function," the above formulae become
\[
a_i' = -\frac{d\Omega}{db_i}, \quad b_i' = \frac{d\Omega}{da_i}. \quad \ldots \quad \ldots \quad \ldots \quad \ldots \quad \ldots \quad (68.)
\]
\[
c_i' = \Sigma_a \left( \{ c_i, c_a \} \frac{d\Omega}{dc_a} \right). \quad \ldots \quad \ldots \quad \ldots \quad \ldots \quad \ldots \quad (69.)
\]
On the first of these forms see the note to art. 38. With respect to the form (69.), if we put for \( \{ c_i, c_a \} \) its equivalent \([c_i, c_a]\), or \([c_a, c_i]\) (see Theorem V. art. 49.), we obtain the well-known expression
\[
c_i' = \Sigma_a \left( [c_a, c_i] \frac{d\Omega}{dc_a} \right).
\]
The difference between this last form and (69.) consists in this; that in the latter the coefficients \([c_a, c_i]\) are obtained from the expressions for \(c_i, c_a, \&c.\) in terms of the variables; whereas in (69.) the coefficients \(\{ c_i, c_a \}\) are similarly obtained from the expressions for \(c_i, \&c.\) in terms of the normal elements \(a_i, \&c., b_i, \&c.*\); and when a normal solution of the undisturbed problem has been obtained, the latter process will generally be found much more convenient than the former, since the elements \(c_i, \&c.\) will usually be much simpler functions of the normal elements than of the variables.
55. In illustration of this, it will be worth while to deduce the expressions for the variations of the ordinary elliptic elements of a planet's orbit from those of the normal elements given in art. 30.
Let \(a\) and \(e\) be the semiaxis major and eccentricity, \(i\) the inclination of the orbit to a fixed ecliptic, \(v\) the longitude of the node, \(w\) the longitude of the perihelion, \(nt + (\varepsilon)\) the mean longitude of the planet; longitudes being reckoned in the plane of the ecliptic (from a fixed origin) as far as the node, and then on the plane of the orbit. As usual, \(n\) stands for \(\frac{\mu^{\frac{1}{2}}}{a^{\frac{3}{2}}}\). Also let \(nt + (\varepsilon) = \int_0^n ndt + \varepsilon\), so that \(\varepsilon' = (\varepsilon)' + tn'\).
If, then, we call the six normal elements \(a_1, a_2, a_3, \beta_1, \beta_2, \beta_3\), we have (see art. 30.)
\[
\alpha_1 = \frac{m\mu}{2a}, \quad \beta_1 = \frac{w - (\varepsilon)}{n},
\]
\[
\alpha_2 = m\sqrt{\mu a(1-e^2)}, \quad \beta_2 = w - v,
\]
\[
\alpha_3 = m\sqrt{\mu a(1-e^2)\cos i}, \quad \beta_3 = v;
\]
from which, conversely,
\[
a = \frac{m\mu}{2a_1}, \quad w = \beta_2 + \beta_3,
\]
\[
1 - e^2 = \frac{2a_1a_3^2}{m^3\mu^2}, \quad v = \beta_3,
\]
\[
\cos i = \frac{a_3}{a_2}, \quad (\varepsilon) = \beta_2 + \beta_3 - \frac{(2a_1)^{\frac{3}{2}}}{m^{\frac{3}{2}}\mu} \beta_1.
\]
* \(\{ c_i, c_a \} = \Sigma_j \left( \frac{dc_i}{db_j} \frac{dc_a}{da_j} - \frac{dc_i}{da_j} \frac{dc_a}{db_j} \right)\).
From these expressions the values of \{a, e\}, \{a, i\}, &c. are found with the greatest simplicity, and the results are
\[ m_\mu \{a, (\varepsilon)\} = 2na^2, \quad m_\mu \{(\varepsilon), e\} = \frac{na}{e} \sqrt{1-e^2}(1-\sqrt{1-e^2}), \]
\[ m_\mu \{w, e\} = \frac{na}{e} \sqrt{1-e^2}, \quad m_\mu \{(\varepsilon), i\} = \frac{na}{\sqrt{1-e^2}} \tan \frac{i}{2}, \]
\[ m_\mu \{w, i\} = \frac{na}{\sqrt{1-e^2}} \tan \frac{i}{2}, \quad m_\mu \{v, i\} = \frac{na}{\sin i \sqrt{1-e^2}}, \]
the rest all vanishing. Hence, observing that if \( R \) be taken in its usual signification we have \( \Omega = -R \), we obtain*
\[ \mu a' = 2na^2 \frac{dR}{d(\varepsilon)}, \]
\[ \mu e' = -\frac{na}{e} \sqrt{1-e^2} \left[ \frac{dR}{d\omega} + (1-\sqrt{1-e^2}) \frac{dR}{d(\varepsilon)} \right], \]
\[ \mu (\varepsilon)' = -2na^2 \frac{dR}{da} + \frac{na}{e} \sqrt{1-e^2} (1-\sqrt{1-e^2}) \frac{dR}{de} + \frac{na}{\sqrt{1-e^2}} \tan \frac{i}{2} \frac{dR}{di}, \]
\[ \mu w' = na \left[ \frac{\sqrt{1-e^2}}{e} \frac{dR}{de} + \frac{1}{\sqrt{1-e^2}} \tan \frac{i}{2} \frac{dR}{di} \right], \]
\[ \mu i' = -\frac{na}{\sqrt{1-e^2}} \left[ \frac{1}{\sin i} \frac{dR}{dv} + \tan \frac{i}{2} \left( \frac{dR}{d(\varepsilon)} + \frac{dR}{d\omega} \right) \right], \]
\[ \mu v' = \frac{na}{\sin i \sqrt{1-e^2}} \frac{dR}{di}, \]
in which we may, as usual, put \( \varepsilon \) for \( (\varepsilon) \), provided that in forming the term \( \frac{dR}{da} \), \( nt \) be exempt from differentiation with respect to \( a \).
56. A comparison of the above process with that by which the corresponding
* If we consider \( R \) as a function of \( p, q \) instead of \( i, v \), where \( p = \tan i \cos v, q = \tan i \sin v \), we find
\[ \frac{dR}{di} = \sec^2 i \left( \cos v \frac{dR}{dp} + \sin v \frac{dR}{dq} \right) \]
\[ \frac{dR}{dv} = \tan i \left( \cos v \frac{dR}{dq} - \sin v \frac{dR}{dp} \right), \]
and consequently
\[ \mu p' = \frac{-na(\sec i)^2}{\sqrt{1-e^2}} \left[ \sec i \frac{dR}{dq} + \tan \frac{i}{2} \cos v \left( \frac{dR}{d(\varepsilon)} + \frac{dR}{d\omega} \right) \right] \]
\[ \mu q' = \frac{na(\sec i)^2}{\sqrt{1-e^2}} \left[ \sec i \frac{dR}{dp} - \tan \frac{i}{2} \sin v \left( \frac{dR}{d(\varepsilon)} + \frac{dR}{d\omega} \right) \right]. \]
The formulæ will then agree with those of the Mécanique Céleste (Supplement to vol. iii. p. 360, ed. 1844), if we allow for the different mode of measuring longitudes, and neglect, as Laplace does, terms of the second order with respect to \( i \) and \( \frac{dR}{di} \). (Laplace uses \( R \) with the opposite sign.) Those in the text agree (allowing for notation) with the expressions given by Professor Hansen, Astr. Nachr. No. 166, art. 3, equations (2).
MDCCCLV.
expressions are obtained by Pontécoulant*, will show the convenience of using the coefficients \{c_i, c_j\} instead of \([c_i, c_j]\) (in Pontécoulant's notation \((c_i, c_j)\)).
[It will be observed that the formulae for \((\varepsilon)', \varpi', i'\) at the end of the last article, do not agree with those of Pontécoulant (p. 330) for the variations of the corresponding quantities \(i, \omega, \phi\). The reason of this is as follows:—In Pontécoulant's notation \(\phi\) expresses the same as \(i\) in this paper, and \(\alpha\) the same as \(\nu\). But \(\omega\) (the longitude of the perihelion) is not the same as \(\varpi\); the former being measured entirely in the plane of the orbit from a radius vector, fixed in that plane†, and assumed as the origin of longitudes. Consequently \(i\), in Pontécoulant (which we will call \(\varepsilon_i\) for distinction), is not the same as \((\varepsilon)\) in the present paper. In fact, if we equate the expressions for the mean anomaly in the two notations, we have
\[
\varepsilon_i - \omega = (\varepsilon) - \varpi;
\]
also it is evident that if we put \(\beta\) for the angle between the node and the origin from which \(\omega\) is measured, we have \(d\beta = -\cos i d\nu\), and \(\varpi = \nu + \beta + \omega\), so that
\[
d\varpi = d\omega + (1 - \cos i) d\nu.
\]
If then it were allowable to consider \(R\) as capable of being expressed as a function of \(\omega\) and \(\varepsilon_i\) instead of \(\varpi\) and \((\varepsilon)\), and if we represented by \((R)\) the expression for \(R\) so transformed, we should have
\[
\frac{dR}{d\varpi} d\varpi + \frac{dR}{d(\varepsilon)} d(\varepsilon) + \text{&c.} = \frac{d(R)}{d\omega} d\omega + \frac{d(R)}{d\varepsilon_i} d\varepsilon_i + \text{&c.};
\]
and if, in the two first terms, we put for \(d\varpi\) and \(d(\varepsilon)\) the values \(d\varpi = d\omega + (1 - \cos i) d\nu\), \(d(\varepsilon) = d\varepsilon_i + (1 - \cos i) d\nu\), and compare the two expressions, we find
\[
\frac{dR}{d\varpi} = \frac{d(R)}{d\omega}, \quad \frac{dR}{d(\varepsilon)} = \frac{d(R)}{d\varepsilon_i},
\]
\[
\frac{dR}{d\nu} + (1 - \cos i) \left( \frac{dR}{d(\varepsilon)} + \frac{dR}{d\varpi} \right) = \frac{d(R)}{d\nu}.
\]
These relations, together with the equation
\[
\varpi' = \omega' + (1 - \cos i)' ,
\]
are easily seen to render the expressions at the end of art. 55 identical with those of Pontécoulant; in fact, it is by an equivalent transformation that the latter are finally obtained by that author from the correct expressions in p. 328. But it is to be observed that this proceeding is founded upon a false assumption; for it is impossible to express \(R\) as a function of \(a, e, i, \nu, \varepsilon_i, \omega\), as is obvious from the consideration that \(R\), in its original form, is not a function of \((\varepsilon) - \varpi\) merely, but also of \((\varepsilon)\); whilst \((\varepsilon)\) is not expressible as a function of the new elements, as is shown by the equation \(d(\varepsilon) = d\varepsilon_i + (1 - \cos i) d\nu\)‡. It would be out of place to enter further into this sub-
* Théorie Anal. du Système du Monde, tome i. pp. 316–330.
† On the meaning of this expression, see below, art. 73.
‡ It would be a work of some trouble to trace accurately the process by which Laplace arrives at the for-
ject here, especially as some of the most important principles involved in it have been discussed elsewhere*. See also Appendix B.]
57. Returning to the expression (69.), art. 54, it may be observed that the coefficients \{c_i, c_j\} are to be expressed in terms of \(c_1, c_2, &c.\), and this involves no difficulty when each of the two sets of elements \(c_1, &c., b_1, &c.\) can be expressed in terms of the other explicitly, as was the case in the example just discussed. Suppose, however, that the normal set \(a_1, &c., b_1, &c.\) are given in terms of the set \(c_1, &c.\), but that it is impracticable or inconvenient to obtain the converse equations expressing the latter in terms of the former. In this case we may proceed as follows.
Adopting the notation of art. 1†, and putting \(f, g\) for any two of the set \(c_1, c_2, &c.\), we have
\[
\{f, g\} = \sum_{i} d(f, g)
\]
suppose this equation written at length, and then, after multiplying by \(\frac{d(b_i, a_j)}{d(f, g)}\), let each side be summed with respect to all binary combinations \(f, g\). The result is (see art. 1, equation (4.)),
\[
\Sigma \left( \frac{d(b_j, a_i)}{d(f, g)} \cdot \{f, g\} \right) = 1 \ldots \ldots \ldots \ldots (70.)
\]
(the summation referring to the combinations \(f, g\)). Again, if the former equation be multiplied by \(\frac{d(p, q)}{d(f, g)}\), where \(p, q\) represent any two of the normal elements, \(a_i, &c., b_i, &c.\), except a conjugate pair, and the sum be taken as before, we have
\[
\Sigma \left( \frac{d(p, q)}{d(f, g)} \cdot \{f, g\} \right) = 0. \ldots \ldots \ldots \ldots (71.)
\]
The two formulae (70.), (71.) give \(n(2n-1)\) linear equations for determining the \(n(2n-1)\) unknown quantities \{f, g\}; the coefficients of the latter being all given functions of \(c_i, &c.\). But such cases will hardly occur in practice. (With respect to the form of the above system of linear equations, it is easy to show that the complete determinant of the coefficients is =1.)
58. The integration of the formulæ (65.), art. 53, would give the means of expressing the solution of the system
\[
x'_i = \frac{dW}{dy_i}, \quad y'_i = -\frac{dW}{dx_i}
\]
mulæ alluded to in a preceding note, as the various steps of it are to be found in different places, the notation is somewhat inconsistent, and the results do not profess to be rigorous. My impression is, however, that Laplace nowhere commits the fallacy of assuming (for example) that \(R\) is a function of \(r, v, z,\) or \(r, v, s\) (see vol. i. p. 295), where \(v\) is the angle described by the radius vector on the varying plane of the orbit.
* See Jacobi's two letters to Professor Hansen in Crelle's Journal, vol. xlii.
† i.e. using \(d(u,v)\) as an abbreviation for \(\frac{du}{dx}\frac{dv}{dy} - \frac{du}{dy}\frac{dv}{dx}\).
in the form of a normal solution of any other similar system
\[ x'_i = \frac{dZ}{dy_i}, \quad y'_i = -\frac{dZ}{dx_i}, \]
which may be chosen as the pattern.
In the most usual examples the function to be chosen for \( Z \) is naturally suggested by the circumstance, that \( W \) presents itself under the form of the sum of two functions \( Z + \Omega \), of which the former, taken alone, gives an integrable system. But this is not necessarily the case; and it is worth while to observe that the formulæ (65.) take a simple and remarkable form whatever \( Z \) may be, provided that it be a function not containing \( t \) explicitly. For then, assuming the "integral of vis viva," \( Z = h \), as one of the normal integrals of the pattern system*, the element conjugate to \( h \) is \( \tau \) (the constant added to \( t \)); and observing that \( Z \), in (65.), being expressed in terms of the elements, reduces itself simply to \( h \), we shall have \( \frac{dZ}{dh} = 1 \), whilst the differential coefficients of \( Z \) with respect to all the other elements vanish; so that, if we put \( a_1, \ldots a_{n-1}, b_1, \ldots b_{n-1} \) for the remaining elements, the system (65.) takes the following form:
\[
\begin{align*}
h' &= -\frac{dW}{d\tau}, & \tau' &= -1 + \frac{dW}{dh} \\
a'_i &= -\frac{dW}{db_i}, & b'_i &= \frac{dW}{da_i}
\end{align*}
\]
This, in dynamics, gives the process to be used in the following problem: "To express the solution of any dynamical problem in the form of the solution of any other (involving the same number of variables) in which the principle of vis viva subsists."
59. As an example of the above process we may apply it to determine the motion of a simple free pendulum (not taking into account the earth's rotation).
Let \( l \) be the length of the pendulum, and let the mass of the material point \( m \) placed at its extremity be represented by unity. Also let \( x, y, z \) be the rectangular coordinates of \( m \), the origin being at the position of rest of \( m \), and the axis of \( z \) directed vertically upwards. The equation to the sphere described by \( m \) is
\[ x^2 + y^2 + z^2 - 2lz = 0, \]
and the force-function \( U \) is \(-gz\).
Hence if we take, as the two independent coordinates, the radius vector \( \varepsilon \) of the projection of \( m \) on the plane of \( xy \), and the angle \( \theta \) between \( \varepsilon \) and the axis of \( x \), we shall have for the differential equations of motion,
\[
\begin{align*}
\varepsilon' &= \frac{dW}{du}, & \theta' &= \frac{dW}{dv} \\
u' &= -\frac{dW}{d\varepsilon}, & v' &= -\frac{dW}{d\theta}
\end{align*}
\]
(A.)
* See art. 19 (where \( h_i \) in equation (29.) is a misprint for \( b_i \)).
where \( u, v \) are the variables conjugate respectively to \( \xi, \theta \), and defined by the equations
\[
u = \frac{dT}{d\xi}, \quad v = \frac{dT}{d\theta};
\]
and \( W \) is \( T - U \) expressed in terms of \( \xi, \theta, u, v \).
Now \( x = \xi \cos \theta, y = \xi \sin \theta, z = l - \sqrt{l^2 - \xi^2} \); hence
\[
T = \frac{1}{2}(x'^2 + y'^2 + z'^2) = \frac{1}{2}\left(\frac{l^2}{l^2 - \xi^2} \xi'^2 + \xi^2 \theta'^2\right),
\]
from which the following expression for \( W \) is easily obtained:
\[
W = \frac{1}{2}\left(\frac{l^2 - \xi^2}{l^2} u^2 + \frac{v^2}{\xi^2}\right) + g(l - \sqrt{l^2 - \xi^2}). \quad \ldots \ldots \ldots \quad (W.)
\]
Now let us take as a model for the solution of the above system, a set of normal integrals (in polar coordinates) of the system
\[
x'' + n^2 x = 0, \quad y'' + n^2 y = 0, \quad \ldots \ldots \ldots \ldots \ldots \quad (B.)
\]
where \( n^2 = \frac{g}{l} \). In this system we have \( U = -\frac{1}{2}n^2 \xi^2 \); and proceeding exactly as in art. 27, we obtain the following results: the two integrals of vis viva and of areas are
\[
h = \frac{1}{2}\left(u^2 + \frac{v^2}{\xi^2} + n^2 \xi^2\right) \quad \ldots \ldots \ldots \ldots \ldots \quad (i.)
\]
these are to be solved for \( u, v \); and then \( V \) is to be obtained from the equation
\[
dV = ud\xi + vd\theta. \quad \text{This gives}
\]
\[
V = c\theta + \int d\xi \left[2h - n^2 \xi^2 - \frac{c^2}{\xi^2}\right]^{\frac{1}{2}};
\]
and the remaining integrals are given by the equations
\[
\frac{dV}{dh} = t + \tau, \quad \frac{dV}{dc} = \varpi,
\]
\( \tau \) and \( \varpi \) being the elements conjugate respectively to \( h \) and \( c \). Performing the differentiations first, and taking the integrals in the second term so as to vanish with the expression \( \left\{2h - n^2 \xi^2 - \frac{c^2}{\xi^2}\right\}^{\frac{1}{2}} \) (see Appendix A.), we find easily the final equations
\[
n^2 \xi^2 = h + \sqrt{h^2 - n^2 c^2} \cdot \cos 2n(t + \tau)
\]
\[
c^2 \xi^{-2} = h - \sqrt{h^2 - n^2 c^2} \cdot \cos 2(\theta - \varpi),
\]
in which \( \varpi \) is the angle between the axis of \( x \) and a (distant) apse, and \( -\tau \) is the time of passage through that apse. The four equations (i.), (ii.) comprise a complete normal solution of the equations (B.). The last is the polar equation to the elliptic orbit; and if we call \( a, b \) the semiaxes of the ellipse, we have
\[
c = nab, \quad h = n^2 \frac{a^2 + b^2}{2}.
\]
60. The solution of the system (A.) of the last article will now be expressed by the same equations (i.), (ii.), if the elements \( h, c, \tau, \varpi \) be variables defined by the system (see art. 58.)
\[
h' = -\frac{dW}{d\tau}, \quad \tau' = -1 + \frac{dW}{dh}
\]
\[
c' = -\frac{dW}{d\varpi}, \quad \varpi' = \frac{dW}{dc},
\]
where \( W \) is to be obtained by substituting in the expression \((W.)\), art. 59, the values of \( g, \theta, n, v \) in terms of the elements and \( t \), derived from equations (i.), (ii.). The result of this substitution is
\[
W = \frac{h^2}{2} + \frac{c^2}{4l^2} - \frac{h^2}{4n^2l^2} - \frac{1}{2} \sqrt{h^2 - n^2c^2} \cos 2n(t+\tau)
\]
\[
+ \frac{h^2 - n^2c^2}{4n^2l^2} \cos 4n(t+\tau) + n^2l(l - \sqrt{l^2 - \xi^2}),
\]
in which the value of \( \xi^2 \) in the last term must be understood to be substituted from the first of equations (ii.). If we call \( \phi \) the angle between the pendulum and the vertical, we shall have evidently
\[
n^2l(l - \sqrt{l^2 - \xi^2}) = n^2l^3(1 - \cos \phi),
\]
and the differential coefficient of this term with respect to any constant \( k \) involved in the value of \( g \) will be \( \frac{n^2}{2 \cos \phi} \frac{dg}{dk} \). Observing this, we obtain the following expressions for the variations of the elements:
\[
h' = -n(\sec \phi - 1)\sqrt{h^2 - n^2c^2} \sin 2n(t+\tau) + \frac{h^2 - n^2c^2}{nl^2} \sin 4n(t+\tau)
\]
\[
\tau' = -\frac{h}{2n^2l^2} + \frac{1}{2}(\sec \phi - 1)\left(1 + \frac{h}{\sqrt{h^2 - n^2c^2}} \cos 2n(t+\tau)\right) + \frac{h}{2n^2l^2} \cos 4n(t+\tau)
\]
\[
c' = 0
\]
\[
\varpi' = \frac{c}{2l^2} - \frac{1}{2}(\sec \phi - 1)\frac{n^2c}{\sqrt{h^2 - n^2c^2}} \cos 2n(t+\tau) - \frac{c}{2l^2} \cos 4n(t+\tau).
\]
The third of these equations gives \( ab = \text{constant} \); hence, by means of the equations at the end of art. 59, the following expressions are easily deduced:
\[
a' = -\frac{b'}{b} = \frac{n}{2}\left(-(\sec \phi - 1) \sin 2n(t+\tau) + \frac{a^2 - b^2}{l^2} \sin 4n(t+\tau)\right)
\]
\[
\tau' = \frac{a^2 + b^2}{4l^2}(-1 + \cos 4n(t+\tau)) + \frac{1}{2}(\sec \phi - 1)\left(1 + \frac{a^2 + b^2}{a^2 - b^2} \cos 2n(t+\tau)\right)
\]
\[
\varpi' = nab\left\{\frac{1 - \cos 4n(t+\tau)}{2l^2} - (\sec \phi - 1)\frac{\cos 2n(t+\tau)}{a^2 - b^2}\right\}.
\]
These equations are rigorous, and in general not easier to integrate than the original system of which they are a transformation; but they may be integrated approxi-
mately on particular hypotheses. For instance, if the pendulum never deviates much from the vertical, \( \frac{g^2}{l^2} \) is always small, and \( \sec \phi - 1 = \frac{g^2}{2l^2} \) nearly; introducing this value, and substituting for \( g^2 \), we have for the non-periodic parts of the above expressions
\[
\frac{d'}{a} = -\frac{b'}{b} = 0
\]
\[
\tau' = -\frac{a^2 + b^2}{16l^2}, \quad \omega' = \frac{3nab}{8l^2}.
\]
Hence the axes of the mean ellipse are constant; also if \( T \) be the time of describing the ellipse, we shall have approximately \( T = \frac{2\pi}{n} \cdot \frac{1}{1+\tau'} \), whence it follows that the motion of the apse during this period will be \( \frac{3nab}{8l^2} \cdot \frac{2\pi}{n(1+\tau')} \), or \( \frac{3ab}{8l^2} \cdot 2\pi \) nearly.
This agrees with the statement of the Astronomer Royal*. The above approximations would cease to be sufficiently accurate if \( b \) were very nearly equal to \( a \), or the motion very nearly circular; but they apply to any other case, and in particular to that in which \( b \) is very small, or the motion nearly rectilinear. In the case of nearly circular motion, it would be necessary to develope \( \sec \phi \) in a series of powers of \( a^2 - b^2 \), and the results would be applicable whether \( \phi \) were small or not. But I shall not pursue this subject further here.
**Section VI.—On the Transformation of Variables.**
61. The method of the variation of elements, theoretically considered, consists merely in a transformation of variables of a particular kind; that kind namely, which leads to a new system of differential equations belonging to the same general class as the original system. But practically, the choice of variables is determined by the well-known considerations from which the method derives its name.
It is the object of the present section to consider the general class of transformations of which the method in question is a particular, and not the only useful case.
62. **Definition of Normal Transformations.**
Let \( \xi_1, \xi_2, \ldots, \xi_n, \eta_1, \eta_2, \ldots, \eta_n \) be new variables connected with the original variables \( x_1, \&c., y_1, \&c. \) by \( 2n \) equations (which may also involve \( t \) explicitly), such that each variable of either set may be considered as a function of the variables of the other set (with or without \( t \)). Let \( P \) be any function of \( \xi_1, \xi_2, \ldots, \xi_n, y_1, y_2, \ldots, y_n \) and \( t \); then if the equations connecting the old and new variables can be put in the form
\[
\frac{dP}{dy_i} = x_i, \quad \frac{dP}{d\xi_i} = \eta_i, \ldots \ldots \ldots \ldots \ldots \ldots \ldots \ldots (73.)
\]
I propose to call the transformation normal.
* Proceedings of the Royal Astronomical Society, vol. xi. p. 160.
[It follows from Theorem I. art. 49, that the system (73.) is equivalent to each of the following:
\[
\frac{dQ}{dx_i} = y_i, \quad \frac{dQ}{d\xi_i} = -\eta_i,
\]
in which \( Q = -P + \Sigma_i(x_i y_i) \), and is expressed in terms of \( x_1, \ldots x_n, \xi_1, \ldots \xi_n \); or
\[
\frac{dR}{d\eta_i} = \xi_i, \quad \frac{dR}{dy_i} = -x_i,
\]
in which \( R = -P + \Sigma_i(\xi_i \eta_i) \), and is expressed in terms of \( y_1, \ldots y_n, \eta_1, \ldots \eta_n \); or lastly,
\[
\frac{dS}{d\eta_i} = \xi_i, \quad \frac{dS}{dx_i} = y_i,
\]
in which \( S = -P + \Sigma_i(x_i y_i + \xi_i \eta_i) \), and is expressed in terms of \( x_1, \ldots x_n, \eta_1, \ldots \eta_n \).
Any one of these forms might be used; but I shall employ the form (73.) for reasons of convenience.]
63. Inasmuch as the equations (73.) of the last article are of the same general form as the system (54.), art. 49, all the conclusions deduced from that form will subsist, mutatis mutandis; so that we may apply the Theorems (II.), (III.), (IV.), (V.), art. 49, by merely changing \( X \) into \( P \), and
\( x_1, \ldots x_n, y_1, \ldots y_n, a_1, \ldots a_n, b_1, \ldots b_n \) respectively into
\( \xi_1, \ldots \xi_n, \eta_1, \ldots \eta_n, y_1, \ldots y_n, x_1, \ldots x_n \);
observing that instead of \( x'_i, y'_i \) we must now write \( \frac{d\xi_i}{dt}, \frac{d\eta_i}{dt} \). We thus obtain the following relations:
\[
\frac{d\xi_i}{dt} = \frac{d\Psi}{d\eta_i}, \quad \frac{d\eta_i}{dt} = -\frac{d\Psi}{d\xi_i},
\]
where \( \Psi \) is a function of \( \xi_1, \ldots \xi_n, \eta_1, \ldots \eta_n \) and \( t \), defined by the equation
\[
\Psi = -\left( \frac{dP}{dt} \right),
\]
the brackets indicating that the expressions for \( y_1, \ldots y_n \) in terms of the new variables \( \xi_1, \ldots \xi_n, \eta_1, \ldots \eta_n \) are to be substituted in \( \frac{dP}{dt} \) after the differentiation with respect to \( t \); which is performed so far as \( t \) appears explicitly in the original expression for \( P \) as a function of \( \xi_1, \ldots \xi_n, y_1, \ldots y_n \) and \( t \). (See Theorem II.)
We have also the system
\[
\begin{align*}
\frac{d\xi_i}{dy_j} &= -\frac{dx_j}{d\eta_i}, & \frac{d\xi_i}{dx_j} &= \frac{dy_j}{d\eta_i}, \\
\frac{d\eta_i}{dy_j} &= \frac{dx_j}{d\xi_i}, & \frac{d\eta_i}{dx_j} &= -\frac{dy_j}{d\xi_i}.
\end{align*}
\]
* For in the original theorems \( x'_i \) is the same thing as the differential coefficient of \( x_i \) taken with respect to \( t \), as \( t \) appears explicitly in the expression for \( x_i \) in terms of \( a_1, \ldots a_n, b_1, \ldots b_n \) and \( t \); the analogous quantity in the present case is therefore the differential coefficient of \( \xi_i \) taken with respect to \( t \), as \( t \) appears explicitly in the expression for \( \xi_i \) in terms of \( x_1, \ldots x_n, y_1, \ldots y_n \) and \( t \). But this must not now be denoted by \( \xi'_i \), inasmuch as \( x_1, \ldots x_n, y_1, \ldots y_n \) are themselves afterwards to be considered as functions of \( t \).
(see Theorem III.). And if \( p, q \) be any two functions of the variables (with or without \( t \)), then
\[
\Sigma_i \left( \frac{dp}{dy_i} \frac{dq}{dx_i} - \frac{dp}{dx_i} \frac{dq}{dy_i} \right) = \Sigma_i \left( \frac{dp}{d\xi_i} \frac{dq}{d\eta_i} - \frac{dp}{d\eta_i} \frac{dq}{d\xi_i} \right),
\]
where \( p \) and \( q \) in the first member are supposed to be expressed in terms of \( x_1, \&c., y_1, \&c. \), and, in the second, in terms of \( \xi_1, \&c., \eta_1, \&c. \). In other words, the value of \([p, q]\) is the same, whether it be obtained from the expressions for \( p, q \) in terms of the original variables, or by an analogous process from their expressions in terms of the new.
Particular cases of (77.) are the relations
\[
[\xi_i, \eta_i] = -1, \quad [\xi_i, \xi_j] = [\eta_i, \eta_j] = [\xi_i, \xi_j] = 0.
\]
(See Theorems IV., V.)
64. The relations (74.), (76.), (77.), (78.) of the last article depend solely upon the form of the equations (73.), art. 62, which connect the new variables with the old; and are independent of any supposition as to the equations which may determine either set of variables as functions of \( t \). Let us now, however, introduce the supposition that the original variables \( x_1, \ldots x_n, y_1, \ldots y_n \) are determined as functions of \( t \) by the system of differential equations,
\[
x'_i = \frac{dZ}{dy_i}, \quad y'_i = -\frac{dZ}{dx_i}.
\]
The relations just established enable us immediately to transform this system into another involving the new variables instead of the old; for we have
\[
\xi'_i = \frac{d\xi_i}{dt} + \Sigma_j \left( \frac{d\xi_i}{dx_j} x'_j + \frac{d\xi_i}{dy_j} y'_j \right);
\]
now
\[
\frac{d\xi_i}{dt} = \frac{d\Psi}{d\eta_i} \quad \text{(see (74.), art. 63)};
\]
and if in the remaining term we substitute for \( x'_j, y'_j \) their values from (I.), and for \( \frac{d\xi_i}{dx_j}, \frac{d\xi_i}{dy_j} \) their values \( \frac{dy_j}{d\eta_i}, -\frac{dx_j}{d\eta_i} \), it becomes
\[
\Sigma_j \left( \frac{dZ}{dx_j} \frac{dx_i}{d\eta_i} + \frac{dZ}{dy_j} \frac{dy_i}{d\eta_i} \right),
\]
which is equivalent to \( \frac{dZ}{d\eta_i} \), if \( Z \) be supposed expressed in terms of the new variables.
We have then
\[
\xi'_i = \frac{d\Psi}{d\eta_i} + \frac{dZ}{d\eta_i},
\]
and, exactly in the same way,
\[
\eta'_i = -\frac{d\Psi}{d\xi_i} - \frac{dZ}{d\xi_i}.
\]
This result may be stated in the form of the following Theorem VIII*. If the system
* This theorem, in its general form, is, to the best of my knowledge, new. But that case of it in which \( P \) does not contain \( t \) explicitly has already been proved in a different way by M. Desboves, who has, by means
of differential equations (I.) be transformed by the introduction of new variables \( \xi_1, \ldots, \xi_n, \eta_1, \ldots, \eta_n \), connected with the original variables \( x_1, \ldots, x_n, y_1, \ldots, y_n \), by the equations
\[
\frac{dP}{dy_i} = x_i, \quad \frac{dP}{d\xi_i} = \eta_i,
\]
where \( P \) is any function of \( \xi_1, \ldots, \xi_n, y_1, \ldots, y_n \), which may also contain \( t \) explicitly, then the transformed equations are
\[
\dot{\xi}_i = \frac{d\Phi}{d\eta_i}, \quad \dot{\eta}_i = -\frac{d\Phi}{d\xi_i},
\]
in which \( \Phi \) is defined by the equation
\[
\Phi = Z - \frac{dP}{dt},
\]
and is to be expressed in terms of the new variables. (The substitution of the new variables in \( \frac{dP}{dt} \) is to be made after the differentiation. See art. 63.).
**Corollary.**—If \( P \) do not contain \( t \) explicitly, \( \frac{dP}{dt} = 0 \) and \( \Phi = Z \); so that in this case the transformation is effected merely by expressing \( Z \) in terms of the new variables.
65. It follows from (77.), art 63, that if \( f, g \) be any two integrals of the system (I.), the value of \( [f, g] \) is the same whether it be derived from these integrals in their original form, or similarly obtained from the same integrals after transformation by the introduction of the new variables. And consequently if \( n \) integrals \( a_1, a_2, \ldots, a_n \) of the original system be given, which satisfy the \( \frac{n(n-1)}{2} \) conditions \( [a_i, a_j] = 0 \), they will continue, after a normal transformation, to satisfy the analogous conditions, so that the method of finding the remaining integrals given in Theorem VII. art. 49, will also continue to be applicable. We had an instance of this in the case of the problem of central forces (art. 27.), where the above conditions were found to subsist after the transformation from rectangular to polar coordinates. (It will be shown presently that every transformation of coordinates is a normal transformation.)
66. It was shown in Part I. (art. 18.), that if \( W \) be any function of \( x_1, x_2, \ldots, x_n, x'_1, x'_2, \ldots, x'_n \) (which may also contain \( t \) explicitly), the system of \( n \) differential equations of the second order
\[
\left( \frac{dW}{dx'_i} \right)' = \frac{dW}{dx_i}
\]
may be changed into a system of \( 2n \) equations of the first order of the form (I.), of it, deduced Jacobi's form of the method of the Variation of Elements (namely, the equations (68.), art. 54), from the similar form of Lagrange, in which the elements are the initial values of the variables. It will appear in the sequel that the extension to the case in which \( P \) may contain \( t \), is of importance. If the expression were not already appropriated, I should have proposed definitively to call \( P \) the "modulus of transformation;" and I shall use this term provisionally in the present paper, not being able to suggest a tolerable substitute. After all, as the word "modulus" itself is used without confusion in very different senses according to the subject matter, there is, perhaps, no reason why a similar liberty should not be allowed in the use of the proposed expression.
art. 64, by putting $y_i = \frac{dW}{dx_i}$, and taking $Z$ a function of $x_1$, &c., $y_1$, &c. and $t$, defined by the equation
$$Z = -W + \Sigma_i (x'_i y_i),$$
in which $x'_1, \ldots x'_n$ are to be expressed in terms of $y_1$, &c., $x_1$, &c.*
Conversely, a system of the form (I.) being given, it may be changed into a system of the form (80.) as follows: by means of the equations $x'_i = \frac{dZ}{dy_i}$, let $y_1, \ldots y_n$ be expressed in terms of $x'_1$, &c., $x_1$, &c.; it follows from Theorem I. (art. 49.) that we shall have
$$y_i = \frac{dW}{dx'_i} \quad \ldots \ldots \ldots \ldots \ldots \ldots \ldots (a.)$$
and
$$\frac{dZ}{dx_i} = -\frac{dW}{dx'_i}, \quad \ldots \ldots \ldots \ldots \ldots \ldots \ldots (b.)$$
where $W$ is a function of $x_1$, &c., $x'_1$, &c. defined by equation
$$W = -Z + \Sigma_i (x'_i y_i),$$
in which $y_1, \ldots y_n$ are to be expressed in terms of $x'_1$, &c., $x_1$, &c. The $n$ equations $y'_i = -\frac{dZ}{dx_i}$ are then changed by (a.) and (b.) into the form (80.).
**On the Transformation of Coordinates.**
67. It has been seen in the preceding article, that we can always change the system of $2n$ equations of the first order of the form (I.), art. 64, into a system of $n$ equations of the second order of the form (80.). In this latter form the equations of dynamics naturally present themselves.
Now in the case of the dynamical equations, $x_1, x_2, \ldots x_n$ are the independent coordinates of the system (the word coordinates being taken in its most general sense), and when the equations are to be changed into the form (I.), the additional variables $y_1, \ldots y_n$ are defined by the equations $\frac{dW}{dx'_i} = y_i$. In this case a transformation of coordinates, in the most general sense, consists in taking $n$ new variables $\xi_1, \xi_2, \ldots \xi_n$, connected with the original coordinates $x_1, \ldots x_n$ by $n$ equations, which may also involve $t$ explicitly. It is a well-known theorem, that the transformation of the equations (80.) is effected merely by expressing $W$ in terms of the new coordinates $\xi_1, \ldots \xi_n$, and their differential coefficients $\xi'_1, \ldots \xi'_n$, instead of the old; so that the new equations are
$$\left(\frac{dW}{d\xi'_i}\right)' = \frac{dW}{d\xi'_i}: \quad \ldots \ldots \ldots \ldots \ldots \ldots \ldots (81.)$$
the proof of this theorem does not depend upon the form of the function $W$; and we know also (see arts. 18 & 66.), that whatever be the form of $W$, these new equations may be again transformed to a system of the form (I.), by taking $n$ additional
* This theorem is a generalization of Sir W. R. Hamilton's transformation of the Dynamical Equations, See Part I. art. 18.
variables \( n_1, \ldots, n_n \), defined by the equations
\[
n_i = \frac{dW}{d\xi_i}.
\]
[It is to be observed that this last transformation is not, in general, equivalent to expressing the original \( Z \) in terms of the new variables; for the original \( Z \) is \(-W + \Sigma(x_i y_i)\) (art. 66.), and the analogous expression derived from (81.) is \(-W + \Sigma(\xi_i n_i)\), which is not, in general, equivalent to the former. It will be seen presently that the two expressions are equivalent when the equations connecting \( x_1, \ldots, x_n \) with \( \xi_1, \ldots, \xi_n \) do not involve \( t \) explicitly.]
68. Following the analogy of the dynamical equations, I shall adopt the following as the
**Definition of a transformation of coordinates.**
The original equations (I.), art. 64, having been changed into the form (80.), art. 66, let \( \xi_1, \xi_2, \ldots, \xi_n \) be \( n \) new variables connected with the \( n \) variables \( x_1, x_2, \ldots, x_n \) by \( n \) equations, which may also involve \( t \) explicitly; and let \( n_1, n_2, \ldots, n_n \) be \( n \) other new variables defined by the equations \(\frac{dW}{d\xi_i} = n_i\) (where \( W \) has been expressed in terms of \( \xi_1, \&c., \xi_n, \&c. \)).
By means of the \( 2n \) assumed relations, the \( 2n \) original variables \( x_1, \ldots, x_n, y_1, \ldots, y_n \) can be expressed as functions of the \( 2n \) new variables \( \xi_1, \ldots, \xi_n, n_1, \ldots, n_n \). Let this substitution be called a "transformation of coordinates."
It has been seen in the preceding article that the original equations (I.) are changed by a transformation of coordinates into a system of the same form, which however cannot in general be obtained by merely expressing \( Z \) in terms of the new variables. But we are not at liberty to assume (and it is not generally true) that a change of the system (I.) into another of the same form is a normal transformation (art. 62.). It has already been stated, however (end of art. 65.), that this is true in the present case; a proposition which I proceed to establish.
69. **Every transformation of coordinates is a normal transformation.**
To prove this theorem, we have to show that every transformation of the kind described in the last article is also of the kind defined in art. 62; in other words, that it is possible to assign a function \( P \) of \( \xi_1, \ldots, \xi_n, y_1, \ldots, y_n \) (with or without \( t \)), such that the given relations between \( x_1, \&c., \xi_1, \&c., \) which define the transformation of coordinates, shall be equivalent to the system of equations
\[
\frac{dP}{dy_i} = x_i, \quad \frac{dP}{d\xi_i} = n_i.
\]
Take
\[
P = (x_1)y_1 + (x_2)y_2 + \ldots + (x_n)y_n. \quad \ldots \quad \ldots \quad \ldots \quad \ldots \quad \ldots \quad (82.)
\]
(the brackets indicating that \( x_1, \ldots, x_n \) are to be expressed in terms of \( \xi_1, \ldots, \xi_n \); so that \( P \) is a function of \( \xi_1, \ldots, \xi_n, y_1, \ldots, y_n \), with or without \( t \) according as the equations connecting \( x_1, \ldots, x_n \) with \( \xi_1, \ldots, \xi_n \) do or do not contain \( t \) explicitly). Then \( P \) is the func-
tion required. For we have, at once, \( \frac{dP}{dy_i} = x_i \); also
\[
\frac{dP}{d\xi_i} = y_1 \frac{d(x_1)}{d\xi_i} + y_2 \frac{d(x_2)}{d\xi_i} + \ldots + y_n \frac{d(x_n)}{d\xi_i};
\]
now the definition of \( n_i \) is \( n_i = \frac{dW}{d\xi_i} \), where
\[
W = -(Z) + x'_1 y_1 + x'_2 y_2 + \ldots + x'_n y_n
\]
(expressed in terms of \( \xi_1, \ldots \xi_n, \xi'_1, \ldots \xi'_n \) (art. 66.)); hence, putting \( Z \) (without brackets) for the original form of \( Z \) in terms of \( x_i, \&c., y_i, \&c. \), and observing that \( (Z) \) becomes a function of \( \xi_i, \&c. \), only through \( y_i, \&c. \), we have
\[
\frac{dW}{d\xi_i} = -\frac{dZ}{dy_1} \frac{dy_1}{d\xi_i} - \frac{dZ}{dy_2} \frac{dy_2}{d\xi_i} - \ldots - \frac{dZ}{dy_n} \frac{dy_n}{d\xi_i}
\]
\[
+ x_1 \frac{dx'_1}{d\xi_i} + x_2 \frac{dx'_2}{d\xi_i} + \ldots + x_n \frac{dx'_n}{d\xi_i}
\]
\[
+ y_1 \frac{dx'_1}{d\xi_i} + y_2 \frac{dx'_2}{d\xi_i} + \ldots + y_n \frac{dx'_n}{d\xi_i}.
\]
The two first lines of this expression vanish by virtue of the equations \( x'_i = \frac{dZ}{dy_i} \); and since
\[
x'_j = \frac{d(x_j)}{dt} + \frac{d(x_j)}{d\xi_1} \xi_1' + \frac{d(x_j)}{d\xi_2} \xi_2' + \ldots + \frac{d(x_j)}{d\xi_n} \xi_n',
\]
we have
\[
\frac{dx'_j}{d\xi_i} = \frac{d(x_j)}{d\xi_i};
\]
so that we have, finally,
\[
n_i = \frac{dW}{d\xi_i} = y_1 \frac{d(x_1)}{d\xi_i} + y_2 \frac{d(x_2)}{d\xi_i} + \ldots + y_n \frac{d(x_n)}{d\xi_i},
\]
which is evidently equivalent (see the expression (82.)) to
\[
n_i = \frac{dP}{d\xi_i};
\]
the proposition in question is thus established, and may be enunciated as follows:
70. Theorem IX.—Every transformation of coordinates is a normal transformation, of which the modulus * P is a function of \( \xi_1, \ldots \xi_n, y_1, \ldots y_n \) (with or without t) given by the equation
\[
P = (x_1)y_1 + (x_2)y_2 + \ldots + (x_n)y_n
\]
(the brackets indicating that \( x_1, \ldots x_n \) are to be expressed in terms of \( \xi_1, \ldots \xi_n \)). (See arts. 62, 68.)
This theorem will be made more intelligible by applying it to a very simple example.
Let it be proposed, then, to transform from rectangular to polar coordinates the differential equations of any dynamical problem referring to the motion of a single material point whose mass is \( m \). Let \( x, y, z \) be the rectangular coordinates of \( m \),
* See note on Theorem VIII. art. 64.
and \( u, v, w \) the variables conjugate to them; so that, putting
\[ T = \frac{1}{2}m(x^2 + y^2 + z^2), \]
we have \( u = \frac{dT}{dx}, \ldots \), &c.; whence
\[ T = \frac{1}{2m}(u^2 + v^2 + w^2), \]
and the equations of motion are
\[ x' = \frac{dZ}{du}, \]
\[ u' = -\frac{dZ}{dx}, \ldots \text{&c., where } Z = \frac{1}{2m}(u^2 + v^2 + w^2) - U, \]
and \( U \) is a given function of \( x, y, z \), with or without \( t \).
Now let \( r, \theta, \phi \) be polar coordinates of \( P \), so that
\[ x = r \sin \theta \cos \phi, \quad y = r \sin \theta \sin \phi, \quad z = r \cos \theta. \]
Let \( u, v, w \) be the variables conjugate to \( r, \theta, \phi \). Then the ordinary process of transformation would be as follows:
1. to express \( x', y', z' \) in terms of \( r, \theta, \phi, r', \theta', \phi' \), and thus transform \( T \) into a function of the latter quantities;
2. to define \( u, v, w \) by the equations
\[ u = \frac{dT}{dr'}, \quad v = \frac{dT}{d\theta'}, \quad w = \frac{dT}{d\phi'}, \]
and by means of these relations to express \( r', \theta', \phi' \) in terms of \( u, v, w, r, \theta, \phi \), so that \( x', y', z' \), and therefore, finally, \( T \) and \( Z \), might be expressed as functions of the six new variables.
Instead of this, let us adopt the method indicated by the theorem at the beginning of this article.
We have then, for the modulus of transformation,
\[ P = (x)u + (y)v + (z)w, \]
in which \( x, y, z \) are to be expressed in terms of \( r, \theta, \phi \); so that the proper form of \( P \) is
\[ P = ur \sin \theta \cos \phi + vr \sin \theta \sin \phi + wr \cos \theta, \]
and the equations (corresponding to \( z_i = \frac{dP}{d\xi_i} \) (art. 69.)) which define the new variables \( u, v, w \), are
\[ u = \frac{dP}{dr}, \quad v = \frac{dP}{d\theta}, \quad w = \frac{dP}{d\phi}. \]
These give
\[ u = \sin \theta(u \cos \phi + v \sin \phi) + w \cos \theta, \]
\[ v = r \cos \theta(u \cos \phi + v \sin \phi) - rw \sin \theta, \]
\[ w = -r \sin \theta(u \sin \phi - v \cos \phi), \]
from which the values of \( u, v, w \) are easily obtained in terms of the six new variables.
But in order to effect the transformation of \( T \), we have only to square each side of these equations, and add them, after dividing the second by \( r^2 \) and the third by \( (r \sin \theta)^2 \); we thus obtain
\[ u^2 + v^2 + w^2 = u^2 + \frac{v^2}{r^2} + \frac{w^2}{(r \sin \theta)^2}. \]
and the transformed equations of motion are therefore
\[
mr' = u, \quad mb' = vr^{-2}, \quad m\phi' = w(r \sin \theta)^{-2},
\]
\[
mu' = r^{-3}(v^2 + w^2(\sin \theta)^{-2}) + m \frac{dU}{dr},
\]
\[
mv' = (r \sin \theta)^{-3}w^2 \cos \theta + m \frac{dU}{d\phi}, \quad w' = \frac{dU}{d\phi}.
\]
The preceding process is not, of course, given for the sake of the result, which may easily be verified directly, but in order to illustrate the meaning of the theorem on which it depends. It is hardly necessary to add, that if the problem involved the independent rectangular coordinates of any number of material points, the transformation to polar coordinates would be effected in the same way, by merely adding to \( P \) analogous terms for each point.
71. Before proceeding further there is an important remark to be made.
It has been hitherto assumed that the modulus of transformation \( P \) was a function of no other quantities than the \( 2n \) variables \( \xi_1, \ldots, \xi_n, y_1, \ldots, y_n \) and \( t \). But if every step of the demonstrations of the theorems of transformation which have been given (art. 62, &c.) be examined, it will be seen that they continue to hold good in the following more general form.
Take
\[
P = f(\xi_1, \xi_2, \ldots \xi_n, y_1, y_2, \ldots y_n, p, q, r, \ldots, t),
\]
where \( p, q, r, \ldots \) are any functions of any or all the variables, old and new, with or without \( t \).
Let the equations connecting the old and new variables be, as before,
\[
\frac{dP}{dy_i}x_i, \quad \quad \frac{dP}{d\xi_i}=\eta_i \ldots \ldots \ldots (83.)
\]
with the condition that \( p, q, r, \ldots \) are exempt from differentiation in forming these equations.
Then take \( \Psi = -\left(\frac{d(P)}{dt}\right) \), with the following signification; (1) \( \frac{d(P)}{dt} \) denotes the differential coefficient of \( P \) with respect to \( t \), so far as \( t \) is contained explicitly, and also through the variables in \( p, q, r, \ldots \); that is to say,
\[
\frac{d(P)}{dt} = \frac{dp}{dt} + \frac{dp}{dx_1}x'_1 + \ldots + \frac{dp}{dq}q' + \ldots
\]
(where \( p'\) = \(\frac{dp}{dt} + \frac{dp}{dx_1}x'_1 + \ldots \) etc., but this substitution is not to be made at this stage);
(2) \( \frac{d(P)}{dt} \) denotes the result of substituting in the above expression the values of \( y_1, \ldots, y_n \) in terms of \( \xi_1, \ldots, \xi_n, \eta_1, \ldots, \eta_n, p, q, r, \ldots \) from (83.), so far as \( \frac{d(P)}{dt} \) contains \( y_1, \ldots, y_n \) explicitly (i.e. not involved in \( p, q, r, \ldots \)).
Lastly, take
\[
\Phi = (Z) + \Psi,
\]
where \((Z)\) denotes the result of substituting in \( Z \) the values of the old variables as
given in terms of \( \xi_1, \ldots \eta_1, \ldots p, q, r, \ldots \) from (82.). We shall then have
\[
\xi_i' = \frac{d\Phi}{d\eta_i}, \quad \eta_i' = -\frac{d\Phi}{d\xi_i}, \quad \ldots \ldots \ldots \ldots \quad (84.)
\]
where \( \Phi \) is in general a function of
\[
\xi_1, \ldots \xi_n, \eta_1, \ldots \eta_n, p, q, r, \ldots p', q', r', \ldots \text{and } t;
\]
and the differentiations with respect to \( \xi_i, \eta_i \) are performed only so far as those variables appear explicitly in \( \Phi \). But after these differentiations, we may introduce the actual values of \( p, \ldots p', \ldots \) in terms of the variables and their differential coefficients.
It is obvious that the original variables will not, in general, have been eliminated from the system (84.); but of course the elimination may be afterwards completed*. Similar considerations apply to that particular case of transformation which we have called a transformation of coordinates (art. 68.). We have then
\[
P = (x_1)y_1 + \ldots + (x_n)y_n,
\]
and the relations connecting \( x_1 \ldots x_n \) with \( \xi_1, \ldots \xi_n \), may contain \( p, p, r, \ldots \); so that \( (x_i), \&c. \) are functions of \( p, q, r, \ldots \) as well as of \( \xi_1, \ldots \xi_n \).
We might have deduced the preceding conclusions from the following simple consideration. Since \( p, q, r, \ldots \) are actually functions of \( t \), though unknown functions, we may imagine them to be known, and to be expressed explicitly in terms of \( t \); and then the case resolves itself into that of art. 62, &c., so far as the demonstration is concerned. But as a doubt might possibly have arisen whether any fallacy was involved in the circumstance that \( p, q, r, \ldots \) involve (when supposed to be expressed in terms of \( t \)) the arbitrary constants of the problem itself, it seemed best to refer to the original reasoning; the most important part of which is that contained (mutatis mutandis) in art. 6. (For the "mutanda" see the beginning of art. 63.) It is then apparent that this circumstance is perfectly immaterial with reference to the conclusions in question, though it may be important in other points of view.
72. This being premised, we will proceed to an example of transformation more interesting than the former, namely, the
**Transformation from fixed to moving axes of coordinates.**
Let \( x, y, z, u, v, w \) have the same signification as in art. 70, and let \( x, y, z, u, v, w \) be the new variables, where \( x, y, z \) are rectangular coordinates, referring to a system of moving axes of which the origin always coincides with that of the original fixed axes of \( x, y, z \).
Let the direction cosines of the new (moving) axes with respect to the old be \( \lambda_0, \mu_0, \nu_0; \lambda_1, \mu_1, \nu_1; \lambda_2, \mu_2, \nu_2 \), thus†:
* The final equations in this case will not in general have the canonical form.
† I do not know who first used this convenient way of indicating the nine direction cosines by a diagram, but I first saw it in one of M. LAMÉ's works.
where $\lambda_0$, &c. are functions of $t$, which may be either given explicitly, or implicitly through the variables (see the last article).
The modulus of transformation $P$ is found (art. 69.) by substituting for the variables $x$, $y$, $z$ in the expression $xu + yv + zw$, their values in terms of $x$, $y$, $z$; we have therefore
$$P = (\lambda_0 x + \lambda_1 y + \lambda_2 z)u + (\mu_0 x + \mu_1 y + \mu_2 z)v + (\nu_0 x + \nu_1 y + \nu_2 z)w,$$
and then the three equations
$$\frac{dP}{dx} = u, \quad \frac{dP}{dy} = v, \quad \frac{dP}{dz} = w,$$
give
$$\begin{align*}
u &= \lambda_0 u + \mu_0 v + \nu_0 w \\
v &= \lambda_1 u + \mu_1 v + \nu_1 w \\
w &= \lambda_2 u + \mu_2 v + \nu_2 w
\end{align*},$$
whence
$$\begin{align*}
u &= \lambda_0 u + \lambda_1 v + \lambda_2 w \\
v &= \mu_0 u + \mu_1 v + \mu_2 w \\
w &= \nu_0 u + \nu_1 v + \nu_2 w.
\end{align*}$$
(85.)
Also we have (see Theorem VIII. art. 64.), since $P$ is to be considered to contain $t$ explicitly through $\lambda_0$, &c., only,
$$\frac{dP}{dt} = (\lambda_0' x + \lambda_1' y + \lambda_2' z)u + (\mu_0' x + \mu_1' y + \mu_2' z)v + (\nu_0' x + \nu_1' y + \nu_2' z)w,$$
in which expression the values of $u$, $v$, $w$ in terms of the new variables are to be substituted from (85.). Now if we put $\omega_0$, $\omega_1$, $\omega_2$ for the angular velocities of the moving system of axes about the axes of $x$, $y$, $z$, respectively, so that
$$\omega_0 = \lambda_0' \lambda_1' + \mu_0' \mu_1' + \nu_0' \nu_1' = -(\lambda_1' \lambda_2' + \mu_1' \mu_2' + \nu_1' \nu_2'),$$
&c.,
it will be immediately seen that the usual relations between the nine direction cosines enable us to put the result of the substitution in the following form:
$$\left(\frac{dP}{dt}\right) = \omega_0(yw - zv) + \omega_1(zu - xw) + \omega_3(xv - yu).$$
The original differential equations
$$x' = \frac{dZ}{du}, \quad u' = -\frac{dZ}{dx}, \quad \text{&c.}$$
are then transformed into
$$x' = \frac{d\Phi}{du}, \quad u' = -\frac{d\Phi}{dx}, \quad \text{&c. (art. 64.),}$$
where
$$\Phi = (Z) - \left(\frac{dP}{dt}\right).$$
Introducing the above value of \(\frac{dP}{dt}\), and omitting the brackets, we obtain for the system of transformed equations,
\[
\begin{align*}
x' &= \frac{dZ}{du} + \omega_2 y - \omega_1 z, \\
y' &= \frac{dZ}{dv} + \omega_0 z - \omega_2 x, \\
z' &= \frac{dZ}{dz} + \omega_1 x - \omega_0 y,
\end{align*}
\]
in which \(Z\) is supposed to be expressed in terms of the new variables.
73. On the principles of the integration of this, and of transformed systems in general, I shall make some remarks hereafter. For the present, the following may be observed. If, in the transformation of the last article, we suppose the motion of the new axes given, then \(\lambda, \&c.\), and therefore also \(\omega_0, \omega_1, \omega_2\), are given explicit functions of \(t\). But if the motion of the new axes is only given by connecting it with the motion of the point \(m\) itself, then the above quantities are given functions of the variables and their differential coefficients.
The most interesting case of the latter kind is that in which the motion of the new axes is assumed to satisfy the equations
\[
\frac{\omega_0}{x} = \frac{\omega_1}{y} = \frac{\omega_2}{z},
\]
which express the condition that the instantaneous axis of rotation (of the moving axes) always coincides with the radius vector of the moving point \(m\).
The radius vector traces, in fixed space, a certain conical surface. It also traces, with reference to the moving axes, another conical surface; and we might always assume as one of the conditions defining their motion, that this latter should be any proposed surface; that is, we might assume that the new coordinates \(x, y, z\) should always satisfy the equation \(\phi(x, y, z) = 0\), \(\phi\) representing any given homogeneous function. If to this last assumption we add the two conditions expressed by the formula \((\omega)\), we further assume that the conical surface traced by the radius vector with reference to the moving axes, rolls upon that traced in fixed space.
Suppose, for example, we assume for the equation \(\phi(x, y, z) = 0\), simply \(z = 0\). This, with the conditions \((\omega)\), will express that the radius vector is always in the plane of \(xy\), and that this plane rolls upon the conical surface traced by the radius vector in fixed space. We may then say that the plane of \(xy\) is the "plane of the orbit," and that the axes of \(xy\), or any lines fixed with reference to them in their plane, are "fixed in the plane of the orbit."
* See Jacobi's first letter to Professor Hansen (Crelle's Journal, vol. xlii. p. 21). This letter appears to refer to some unpublished (?) results of Professor Hansen, which may possibly be similar to those of this article.
† The student of elementary treatises is, I believe, always left to find out for himself what this expression means, or ought to mean.
Now since we are supposing the equations to belong to a dynamical problem, we have \( u = mx' \), \( v = my' \), \( w = mz' \), and if we substitute on each side of these equations the values derived from (85.), art. 72, we find easily
\[
u = m(x' + \omega_1 z - \omega_2 y), \quad v = m(y' + \omega_2 x - \omega_1 z), \quad w = m(z' + \omega_0 y - \omega_1 x),
\]
relations which are true on all suppositions as to the motion of the axes; but the assumptions (\(\omega\)) reduce them to
\[
u = mx', \quad v = my', \quad w = mz'.
\]
The further assumption \( z = 0 \), which involves \( z' = 0 \), gives \( w = 0 \), and also (by the equations (\(\omega\))) \( \omega_2 = 0 \). Thus the equations (86.) are reduced to
\[
x' = \frac{dZ}{du}, \quad u' = -\frac{dZ}{dx}
\]
\[
y' = \frac{dZ}{dv}, \quad v' = -\frac{dZ}{dy}
\]
\[
0 = \frac{dZ}{dw}, \quad 0 = -\frac{dZ}{dz} + \omega_1 u - \omega_0 v,
\]
where in \( \frac{dZ}{du} \) and \( \frac{dZ}{dz} \), \( z \) and \( w \) are to be put \( = 0 \) after the differentiation. It is to be observed also, that the values of \( u, v, w \) above given reduce the three first of the equations (86.) to the form \( u = m \frac{dZ}{du}, v = m \frac{dZ}{dv}, w = m \frac{dZ}{dw} \), from which it is evident that
\[
Z = \frac{1}{2m} (u^2 + v^2 + w^2) - U,
\]
where \( U \) is the original force-function, expressed in terms of the new variables. This depends only upon the conditions (\(\omega\)); but in the case now considered we have also \( w = 0 \).
Let the origin of coordinates be the Sun; \( m \) a planet disturbed by another planet \( m_i \) whose original coordinates are \( x_p, y_p, z_p \), and new coordinates \( x_p, y_p, z_p \); also let \( x^2 + y^2 + z^2 = r^2 \), \( x'^2 + y'^2 + z'^2 = r'^2 \), \( x^2 + y^2 + z^2 = r^2 \), \( x'^2 + y'^2 + z'^2 = r'^2 \), and \( (x_i - x)^2 + (y_i - y)^2 + (z_i - z)^2 = \delta^2 \); we have evidently \( r^2 = r'^2 \), \( r^2 = r'^2 \), and
\[
(x_i - x)^2 + (y_i - y)^2 + (z_i - z)^2 = (x_i - x)^2 + (y_i - y)^2 + (z_i - z)^2,
\]
and
\[
xx_i + yy_i + zz_i = xx_i + yy_i + zz_i.
\]
We have then originally
\[
U = \frac{mu}{r} + mm_i \left( \frac{1}{\delta} - \frac{xx_i + yy_i + zz_i}{r'^3} \right),
\]
where \( \mu = m + \text{mass of Sun} \); and it is evident that this expression preserves the same form when expressed in terms of the new coordinates, and also (which is essential to the validity of what follows), that \( \frac{dU}{dx} \), &c. are the same whether the differentiation
be performed before or after the substitution for $x_i$, $y_i$, $z_i$, in terms of $x_p$, $y_p$, $z_p$. If then we put $z=0$ and
$$R=m_i\left(\frac{1}{\delta}-\frac{xx_i+yy_i}{r^3}\right),$$
we shall have, from the fourth and fifth of the transformed differential equations*,
$$\begin{align*}
x'' &= -\frac{\mu x}{r^3} + \frac{dR}{dx}, \\
y'' &= -\frac{\mu y}{r^3} + \frac{dR}{dy},
\end{align*}$$
from which it is evident, that, assuming the motion of $m_i$ to be known relatively to the new axes, the variations of the four elements of the orbit of $m$ which determine the dimensions of the orbit, its position relatively to lines fixed in its own plane, and the time of perihelion passage, will be expressed in terms of the differential coefficients of $R$ in the same way as if the plane of the orbit were fixed. But the motion of the node of the orbit upon the fixed plane of $xy$, and its inclination to that plane, must be determined by means of the last of the differential equations, as follows: that equation gives
$$\omega_1 u - \omega_0 v = \left(\frac{dZ}{dz}\right)(z=0);$$
or if we put $-\Omega$ for the term multiplied by $mm_i$ in the value of $U$ given above,
$$\omega_1 u - \omega_0 v = \frac{d}{dz}\left(-\frac{\mu}{r} + \Omega\right),$$
and $z$ is to be put $=0$ after the differentiation, which reduces the above to
$$\omega_1 u - \omega_0 v = \left(\frac{d\Omega}{dz}\right).$$
Let $\omega_0^2 + \omega_1^2 = \alpha^2$, so that $\alpha$ is the angular velocity of the plane of the orbit about the radius vector; then (observing that $u=mx'$, &c.) we have
$$\frac{\omega_0}{x} = \frac{\omega_1}{y} = \frac{\alpha}{r} = \frac{v\omega_0 - u\omega_1}{m(xy' - x'y')},$$
whence
$$v\omega_0 - u\omega_1 = \frac{ma}{r}(xy' - x'y')$$
$$= \frac{ma}{r}\sqrt{\mu a(1-e^2)}\frac{d\Omega}{dz},$$
and therefore
$$ma = \frac{r}{\sqrt{ua(1-e^2)}}\left(\frac{d\Omega}{dz}\right),$$
which gives $\alpha$ in terms of the four elements referred to above, and of $t$. And if we put $i$ for the inclination of the orbit to the plane of $xy$, $v$ for the longitude of the node.
* These equations (87.) have been obtained in a different way by Mr. Bronwin. Camb. Math. Journ. vol. iv. p. 245.
† See below, art. 80.
referred to the axis of \( x \), and \( \beta \) for the angle between the axis of \( x \) and the node, the usual formulæ of rotation give
\[
\begin{align*}
t' &= \omega_0 \cos \beta - \omega_1 \sin \beta \\
v' \sin t &= \omega_0 \sin \beta + \omega_1 \cos \beta \\
\beta' &= \omega_2 - v' \cos t
\end{align*}
\]
(88.)
If in these expressions we put \( \omega_0 = \frac{x}{r} \alpha \), \( \omega_1 = \frac{y}{r} \alpha \), \( \omega_2 = 0 \), and call \( \Omega \) the angle between the radius vector and the node, so that \( \omega_0 \cos \beta - \omega_1 \sin \beta = r \cos \Omega \), \( \omega_0 \sin \beta + \omega_1 \cos \beta = r \sin \Omega \), we obtain finally
\[
\begin{align*}
t' &= \alpha \cos \Omega \\
v' &= \alpha \frac{\sin \Omega}{\sin t} \\
\beta' &= -\alpha \sin \Omega \cot t,
\end{align*}
\]
in which the expression above given for \( \alpha \) is to be substituted.
The actual value of \( \left( \frac{d\Omega}{dx} \right) \) is \( mmz_i \left( \frac{1}{r^3} - \frac{1}{\delta^3} \right) \), which (since \( z_i \), \( r \), &c. are supposed given in terms of \( t \)) may be expressed in terms of the four elements first mentioned, and \( t \).
I propose to consider the transformation of the differential equations of the planetary theory in a more general manner in the following section. At present I shall add some remarks on normal transformations in general.
74. Theorem. If
\[
x'_i = \frac{dZ}{dy_i}, \quad y'_i = -\frac{dZ}{dx_i}
\]
(89.)
be a system of \( 2n \) simultaneous differential equations, where
\[
Z = f(x_1, y_1, x_2, y_2, \ldots p, q, r, \ldots, t),
\]
and \( p, q, r, \ldots \) are also explicit functions of \( x_i, \&c., y_i, \&c. \) and \( t \), but are exempt from differentiation in taking the differential coefficients \( \frac{dZ}{dx_i}, \frac{dZ}{dy_i} \); and if these equations be transformed by a normal substitution of new variables \( \xi_i, \&c., \eta_i, \&c. \) (art. 62, equation (73.)), then the transformed equations are, as in art. 64,
\[
\xi'_i = \frac{d\Psi}{d\eta_i} + \frac{dZ}{d\eta_i}, \quad \eta'_i = -\frac{d\Psi}{d\xi_i} - \frac{dZ}{d\xi_i},
\]
in which \( Z \) is expressed in terms of \( \xi_i, \&c., \eta_i, \&c. \), but the differentiations with respect to \( \xi_i, \eta_i \) are performed before the substitution of these variables in \( p, q, r, \&c.; \) in other words, \( p, q, r, \ldots \) are still to be exempt from differentiation in forming the differential equations.
This may be proved simply by repeating the reasoning of art. 64. The only difference is, that in the term
\[
\Sigma_j \left\{ \frac{dZ}{dx_j} \frac{dx_j}{d\eta_i} + \frac{dZ}{dy_j} \frac{dy_j}{d\eta_i} \right\}
\]
the differential coefficients \( \frac{dZ}{dx_j}, \frac{dZ}{dy_j} \) are now taken only so far as \( Z \) contains \( x_j, y_j \) inde-
pendently of \( p, q, r, \&c. \); and therefore the term represents the differential coefficient
\[
\frac{dZ}{dy_i} \quad \text{taken so far as } Z \text{ contains } z_i \text{ independently of } p, q, r, \&c.
\]
The same reasoning applies to the corresponding term in the value of \( z_i \). The theorem is thus established.
It is evident that it may be combined with that given in art. 71, where other functions analogous to \( p, q, r, \&c. \) are introduced by the modulus of transformation \( P \).
If we call the form of the system of differential equations (89.) canonical when the differentiations of \( Z \) with respect to \( x_1, \&c., y_1, \&c. \) are total, we might call it pseudo-canonical when \( Z \) contains functions of \( x_1, \&c., y_1 \), which are exempt from differentiation in forming the differential equations.
In like manner, if we call a transformation of variables normal, when the differentiations of the modulus \( P \) (equations (73.), art. 62.) with respect to \( \xi_1, \&c., y_1, \&c. \) are total (as in art. 62.), we might call the transformation pseudo-normal when \( P \) contains functions of the variables which are exempt from differentiation in forming the equations of transformation (as in art. 71.).
Adopting these designations, we may enunciate the following general theorem of transformation:
**Theorem X.**—If a pseudo-canonical system be transformed by a normal or pseudo-normal substitution, the transformed equations are also pseudo-canonical, and may be formed by the rules applying to normal transformations of canonical systems, provided that the functions which are originally exempt from differentiation with respect to the variables, be continued exempt to the end of the process; but if such functions occur in the modulus of transformation \( P \), they are subject to total differentiation with respect to \( t \) in forming the term \(\frac{dP}{dt}\). (See art. 71.)
[With respect to this theorem there is one important remark to be made. If \( u, v \) be any two functions of \( x_1, \&c., y_1, \&c. \) (with or without \( p, q, r, \&c. \) and \( t \)), the equation
\[
\Sigma_i \left( \frac{du}{dy_i} \frac{dv}{dx_i} - \frac{du}{dx_i} \frac{dv}{dy_i} \right) = \Sigma_i \left( \frac{du}{dy_i} \frac{dv}{d\xi_i} - \frac{du}{d\xi_i} \frac{dv}{dy_i} \right) \quad \text{(art. 63.)}
\]
is now only true on condition that the substitution of the actual values of \( p, q, r, \&c. \) in terms of the variables be not performed till after all the differentiations.]
75. The theory of the variation of elements affords an interesting example of the theorem given in the last article. Consider the following system of differential equations,
\[
x'_i = \frac{dZ}{dy_i} + \frac{d\Omega}{dy_i}, \quad y'_i = -\frac{dZ}{dx_i} - \frac{d\Omega}{dx_i}, \quad \ldots \ldots \ldots \ldots \quad (90.)
\]
where in \(\frac{dZ}{dx_i}, \frac{dZ}{dy_i} \) the differentiations are total, but \( \Omega \) is supposed to contain functions of \( x_1, \&c., y_1, \&c. \), which are exempt from differentiation in forming the above equations. The system
\[
x'_i = \frac{dZ}{dy_i}, \quad y'_i = -\frac{dZ}{dx_i};
\]
is canonical. Let us assume then that a complete set of normal integrals \(a_1, \ldots, a_n\), \(b_1, \ldots, b_n\) of this latter system is known, so that we have
\[a_i = \varphi_i(x_1, \&c., y_1, \&c., t), \quad b_i = \chi_i(x_1, \&c., y_1, \&c., t).\]
The assumption of these last equations to represent the solution of the complete system (90.) is simply a transformation of variables, \((a_1, \&c., b_1, \&c.\) being the new variables); it is also a normal transformation, since the equations connecting the new and old variables may be put in the form (see Theorem VII. art. 49, and art. 62.)
\[\frac{dX}{dx_i} = y_i, \quad \frac{dX}{da_i} = b_i,\]
where \(X\) (the modulus of transformation) is a function of \(x_1, \ldots, x_n, a_1, \ldots, a_n, t\). The function \(\Psi\) of art. 62 is now obtained by expressing \(-\frac{dX}{dt}\) in terms of \(a_1, \ldots, b_1, \ldots\); but since \(Z\) is \(-\frac{dX}{dt}\) expressed in terms of \(x_1, \&c., y_1, \&c.\), it follows that when \(Z\) is expressed in terms of the new variables \(a_1, \&c., b_1, \&c.\), it becomes identical with \(\Psi\).
Now if the process of art. 63 be followed, mutatis mutandis, it will be seen that in the present case we obtain
\[a'_i = \frac{d\Psi}{db_i} - \frac{dZ}{db_i} - \sum_j \left( \frac{d\Omega}{dy_j} \frac{dy_j}{db_i} \right),\]
in which expression the first two terms destroy one another, and the remaining term is evidently the differential coefficient of \(\Omega\) with respect to \(b_i\), taken so far as \(\Omega\) contains \(b_i\) independently of those functions which were exempt from differentiation in forming the original differential equations (90.). Similar reasoning applies to the expression for \(b'_i\).
As this result will be useful, I shall enunciate it separately as
**Theorem XI.**—If the original system of differential equations be formed by treating certain functions, \(p, q, r, \ldots\), contained in the disturbing function \(\Omega\), as exempt from differentiation with respect to \(x_1, \&c., y_1, \&c.\), the equations which determine the variations of any set of normal elements \(a_1, \&c., b_1, \&c.\)
\[a'_i = -\frac{d\Omega}{db_i}, \quad b'_i = \frac{d\Omega}{da_i}\]
on condition that \(p, q, r, \ldots\) be treated, in forming these equations, as exempt from differentiation with respect to \(a_1, \&c., b_1, \&c.\).
[It is important to recollect, that after these equations are formed, \(p, q, r, \&c.\) are to be expressed in terms of \(a_1, \&c., b_1, \&c.\), and in the integration of the system \(a_1, \&c., b_1, \&c.\) are to be treated indiscriminately as variables, whether they originally entered through \(p, q, r, \ldots\) or not].
The Theorem XI. may also be immediately obtained from the general equations (E.) of art. 52 (in which it is to be remembered that \(Z\) includes the disturbing
function). The above method of deducing it is given as an additional illustration of the general theory of transformation.
If, instead of the normal elements \(a_1, \&c., b_1, \&c.\), we employ any other elements, \(c_1, c_2, \&c.\), which can be expressed as functions of the former, the formula (69.) of art. 54 will still be applicable, with the condition that \(p, q, r, \&c.\) are exempt from differentiation with respect to \(c_1, c_2, \&c.\).
**Section VII.—On the Differential Equations of the Planetary Theory.**
76. The differential equations which determine the motions of a system of mutually attracting material points relatively to one of them, do not, as is well known, naturally present themselves under the canonical form (I.), art. 49. It is possible indeed to reduce them, by different artifices, to that form; but it seems doubtful whether any practical advantage is gained by doing so. When the ordinary method is followed in the case of a planetary system referred to the sun, there is a distinct disturbing function for each planet; but it is easily seen that the usual expressions for the variations of the elements hold good, not merely for each planet on the hypothesis that the motions of the rest are known, but as a complete and rigorous set of simultaneous differential equations involving all the elements of all the orbits, and their differential coefficients with respect to \(t\) (and containing of course also \(t\) explicitly). It does not appear that we are practically farther from the attainment of the rigorous integration of this system, than we should be if it had the canonical form, as it might be made to have if it were derived from an original system of that form; and so far as the development of the disturbing functions is concerned, the most troublesome part of them, which is that depending on the mutual distances of the planets, is not likely to be got rid of by any conceivable artifice.
However this may be, all that I propose to do in the present section is to take the original differential equations in their ordinary form referred to rectangular axes passing through the sun and parallel to fixed directions, and then to exhibit in a general manner the effect of a transformation to new rectangular axes, still passing through the sun, but changing their directions in space according to any arbitrary law.
77. Let \(M\) be the mass of the sun, and \(m, m_p, m_n, \&c.\), the masses of the planets; and put \(M+m=\mu, M+m_p=\mu_p, M+m_n=\mu_n, \ldots M+m_{(i)}=\mu_{(i)}\). And, referred to the original axes, let \(x, y, z\) be the coordinates of \(m, x_p, y_p, z_p\) of \(m_p, \ldots, x_{(i)}, y_{(i)}, z_{(i)}\) of \(m_{(i)}\). Also let \(R, R_p, \ldots R_{(i)}\) have their usual significations, so that
\[
R = \sum \left[ \frac{m_{(i)}}{\left((x_{(i)}-x)^2+(y_{(i)}-y)^2+(z_{(i)}-z)^2\right)^{3/2}} - \frac{m_{(i)}(xx_{(i)}+yy_{(i)}+zz_{(i)})}{(x^2_{(i)}+y^2_{(i)}+z^2_{(i)})^{3/2}} \right],
\]
the summation extending to all the planets except \(m\).
Then if we put \(x^2_{(i)}+y^2_{(i)}+z^2_{(i)}=r^2_{(i)}\), the original differential equations of the second order are such as \(x''_{(i)}+\mu_{(i)}\frac{x_{(i)}}{r^3_{(i)}}=\frac{dR_{(i)}}{dx_{(i)}}\). Let \(u_{(i)}, v_{(i)}, w_{(i)}\) be the variables conjugate to
Then instead of the original system of differential equations of the second order, we have the following system of the first order:
\[ x'_i = \frac{dZ_i}{du_i}, \quad y'_i = \frac{dZ_i}{dv_i}, \quad z'_i = \frac{dZ_i}{dw_i} \]
\[ u'_i + \frac{dZ_i}{dx_i} = 0, \quad v'_i + \frac{dZ_i}{dy_i} = 0, \quad w'_i + \frac{dZ_i}{dz_i} = 0, \]
in which
\[ Z_i = \frac{1}{2m_i}(u_i^2 + v_i^2 + w_i^2) - m_i\left(\frac{\mu_i}{r_i} + R_i\right). \]
These equations are not of the canonical form, because \( R_i \) is not the same for all the planets. But it is easy to put them in the pseudo-canonical form (art. 74.), a process which is not necessary, but saves trouble by bringing them under the operation of the general rules of transformation established in former articles.
In fact, if we take
\[ Z = \sum \left( \frac{u_i^2 + v_i^2 + w_i^2}{2m_i} - \frac{m_j\mu_j}{r_{ij}} \right) \]
\[ Q = \sum m_i m_j \left( \frac{x_i x_j + y_i y_j + z_i z_j}{r_{ij}^2} + \frac{y_i x_j + y_j x_i + z_i z_j}{r_{ij}^2} + \frac{z_i x_j + z_j x_i + y_i y_j}{r_{ij}^2} \right) \]
where the summation in the first term extends to all the planets, and in the second to all their binary combinations, and a horizontal line placed over any letter indicates exemption from differentiation, we shall have
\[ x'_i = \frac{dZ}{du_i} + \frac{dQ}{du_i}, \quad u'_i = -\frac{dZ}{dx_i} - \frac{dQ}{dx_i}, \ldots \ldots \ldots \ldots \quad (91.) \]
with similar equations for \( y'_i, z'_i, v'_i, w'_i \).
[The terms \( \frac{dQ}{du_i} \) &c. are only written for the sake of uniformity, being really \( = 0 \), since \( Q \) does not involve \( u_i, v_i, w_i \).]
In these equations \( Z \) and \( Q \) are the same for the whole system, and the differentiations of \( Z \) are total; but those of \( Q \) are restricted to the quantities not marked by the horizontal line, so that \( \frac{dQ}{dx_i} \) is really the same thing as \( -m_i \frac{dR_i}{dx_i} \).
78. Let us now refer the whole system to new (moving) rectangular axes, whose position at any instant with respect to the original axes is defined, as in art. 72, by the variable direction-cosines \( \lambda_1, \lambda_2, \lambda_3 \). Let \( x_i, y_i, z_i \) be the new coordinates, and \( u_i, v_i, w_i \) the new variables conjugate to them. The transformation will be effected, as in art. 72, by taking for the modulus
\[ P = \sum (\lambda_1 x_i + \lambda_2 y_i + \lambda_3 z_i) u_i + (\mu_1 x_i + \mu_2 y_i + \mu_3 z_i) v_i + (\nu_1 x_i + \nu_2 y_i + \nu_3 z_i) w_i, \]
and the result will be as follows: put
\[ \Omega = Q + \bar{\omega}_0 \sum (z_i v_i - y_i w_i) + \bar{\omega}_1 \sum (x_i w_i - z_i u_i) + \bar{\omega}_2 \sum (y_i u_i - x_i v_i) \]
(where $Q$ is expressed in terms of the new variables, and $\omega_0$, $\omega_1$, $\omega_2$ (the angular velocities of the moving system of axes about the three moving axes themselves) are marked with the horizontal line to show that these quantities are exempt from differentiation in forming the following system of differential equations, though they may be functions of the variables); then the system (91.) is transformed into
$$x'_i = \frac{dZ}{du_i} + \frac{d\Omega}{du_i}, \quad u'_i = -\frac{dZ}{dx_i} - \frac{d\Omega}{dx_i}, \quad \ldots \ldots \ldots \quad (92.)$$
with similar equations for $y'_i$, &c.
In these equations $Z$ is to be expressed in terms of the new variables; and it is evident from the original form of $Z$ and $Q$, that when so expressed, these two quantities are the same functions of the new variables that they were of the old, and involve (see art. 74.) the quantities exempt from differentiation in the same way*. Thus the transformed system (92.) contains no terms explicitly depending upon the motion of the axes, except those introduced by the three terms multiplied by $\omega_0$, $\omega_1$, $\omega_2$ in the value of $\Omega$ given above; and the addition of these terms constitutes the only difference between the form of the old and of the new system.
79. We may now apply the method of the variation of elements to the system (92.) as follows:
The system obtained by omitting the disturbing function $\Omega$, namely,
$$x'_i = \frac{dZ}{du_i}, \quad y'_i = \frac{dZ}{dv_i}, \quad z'_i = \frac{dZ}{dz_i},$$
$$u'_i + \frac{dZ}{dx_i} = 0, \quad v'_i + \frac{dZ}{dy_i} = 0, \quad w'_i + \frac{dZ}{dz_i} = 0 \quad \ldots \ldots \ldots \quad (93.)$$
is canonical, and consists simply of the aggregate of the equations representing, for each planet, undisturbed elliptic motion about the sun † (relatively to the new axes of coordinates).
The integrals of these equations may therefore be expressed in any of the usual forms. We will suppose that the elements chosen are
$a, e, \varpi, (\varepsilon), i, \nu,$
with significations corresponding to those given to the same symbols in art. 55. These letters unaccented will apply to the planet $m$, and $a_i, e_i, \&c., a_{ii}, e_{ii}, \&c., a_{(i)}, e_{(i)}, \&c.$, to the planets $m_i, m_{ii}, m_{(i)}, \&c.$
The definitions of the elements $a, e, \varpi, \&c.$ are their expressions in terms of the six
* Since the direction cosines $\lambda_0, \&c.$ are exempt from differentiation in forming the equations connecting the old and new variables from the modulus $P$, they continue exempt throughout. (Theorem X. art. 74.) Hence we have
$$xx_i + yy_i + zz_i = (\lambda_0 x + \lambda_1 y + \lambda_2 z)(\lambda_0 x_i + \lambda_1 y_i + \lambda_2 z_i) + \&c. = xx_i + yy_i + zz_i,$$
and similarly for the rest.
† $Z = \Sigma \left( \frac{u^2 + v^2 + w^2}{2m} - \frac{\mu m}{r} \right)$, the summation extending to all the planets.
variables \( x, y, z, u, v, w \) *, and \( t \), and the same expressions continue to be their definitions when they become the variable elements of the disturbed motion.
Now the general formulæ for the variations of the set of elements here chosen have already been given in art. 55; for it is evident that the process in the present case would merely be a repetition, for each planet, of the process there employed. Here however we are to use in every case the disturbing function \( \Omega \) given in art. 78; but if we observe the effect of the marks of exemption, it will be evident that, for the planet \( m \), the only effective part of \( \Omega \) is
\[
\bar{\omega}_0(zv - yw) + \bar{\omega}_1(xw - zu) + \bar{\omega}_2(yu - xv) - mR;
\]
and similarly the effective part of \( \Omega \) for any other planet will be given by suffixing the corresponding number of accents to \( x, y, z, u, v, w, m, R \).
Now if we put \( A_0, A_1, A_2 \) for the terms multiplied respectively by \( \omega_0, \omega_1, \omega_2 \) in the above expression, we have, by the definitions of the elements,
\[
(A_0^2 + A_1^2 + A_2^2)^{1/2} = m\sqrt{\mu a(1-e^2)}
\]
\[
A_2 = -m\sqrt{\mu a(1-e^2)} \cdot \cos t, \quad A_1 = m\sqrt{\mu a(1-e^2)} \cdot \cos v \sin t
\]
\[
A_0 = -m\sqrt{\mu a(1-e^2)} \cdot \sin v \sin t,
\]
so that the disturbing function, so far as \( m \) is concerned, becomes
\[
-m\{R + \sqrt{\mu a(1-e^2)}(\bar{\omega}_0 \sin v \sin t - \bar{\omega}_1 \cos v \sin t + \bar{\omega}_2 \cos t)\}. \quad . . . . (94.)
\]
Consequently, since the expressions in art. 55 were obtained by taking \(-mR\) for the disturbing function, we have merely to add to them the additional terms derived from the part added to \( R \) in (94.). Performing the differentiations, and omitting afterwards the symbols of exemption over \( \omega_0, \omega_1, \omega_2 \) which cease to be of any use, we obtain, after obvious reductions, the following simple results: if \( \frac{\partial a}{dt}, \frac{\partial e}{dt}, \&c. \) represent those parts of the differential coefficients of \( a, e, \&c., \) with respect to \( t, \) which depend upon the motion of the axes of coordinates, then
\[
\begin{align*}
\frac{\partial a}{dt} &= 0, \\
\frac{\partial e}{dt} &= 0 \\
\frac{\partial (\varepsilon)}{dt} &= \frac{\partial \varphi}{dt} = \tan \frac{1}{2}(\omega_1 \cos v - \omega_0 \sin v) - \omega_2 \\
\frac{\partial i}{dt} &= -(\omega_0 \cos v + \omega_1 \sin v) \\
\frac{\partial v}{dt} &= -\omega_2 + \cot t (\omega_0 \sin v - \omega_1 \cos v)
\end{align*}
\]
where it is evident that we may write \( \varepsilon \) instead of \( (\varepsilon) \) (see art. 55.).
* Not their expressions in terms of \( x, y, z, x', y', z' \); for though these are equivalent in the undisturbed equations, they are not here equivalent in the disturbed equations, and therefore the general theory, which assumes the former mode of expression, is not here applicable to the latter.
The complete variations of the elements are then found by adding to the terms just written the expressions given in art. 25.
It is easily seen that the expressions (95.) might have been deduced from geometrical considerations alone, if we had been at liberty to assume beforehand that the mechanical and geometrical parts of the variations might be calculated separately; the former as if the axes were at rest, and the latter as if there were no disturbing forces. It would not, I believe, be difficult to establish by *a priori* and simple reasoning the validity of such an assumption, and then the above results would only serve as a verification of the method which has been employed to obtain them.
80. In order however that no obscurity may rest upon the interpretation of the formulæ obtained in the last articles, it is necessary to consider the physical (or rather geometrical) meaning of the elements $a$, $e$, &c., which we have so far only defined by means of their expressions in terms of the variables $x$, $y$, $z$, $u$, &c., and to ascertain what relation they bear to the elements similarly defined by means of the original variables $x$, $y$, &c., which refer to axes whose directions are invariable.
The relations between the variables ((85.), art. 72.) give immediately
$$x^2 + y^2 + z^2 = x^2 + y^2 + z^2$$
$$u^2 + v^2 + w^2 = u^2 + v^2 + w^2$$
$$ux + vy + zw = ux + vy + zw;$$
and if we put
$$yw - zv = A, \quad zu - xw = B, \quad xv - yu = C$$
$$yw - zv = A, \quad zu - xw = B, \quad xv - yu = C,$$
we find, by virtue of the relations $\mu_1\nu_2 - \nu_1\mu_2 = \lambda_0$, &c., the following equations:
$$A = \lambda_0 A + \mu_0 B + \nu_0 C$$
$$B = \lambda_1 A + \mu_1 B + \nu_1 C$$
$$C = \lambda_2 A + \mu_2 B + \nu_2 C$$
and
$$A^2 + B^2 + C^2 = A^2 + B^2 + C^2.$$
Now $A (= yw - zv = m(yz' - zy'))$ is the projection on the plane of $yz$ of the areal velocity of $m$ (relative to fixed space) multiplied by the mass, and $B$, $C$ have analogous meanings; hence it is evident from the above equations that $A$, $B$, $C$ are the projections on the three moving coordinate planes of $yz$, $zx$, $xy$ of the absolute areal velocity of $m$ relative to fixed space, multiplied by the mass. (The projections of the areal velocity relative to the moving axes would be $yz' - zy'$, &c., which are not proportional to $yw - zv$, &c., since $u$, $v$, $w$ are not the same as $mx'$, $my'$, $mz'$, except on a particular hypothesis as to the motion of the axes. See art. 73.).
Inasmuch as the definitions of the elements $a$, $e$, $i$ involve the variables only in the forms $x^2 + y^2 + z^2$, $u^2 + v^2 + w^2$, $A$, $B$, $C$, it follows that these three elements are respectively the semiaxes, eccentricity, and inclination to the plane of $xy$, of the absolute osculating ellipse of the orbit in fixed space. Thus the instantaneous ellipse, relatively to the moving axes, is of the same dimensions and in the same plane as the true
osculating ellipse; and it only remains to show that it coincides with the latter in position, for which purpose we must prove that it touches the true orbit. (It does not in general touch the relative orbit, because $mx', my', mz'$ are not in general the same functions of the elements and $t$ that $u, v, w$ are.)
81. If we suppose the coordinates $x, y, z$ of $m$ expressed in terms of the elements $a, e, \&c.$ and $t$, and denote by $\frac{dx}{dt}, \&c.$ their differential coefficients taken so far as $t$ appears explicitly in these expressions, then $\frac{dx}{dt}, \frac{dy}{dt}, \frac{dz}{dt}$ are proportional to the direction cosines, relatively to the axes of $x, y, z$, of the tangent to the (relative) instantaneous ellipse. And therefore the direction cosines of this same tangent referred to the fixed axes of $x, y, z$, are proportional respectively to
$$\lambda_0 \frac{dx}{dt} + \lambda_1 \frac{dy}{dt} + \lambda_2 \frac{dz}{dt}, \quad \mu_0 \frac{dx}{dt} + \mu_1 \frac{dy}{dt} + \mu_2 \frac{dz}{dt}, \quad \nu_0 \frac{dx}{dt} + \nu_1 \frac{dy}{dt} + \nu_2 \frac{dz}{dt}.$$
On the other hand, the direction cosines of the tangent to the absolute instantaneous ellipse, referred to the fixed axes, are proportional to $\frac{dx}{dt}, \frac{dy}{dt}, \frac{dz}{dt}$ (the differential coefficients of $x, y, z$, taken so far as $t$ appears explicitly in the expressions for those variables in terms of the elements of the absolute ellipse). The identity of the two tangents will therefore be established, if we can show that
$$\frac{dx}{dt} = \lambda_0 \frac{dx}{dt} + \lambda_1 \frac{dy}{dt} + \lambda_2 \frac{dz}{dt}, \&c.$$
Now $\frac{dx}{dt}$ is that part of $x'$ which does not depend upon the disturbing function; i.e. (equations (92.), art. 78.) we have identically
$$\frac{dx}{dt} = \frac{dZ}{du}, \quad \frac{dy}{dt} = \frac{dZ}{dv}, \quad \frac{dz}{dt} = \frac{dZ}{dw};$$
and, in like manner,
$$\frac{dx}{dt} = \frac{dZ}{du}, \quad \frac{dy}{dt} = \frac{dZ}{dv}, \quad \frac{dz}{dt} = \frac{dZ}{dw},$$
where $Z$ is the same as before, but expressed in terms of $x, y, z, u, v, w$, instead of $x, y, z, u, v, w$. Now let the latter set of variables be expressed in terms of the former by the formulæ of art. 72, and we have
$$\frac{dZ}{du} = \frac{dZ}{du} \frac{du}{du} + \frac{dZ}{dv} \frac{dv}{du} + \frac{dZ}{dw} \frac{dw}{du}, \&c.,$$
but
$$\frac{du}{du} = \lambda_0, \quad \frac{dv}{du} = \lambda_1, \quad \frac{dw}{du} = \lambda_2 ((85.), art. 72.), \&c.,$$
and therefore
$$\frac{dx}{dt} = \lambda_0 \frac{dx}{dt} + \lambda_1 \frac{dy}{dt} + \lambda_2 \frac{dz}{dt}, \&c.,$$
as was to be proved.
82. It follows, then, that the mode of treating the problem adopted in the preceding articles is equivalent to representing the motion of each planet by means of the true osculating ellipse of its actual orbit (relatively to the sun) in fixed space.
The definitions of all the elements (relative to the moving axes) in terms of the six new variables \( x, y, z, u, v, w \), have the same form as those of the corresponding elements (relative to the fixed axes) in terms of \( x, y, z, u, v, w \). The two relative elements \( a, e \) are the same as the corresponding absolute elements; \( i \) is the inclination of the plane of the ellipse to the moving plane of \( xy \), and \( \nu \) the longitude of the node reckoned from the axis of \( x \); and since the place of the body in the ellipse is evidently the same, the relations between the remaining elements \( \varpi \) and \( (\varepsilon) \) (or \( \varepsilon \)) and the corresponding absolute elements are purely geometrical.
Comparing these results with those of art. 79, we see that the independence of the formulæ for the mechanical and geometrical variations of the elements of the true osculating ellipse is completely established.
83. In all that precedes, the three variables \( \omega_0, \omega_1, \omega_2 \) (the angular velocities of the system of moving axes about the axes themselves) are entirely arbitrary; they may be either explicit functions of \( t \), involving only determinate constants, or they may depend in any way upon the relative or absolute elements of the orbits of any or all the planets, and their differential coefficients with respect to \( t \). In the case in which the expressions for \( \omega_0, \omega_1, \omega_2 \) involve only the relative elements, when these expressions are introduced in the formulæ (95.), art. 79, and these formulæ completed by the addition of the terms in art. 55, and when the corresponding sets of equations are formed for each planet, we obtain a set of simultaneous differential equations involving all the elements of all the orbits, and their differential coefficients with respect to \( t \). The integration of these equations would determine all the elements as functions of \( t \), and thus the motion of all the planets, relatively to the axes of coordinates, would be known. Lastly, the motion of the whole system, relatively to fixed space, would be found by integrating the system of equations
\[
\begin{align*}
L' &= \omega_0 \cos X - \omega_1 \sin X \\
N' \sin L &= \omega_0 \sin X + \omega_1 \cos X \\
X' &= \omega_2 - N' \cos L
\end{align*}
\]
where \( \omega_0, \omega_1, \omega_2 \) are now given functions of \( t \), and \( L \) is the inclination of the plane of \( xy \) to that of \( xy \), \( N \) is the longitude of the ascending* node of the plane of \( xy \), reckoned from the axis of \( x \), and \( X \) is the longitude of the axis of \( x \), reckoned upon the plane of \( xy \), from the node, in the direction of positive rotation.
In the case in which \( \omega_0, \omega_1, \omega_2 \) cannot be expressed in terms of the relative elements, the integrations which determine the relative motion of the system cannot be separated from those which determine the position of the axes in fixed space; but the equations (96.) must be considered simultaneously with the other differential equations of the problem.
* Ascending relatively to a positive rotation, i.e. from \( x \) to \( y \).
Application to the Problem of Three Bodies.
84. I propose to exemplify the preceding principles by applying them to the transformation of the equations which determine the motion of three mutually attracting bodies, considered as material points. Let them be called the sun and two planets, and let the origin of coordinates be placed at the sun, and the notation be the same as in arts. 77–79, so that $M, m, m_i$ are the three masses, $M + m = \mu$, $M + m_i = \mu_i$, &c. Let the elements be chosen as in art. 55, and longitudes be measured, as before, along the plane of $xy$ (from the axis of $x$) as far as the node, and then along the plane of the orbit.
Putting for convenience $mR = \Omega$, $m_i R_i = \Omega_i$, we have
$$\Omega = mm_i \left( r^2 + r_i^2 - 2rr_i \cos \chi \right)^{-\frac{1}{2}} - rr_i^{-2} \cos \chi,$$
$$\Omega_i = mm_i \left( r^2 + r_i^2 - 2rr_i \cos \chi \right)^{-\frac{1}{2}} - rr_i^{-2} \cos \chi,$$
where $\chi$ is the angle between the two radii vectores. Let $I$ be the mutual inclination of the planes of the two orbits, and let the angular distances, on the planes of the orbits, between their ascending nodes on the plane of $xy$ and their line of intersection, be respectively $\nu, \nu_i$; so that $\theta - \nu - \nu_i, \theta_i - \nu_i - \nu$ are the angular distances of the two radii vectores from the line of intersection (which we may call, simply, the line of nodes). We shall then have
$$\cos \chi = \cos (\theta - \nu - \nu_i) \cos (\theta_i - \nu_i - \nu) + \sin (\theta - \nu - \nu_i) \sin (\theta_i - \nu_i - \nu) \cos I,$$
$$\cos I = \cos \nu \cos \nu_i + \sin \nu \sin \nu_i \cos (\nu_i - \nu),$$
and $\nu, \nu_i$ are functions of $\iota, \iota_i, \nu_i - \nu$, determined by the equations
$$\cot \nu \sin (\nu_i - \nu) = - \cot \nu_i \sin \iota + \cos (\nu_i - \nu) \cos \iota,$$
$$\cot \nu_i \sin (\nu_i - \nu) = \cot \nu \sin \iota_i + \cos (\nu_i - \nu) \cos \iota_i.$$
Now considering $\Omega$ as expressed, on the one hand, in terms of $r, r_i, \theta, \theta_i, \nu, \nu_i, \iota, \iota_i$, and on the other in terms of all the elements and $t$, we have rigorously, as may easily be proved in the usual way†,
$$\frac{d\Omega}{d\theta} = \frac{d\Omega}{d\varepsilon} + \frac{d\Omega}{d\varpi}, \quad \frac{d\Omega}{d\theta_i} = \frac{d\Omega}{d\varepsilon_i} + \frac{d\Omega}{d\varpi_i},$$
* The arrangement referred to here and in the following articles will be made clear by the accompanying diagram.
† Since the values of $r$ and $\theta$ in terms of the elements and $t$ are of the forms
$$r = f(\int ndt + \varepsilon - \varpi), \quad \theta = f(\int ndt + \varepsilon - \varpi),$$
we have
$$\frac{dr}{d\varepsilon} + \frac{dr}{d\varpi} = 0, \quad \frac{d\theta}{d\varepsilon} + \frac{d\theta}{d\varpi} = 1;$$
from which the equations in the text follow immediately. It may be as well to remind the reader who may happen to recollect the note to the Astronomer Royal's tract on the Planetary Theory (p. 91, ed. 1831), that that note refers to a different way of measuring longitudes.
with similar equations for $\Omega_i$. In what follows, I shall, as an abridgment, employ the symbol $E$ to denote the operation $\frac{d}{ds} + \frac{d}{d\omega}$, and in like manner I shall put $E_i$ for $\frac{d}{ds_i} + \frac{d}{d\omega_i}$, so that
$$E\Omega = \frac{d\Omega}{ds} + \frac{d\Omega}{d\omega}, \text{ &c.} \ldots \ldots \ldots \ldots \ldots (99.)$$
Since $r$ and $\theta$, when expressed in terms of the elements and $t$, do not contain $\iota$, $\iota_p$, $\nu$, $\nu_p$, we have
$$\frac{d\Omega}{d\nu} = \frac{d\Omega}{d\cos \chi} \cdot \frac{d\cos \chi}{d\nu}, \quad \frac{d\Omega}{dt} = \frac{d\Omega}{d\cos \chi} \cdot \frac{d\cos \chi}{dt}$$
and
$$\frac{d\cos \chi}{d\nu} = -(1 + \frac{du}{d\nu}) \frac{d\cos \chi}{d\theta} - \frac{du}{d\nu} \frac{d\cos \chi}{d\theta_i}$$
$$+ \sin \iota \sin \iota_p \sin (\theta - \nu - \nu) \sin (\theta_i - \nu_i - \nu_i) \sin (\nu_i - \nu),$$
in which expression the values of $\frac{du}{d\nu}$, $\frac{du}{d\nu_i}$ are to be obtained by differentiating the equations (98.). In like manner we find
$$\frac{d\cos \chi}{dt} = -\frac{du}{dt} \frac{d\cos \chi}{d\theta} - \frac{du}{dt} \frac{d\cos \chi}{d\theta_i}$$
$$+ \sin (\theta - \nu - \nu) \sin (\theta_i - \nu_i - \nu_i)(-\sin \iota \cos \iota + \cos \iota \sin \iota \cos (\nu_i - \nu)).$$
Analogous expressions may be found for $\frac{d\Omega}{d\nu_i}$, $\frac{d\Omega}{dt_i}$.
85. Hitherto we have assumed nothing concerning the motion of the axes of coordinates. Let us now however take as a first assumption that the plane of $xy$ shall always pass through the line of nodes. This implies the conditions*
$$\nu_i = \nu, \quad \nu = 0, \quad \nu_i = 0,$$
and consequently $I = \iota - \iota$.
Introducing these conditions in the expressions at the end of the last article, and
* The legitimacy of the following processes will be apparent to the reader who shall have followed the general reasoning of former articles, though probably not to others. In either case it may be useful here briefly to recapitulate the principles now to be applied. The results of art. 79, and in particular the expressions (94.), (95.), were established independently of any assumption as to the values of $\omega_0$, $\omega_1$, $\omega_2$, which are perfectly arbitrary. We are therefore at liberty to assume that $\omega_0$, $\omega_1$, $\omega_2$ are such functions of $t$ that any three functions of the variables shall constantly $= 0$. Thus the first assumption made in the text is that $\nu_i - \nu$ shall constantly $= 0$. But since such assumptions may cause (as in this case) certain elements to disappear from the expression of $\Omega$, it is necessary to perform the differentiations of $\Omega$ with respect to such elements first; the differential coefficients may or may not vanish, on afterwards introducing the assumptions; if not, we find expressions for them in terms of differential coefficients with respect to other elements which do not disappear; so that instead of differentiations which ought to be performed before the assumed conditions are introduced, we have finally only such as may be performed afterwards. The expression "disappear" does not, it must be observed, necessarily mean the same as "vanish." Thus the condition $\nu_i - \nu = 0$, causes $\nu$ and $\nu_i$ both to disappear from $\Omega$, but does not of course imply that they both vanish.
observing that \( \frac{d\Omega}{d\theta} = E\Omega, \) &c. (see equation (99.)), we have evidently
\[
\frac{d\Omega}{dv} = -\left(1 + \left(\frac{du}{dv}\right)\right)E\Omega - \left(\frac{du}{dv}\right)E_i\Omega
\]
\[
\frac{d\Omega}{di} = -\left(\frac{du}{di}\right)E\Omega - \left(\frac{du}{di}\right)E_i\Omega - \frac{d\Omega}{dT};
\]
and it only remains to find the values of \( \left(\frac{du}{dv}\right), \) &c., namely, the values of \( \frac{du}{dv}, \) &c., which correspond to the assumed conditions \( v_i - v = 0, \) \( v = v_i = 0. \)
Now in any spherical triangle of which \( a, b, c \) are the sides, and \( A, B, C \) the opposite angles, if \( a \) be considered as a function of \( c, A, B, \) by virtue of the equation
\[
\cot a \sin c = \cot A \sin B + \cos c \cos B,
\]
we have by differentiation
\[
\frac{da}{dc} = (\sin a)^2 (\cot a \cot c + \cos B)
\]
\[
= \cos a \sin a \cot c + (1 - (\cos a)^2) \cos B = \cos B + \cos a \cot C \sin B
\]
\[
\frac{da}{dA} = \left(\frac{\sin a}{\sin A}\right)^2 \frac{\sin B}{\sin c} = \frac{\sin a \sin B}{\sin C}
\]
\[
\frac{da}{dB} = \frac{(\sin a)^2}{\sin c} \cot A \cos B + \cot c (\sin a)^2 \sin B = \sin a \cot C;
\]
and if the sides of the triangle be indefinitely diminished, these expressions become, in the limit,
\[
\frac{da}{dc} = \frac{\sin (B + C)}{\sin C}, \quad \frac{da}{dA} = 0, \quad \frac{da}{dB} = 0,
\]
provided the angle \( C \) do not vanish.
If these results be applied to the triangle of which the sides are \( v, v_i, v_i - v, \) and the opposite angles \( \pi - i, i, I, \) it is easily seen that the values of \( \left(\frac{du}{dv}\right), \) &c. are as follows:
\[
\left(\frac{du}{dv}\right) = -\frac{\sin i}{\sin I}, \quad \left(\frac{du}{dv_i}\right) = \frac{\sin i}{\sin I}, \quad \left(\frac{du}{dv}\right) = -\frac{\sin i}{\sin I}, \quad \left(\frac{du}{dv_i}\right) = \frac{\sin i}{\sin I}
\]
\[
\left(\frac{du}{di}\right) = \left(\frac{du}{di_i}\right) = \left(\frac{du}{di}\right) = \left(\frac{du}{di_i}\right) = 0,
\]
provided \( I \) be not \( = 0. \) We have therefore
\[
\frac{d\Omega}{dv} = -E\Omega + \frac{1}{\sin I} (\sin i, E\Omega + \sin i, E_i\Omega)
\]
\[
\frac{d\Omega}{di} = -\frac{d\Omega}{dT};
\]
exactly in the same way we find
\[
\frac{d\Omega_i}{dv_i} = -E_i\Omega_i - \frac{1}{\sin I} (\sin i, E_i\Omega_i + \sin i, E_i\Omega_i)
\]
\[
\frac{d\Omega_i}{di_i} = \frac{d\Omega_i}{dT}.
\]
The variations of the elements will now be found by introducing the above values of
\[ \frac{d\Omega}{dv}, \frac{d\Omega_i}{dv_i}, \frac{d\Omega}{dt}, \frac{d\Omega_i}{dt_i} \]
in the expressions given in art. 55, and completing these expressions by the addition of the terms (95.), art. 79. But the angular velocities \( \omega_0, \omega_1, \omega_2 \) are no longer wholly arbitrary, since we have made one assumption concerning the motion of the axes, which implies one relation between these quantities and the elements. In order to determine \( \omega_0, \omega_1, \omega_2 \) completely, it will be necessary to make two more assumptions; but first we will investigate the relation already implied.
86. The complete expressions for \( v', v' \), obtained in the way mentioned in the last article, from arts. 55 and 79, may be written in the following form: put for brevity
\[ m\sqrt{\mu a(1-e^2)} = p, \quad m_i\sqrt{\mu_i a_i(1-e_i^2)} = p_i, \]
and put \( \mu^{3/2}a^{-3/2} \) for \( n \), and \( \mu_i^{3/2}a_i^{-3/2} \) for \( n_i \); then
\[ v' = \frac{1}{p \sin i} \frac{d\Omega}{di} + \cot i (\omega_0 \sin v - \omega_1 \cos v) - \omega_2 \]
\[ v' = \frac{1}{p_i \sin i_i} \frac{d\Omega_i}{di_i} + \cot i_i (\omega_0 \sin v_i - \omega_1 \cos v_i) - \omega_2 \]
and if the latter equation be subtracted from the former, and the conditions
\[ v_i = v, \quad \frac{d\Omega}{di} = -\frac{d\Omega_i}{di_i}, \quad \frac{d\Omega_i}{di_i} = \frac{d\Omega}{di}, \quad i_i = i \]
be introduced, the result is easily found to be
\[ (\omega_0 \sin v - \omega_1 \cos v) \sin I = \frac{\sin i}{p_i} \frac{d\Omega_i}{di_i} + \frac{\sin i}{p} \frac{d\Omega}{di}. \]
This is the relation between \( \omega_0, \omega_1 \) and the elements and \( t \), implied by the one assumed condition that the plane of \( xy \) passes through the line of nodes. The angular velocity \( \omega_2 \) of the system of axes about the axis of \( z \), is so far left, as it evidently ought to be, perfectly arbitrary.
\( \Omega \) and \( \Omega_i \) are now functions of the following elements* only:
\[ a, e, (\varepsilon), \varpi, a_i, e_i, (\varepsilon_i), \varpi_i, I, v. \]
And we now have
\[ \cos \chi = \cos (\theta - v) \cos (\theta_i - v) + \sin (\theta - v) \sin (\theta_i - v) \cos I. \]
87. The complete expression for \( t' \), derived from arts. 55 and 79, is easily put in the form
\[ t' = -\frac{1}{p \sin i} \left[ \frac{d\Omega}{dv} + (1 - \cos i) E \Omega \right] - (\omega_0 \cos v + \omega_1 \sin v); \]
and, on introducing the assumptions of the preceding articles, this will be found to become, after simple reductions,
\[ p \sin I.t' = -(\cos I.E \Omega + E_i \Omega) - (\omega_0 \cos v + \omega_1 \sin v) p \sin I; \]
* On the difference between \( (\varepsilon) \) and \( \varepsilon \) see above, art. 55. We cannot strictly call \( \Omega \) a function of \( a \) and \( \varepsilon \).
similarly, we find
\[ p_i \sin I \cdot i' = \cos I \cdot E_i \Omega_i + E \Omega_i - (\omega_0 \cos \nu + \omega_1 \sin \nu) p_i \sin I; \]
and since \( I' = i' - i \), we obtain from these
\[ pp_i \sin I \cdot I' = p (\cos I \cdot E_i \Omega_i + E \Omega_i) + p_i (\cos I \cdot E \Omega + E_i \Omega), \ldots (104.) \]
an equation which may be transformed as follows:—since
\[ \frac{\partial a}{\partial t} = 0, \quad \frac{\partial e}{\partial t} = 0 \text{ (art. 79)}, \]
we find from art. 55,
\[ p' = \frac{d \Omega}{ds} + \frac{d \Omega}{d \omega} = E \Omega, \]
and similarly \( p' = E_i \Omega_i \), whence it is easily seen that the above equation may be written in the form
\[ (pp_i \cos I)' + pE \Omega_i + p_i E_i \Omega = 0. \ldots \ldots \ldots \ldots (105.) \]
If we investigate, from the above expressions for \( i' \) and \( i \), the values of \((p \sin i)'\) and \((p_i \sin i)' \), we find
\[ \sin I \cdot (p \sin i)' = -(\cos i \cdot E + \cos i \cdot E_i) \Omega - p \cos i \sin I (\omega_0 \cos \nu + \omega_1 \sin \nu), \]
\[ \sin I \cdot (p_i \sin i)' = (\cos i \cdot E_i + \cos i \cdot E) \Omega - p_i \cos i \sin I (\omega_0 \cos \nu + \omega_1 \sin \nu), \]
and, adding these equations,
\[ \sin I \cdot (p \sin i + p_i \sin i)' = (\cos i \cdot E + \cos i \cdot E_i)(\Omega_i - \Omega) \]
\[ - (p \cos i + p_i \cos i) \sin I (\omega_0 \cos \nu + \omega_1 \sin \nu). \ldots \ldots (106.) \]
With respect to this equation and (103.), it may be observed that \( \omega_0 \cos \nu + \omega_1 \sin \nu \) is evidently the angular velocity of the plane of \( xy \) about the line of nodes, and \( \omega_1 \cos \nu - \omega_0 \sin \nu \) is the angular velocity of the same plane about a line in itself perpendicular to the line of nodes; or, which comes to the same thing, the angular velocity of the line of nodes itself in fixed space, estimated perpendicularly to the plane of \( xy \).
88. The position of the plane of \( xy \) at any instant has been so far left arbitrary, except that it has been subjected to the condition of passing through the line of nodes.
As a further assumption, that which most naturally presents itself is, that the plane of \( xy \) should always coincide with the principal plane. By the principal plane I mean, of course, that on which at any instant the sum of the projections of the areal velocities (multiplied by the masses) of the two planets about the sun, is a maximum; it evidently always passes through the line of nodes, and would be the invariable plane if the disturbing functions vanished.
To determine the position of this plane we have (see art. 80.) to express the condition that \( i \) and \( i_i \) are always so taken, subject to the equation \( i_i - i = I \), that the expression \( m \sqrt{\mu a(1-e^2)} \cdot \cos i + m \sqrt{\mu_i a_i(1-e_i^2)} \cdot \cos i_i \) shall be a maximum. We will put \( \sigma \) for the value of this expression, so that using \( p \) and \( p_i \) with the same meaning as before (see art. 86.), we have
\[ \sigma = p \cos i + p_i \cos i_i, \]
and the required condition of a maximum will evidently be
\[ p \sin i + p_i \sin i = 0 \]
adding the squares of these expressions, we obtain
\[ \sigma^2 = p^2 + p_i^2 + 2pp_i \cos I \]
which determines the actual value of \( \sigma \); moreover we have
\[ -\frac{\sin i}{p_i} = \frac{\sin i}{p} = \frac{\sin I}{\sigma} \]
and it is easy to find
\[ \sigma \cos i = p + p_i \cos I \]
\[ \sigma \cos i = p_i + p \cos I \]
so that \( \sigma, \sin i, \sin i_i, \cos i, \cos i_i \) are all simply expressible in terms of \( p, p_i \) and \( I \).
The variation of \( \sigma \) is easily found by means of the equation (105.), which gives
\[ \sigma' = (p_i E_i - pE)(\Omega_i - \Omega) \]
The equation (103.) now gives (see (109.))
\[ \sigma (\omega_0 \cos \nu - \omega_0 \sin \nu) = \frac{d}{dI} (\Omega_i - \Omega) \]
and from (106.) we obtain
\[ \sigma^2 \sin I (\omega_0 \cos \nu + \omega_0 \sin \nu) = \{(p + p_i \cos I)E_i + (p_i + p \cos I)E\}(\Omega_i - \Omega) \]
The two last equations determine the motion of the principal plane in space, irrespectively of any arbitrary sliding motion which we may attribute to it in its own plane. For they give the angular velocities with which it is at any instant moving about two lines at right angles to one another in its own plane (see the end of art. 87.).
They may be put in another form as follows:—the actual value of \( \Omega_i - \Omega \) is
\[ mm_i \left( \frac{r_i}{r_i^2} - \frac{r_i}{r_i^2} \right) \cos \chi \]
where \( \cos \chi \) has the value given above (end of art. 86.); and if the operations indicated be actually performed, observing that \( Er = 0, Er' = 0, \&c., \) and that
\[ E \cos \chi = \frac{d \cos \chi}{db}, \&c., \]
the results will be found to be
\[ \sigma (\omega_0 \cos \nu - \omega_0 \sin \nu) = mm_i \sin I \left( \frac{r_i}{r_i^2} - \frac{r_i}{r_i^2} \right) \sin (\theta - \nu) \sin (\theta_i - \nu) \]
\[ \sigma^2 (\omega_0 \cos \nu + \omega_0 \sin \nu) = mm_i \sin I \left( \frac{r_i}{r_i^2} - \frac{r_i}{r_i^2} \right) \times (p \cos (\theta - \nu) \sin (\theta_i - \nu) + p_i \sin (\theta - \nu) \cos (\theta_i - \nu)) \]
Here \( \theta - \nu, \theta_i - \nu \) represent, it will be remembered, the angular distances of the planets from the line of nodes.
We will assume for the present the condition \( \omega_2 = 0 \), so that the plane of \( xy \) may have no sliding motion, but roll upon the conical surface to which it is always a tan-
* Referring to the arrangement supposed in the diagram, it will be seen that \( i \) becomes negative in the case now considered.
gent plane. If, then, we call $\omega$ the actual angular velocity of the principal plane about its instantaneous axis of rotation, so that $\omega = \sqrt{\omega_0^2 + \omega_j^2}$, and put $j$ for the angle between the line of nodes and the instantaneous axis (which is the line in which the principal plane is intersected by its consecutive), we shall have
$$\omega_0 \cos \nu + \omega_j \sin \nu = \omega \cos j \quad \text{and} \quad \omega_1 \cos \nu - \omega_0 \sin \nu = \omega \sin j;$$
and the above equations give
$$\sigma \cot j = p \cot (\theta - \nu) + p_i \cot (\theta_i - \nu),$$
a result which may also be put in the form (see (109.))
$$\sin I \cot j = \sin i \cot (\theta - \nu) - \sin i \cot (\theta_i - \nu).$$
It is very easy to show by spherical trigonometry that this equation signifies that the instantaneous axis coincides with the line in which the principal plane is intersected by the plane of the two radii vectores*. In other words, we have this theorem:
The principal plane always turns about the line in which it is intersected by the plane of the two radii vectores.
It follows of course that the principal plane always touches the conical surface described in fixed space (relatively to the sun) by the said line. I think it probable that most persons would expect, at first sight, that the principal plane would always touch the conical surface described by the line of nodes, which, as has been just shown, is not the case. It is perhaps worth while to verify this result by independent reasoning.
89. Let Roman letters refer, as in former articles, to axes of coordinates having any arbitrary fixed directions. Then, putting $A = m(yz' - zy')$, &c. (as in art. 80.), and using $\xi$, $\eta$, $\zeta$ for current coordinates, the equation to the principal plane is
$$(A + A')\xi + (B + B')\eta + (C + C')\zeta = 0; \quad . . . . . . (114.)$$
and the line in which this plane is cut by its consecutive is determined by combining the equation (114.) with that obtained from it by differentiating the coefficients of $\xi$, $\eta$, $\zeta$ with respect to $t$; namely,
$$(A' + A'_t)\xi + (B' + B'_t)\eta + (C' + C'_t)\zeta = 0. \quad . . . . . . (115.)$$
Now from the fundamental equations
$$mx'' + my\frac{x}{r^3} = \frac{d\Omega}{dx}, \&c.,$$
we obtain
$$A' = m(yz'' - zy'') = y\frac{d\Omega}{dz} - z\frac{d\Omega}{dy} = mm(yz_t - zy_t)(\delta^{-3} - r^{-3}).$$
* Let $\psi$ be the angle between the principal plane and the plane of the radii vectores; the former plane divides the angle $I$ into two parts, of which one is $i$, and the other is $-i$; and we get two spherical triangles which have a common side $j$, with adjacent angles $i$, and $\psi$ in one, $-i$ and $\pi - \psi$ in the other; hence
$$\cot (\theta - \nu) \sin j = \cot \psi \sin i + \cos j \cos i,$$
$$\cot (\theta_i - \nu) \sin j = \cot \psi \sin i + \cos j \cos i;$$
and if cot $\psi$ be eliminated between these, the result is the equation in the text.
(where $\delta$ is the distance between $m$ and $m'$); and in like manner
$$A' = -mm_i(yz_i - zy_i)(\delta^{-3} - r_i^{-3});$$
we have therefore
$$A' + A' = mm_i(yz_i - zy_i)(r_i^{-3} - r^{-3}),$$
with similar expressions for $B' + B', C' + C'$; so that, when the common factor $mm_i(r_i^{-3} - r^{-3})$ is omitted, the equation (115.) becomes
$$(yz_i - zy_i)\xi + (zx_i - xz_i)\eta + (xy_i - yx_i)\zeta = 0,$$
which is evidently the equation to the plane containing the two radii vectores. Thus the theorem in question is verified.
90. To return from this digression: the motion of the line of nodes in the principal plane will be given by putting $\omega_2 = 0$ in either of the equations (102.), and introducing the value of $\omega_0 \sin \nu - \omega_1 \cos \nu$ from (103.). In this way we find, after slight reductions,
$$pp_i \sin I.\nu' = p_i \cos i.\frac{d\Omega}{dT} + p \cos i.\frac{d\Omega_i}{dT},$$
in which we may substitute for $\cos i, \cos i'$, the values given by equations (110.); this gives
$$\sigma pp_i \sin I.\nu' = (p^2 + pp_i \cos I)\frac{d\Omega}{dT} + (p^2 + pp_i \cos I)\frac{d\Omega_i}{dT};$$
or, if we introduce the actual values of $\frac{d\Omega}{dT}, \frac{d\Omega_i}{dT}$, we find
$$pp_i = -mm_p r_r \sin (\theta - \nu) \sin (\theta_i - \nu) \times \{p \cos i(\delta^{-3} - r^{-3}) + p_i \cos i(\delta^{-3} - r_i^{-3})\}.$$
It is not my purpose however to enter further into details; and I shall conclude this subject by briefly examining the consequences of a slightly different assumption as to the motion of the axes of coordinates. I shall suppose, namely, that the plane of $xy$ still always coincides with the principal plane, but has a sliding motion such that the axis of $x$ always coincides with the line of nodes.
91. The assumption made at the end of the last article implies the condition $\nu = 0$; and $\omega_2$ will no longer be 0, but must be determined by equations (102.); either of these gives (putting $\nu' = \nu' = 0$, and reducing by means of (103.), (109.), &c.)
$$pp_i \sin I.\omega_2 = p_i \cos i.\frac{d\Omega}{dT} + p \cos i.\frac{d\Omega_i}{dT}, \ldots . . . . . . (116.)$$
which coincides, as of course it ought, with the expression given for $\nu'$ on the former hypothesis (art. 90.). The difference is that $\Omega, \Omega_i$ now no longer contain $\nu$.
The values of $\omega_0, \omega_1$ are obtained at once by putting $\nu = 0$ in the equations (112.), (113.); and all the conclusions which were derived independently of any supposition as to the value of $\omega_2$, subsist as before, when modified by putting $\nu = 0$.
We may add one more equation, which is required in forming some of the expressions for the variations of the elements; namely,
$$\tan \frac{i}{2} = \frac{-p_i \sin I}{\sigma + p + p_i \cos I}. \ldots . . . . . . (117.)$$
This is easily obtained from (109.) and (110.) ; and in like manner
\[ \tan \frac{t}{2} = \frac{p \sin I}{\sigma + p_i + p \cos I}. \]
I shall now recapitulate the results of the last supposition, so as to exhibit in one view the transformed differential equations of the problem of three bodies. It will be as well to repeat also the explanation of the symbols.
92. Signification of the Symbols.
\( M, m, m_i \) are the masses of the sun, and of the two planets, and \( M + m = \mu, \)
\( M + m_i = \mu_i. \)
\( a \) and \( e \) are the semiaxis and excentricity of the instantaneous ellipse described by \( m \) the sun.
\( a_i \) and \( e_i \) have the same meanings with reference to \( m_i. \)
\( I \) is the inclination of the plane of the former ellipse to that of the latter.
\( \theta, \theta_i \) are the longitudes of the two planets, measured in the planes of their orbits from the common line of nodes.
\( r, r_i \) their radii vectores.
\( \varphi, \varphi_i \) the longitudes of the perihelia, measured likewise in the planes of the orbits from the line of nodes.
\( \varepsilon, \varepsilon_i \) two elements such that \( \int_0^t ndt + \varepsilon, \int_0^t n_i dt + \varepsilon_i \) are the mean longitudes, where \( n, n_i \) are defined as usual by the equations \( n^2 = \mu a^{-3}, \; n_i^2 = \mu_i a_i^{-3}. \)
Then \( \int_0^t ndt + \varepsilon - \varphi \) is the mean anomaly of \( m, \) and \( r, \theta \) are functions of the mean anomaly and mean longitude given by the laws of elliptic motion. The same is true for \( m_i, \) mutatis mutandis.
\( \chi \) is the angle between the radii vectores, so that
\[ \cos \chi = \cos \theta \cos \theta_i + \sin \theta \sin \theta_i \cos I. \]
Let \( \delta \) be the distance between the planets, so that
\[ \delta^2 = r^2 + r_i^2 - 2rr_i \cos \chi. \]
\( \Omega, \Omega_i \) are the disturbing functions, defined by the equations
\[ \Omega = mm_i \left( \frac{1}{\delta} - \frac{r \cos \chi}{r_i^2} \right), \quad \Omega_i = mm_i \left( \frac{1}{\delta} - \frac{r_i \cos \chi}{r^2} \right), \]
and, when expressed in terms of the elements and \( t, \) are functions of
\[ a, a_i, e, e_i, I, \int_0^t ndt + \varepsilon, \int_0^t ndt + \varepsilon - \varphi, \int_0^t n_i dt + \varepsilon_i, \int_0^t n_i dt + \varepsilon_i - \varphi_i. \]
When \( \Omega, \Omega_i \) are considered on the one hand as expressed in this way, and on the other, in their original form as functions of \( r, r_i, \theta, \theta_i, \) and \( I, \) we have, as applied to either of them,
\[ \frac{d}{d\theta} = \frac{d}{d\varepsilon} + \frac{d}{d\varphi}, \quad \frac{d}{d\theta_i} = \frac{d}{d\varepsilon_i} + \frac{d}{d\varphi_i}. \]
$p, p_i, \sigma$ are defined by the equations
$$p = m\sqrt{\mu a(1-e^2)}, \quad p_i = m\sqrt{\mu_i a_i(1-e_i^2)}, \quad \sigma^2 = p^2 + p_i^2 + 2pp_i \cos I.$$
$\iota, \iota_i$ are the angles between the principal plane and the planes of the two orbits, and are given functions of the elements $a, a_i, e, e_i, I$, by virtue of the equations
$$-p\sigma \sin \iota = p_i \sigma \sin \iota_i = pp_i \sin I,$$
from which we have also
$$\sigma \cos \iota = p + p_i \cos I, \quad \sigma \cos \iota_i = p_i + p \cos I$$
(for the values of $\tan \frac{\iota}{2}, \tan \frac{\iota_i}{2}$; see art. 91.).
$\omega_0$ is the angular velocity of the principal plane about the line of nodes;
$\omega_1$ the angular velocity of the principal plane about a line in itself perpendicular to the line of nodes;
$\omega_2$ the angular velocity of the line of nodes estimated in the direction of the principal plane.
**Differential Equations of the Problem.**
93. The nine intrinsic elements, as we may perhaps appropriately call them, namely,
$$a, a_i, e, e_i, \varepsilon, \varepsilon_i, \varpi, \varpi_i, I,$$
are determined as functions of $t$ by the following system of nine simultaneous differential equations of the first order:
$$m\mu a' = 2na^2 \frac{d\Omega}{de} - m\mu_i a_i' = 2n_i a_i^2 \frac{d\Omega_i}{de_i}$$
$$m\mu e' = -na \sqrt{1-e^2} \left\{ \frac{d\Omega}{de} + (1-\sqrt{1-e^2}) \frac{d\Omega_i}{de_i} \right\}$$
$$m\mu_i e_i' = -na_i \sqrt{1-e_i^2} \left\{ \frac{d\Omega_i}{de_i} + (1-\sqrt{1-e_i^2}) \frac{d\Omega}{de} \right\}$$
$$m\mu \varepsilon' = -2na^2 \frac{d\Omega}{da} + na \sqrt{1-e^2} (1-\sqrt{1-e^2}) \frac{d\Omega}{de}$$
$$- \frac{m\mu}{\sin I} \left\{ \frac{\cos I}{p} \frac{d\Omega}{dT} + \frac{1}{p_i} \frac{d\Omega_i}{dT} \right\}$$
$$m\mu_i \varepsilon_i' = -2n_i a_i^2 \frac{d\Omega_i}{da_i} + na_i \sqrt{1-e_i^2} (1-\sqrt{1-e_i^2}) \frac{d\Omega_i}{de_i}$$
$$- \frac{m\mu_i}{\sin I} \left\{ \frac{\cos I}{p_i} \frac{d\Omega_i}{dT} + \frac{1}{p} \frac{d\Omega}{dT} \right\}$$
$$m\mu \varpi' = na \sqrt{1-e^2} \frac{d\Omega}{de} - \frac{m\mu}{\sin I} \left\{ \frac{\cos I}{p} \frac{d\Omega}{dT} + \frac{1}{p_i} \frac{d\Omega_i}{dT} \right\}$$
$$m\mu_i \varpi_i' = na_i \sqrt{1-e_i^2} \frac{d\Omega_i}{de_i} - \frac{m\mu_i}{\sin I} \left\{ \frac{\cos I}{p_i} \frac{d\Omega_i}{dT} + \frac{1}{p} \frac{d\Omega}{dT} \right\}$$
pp_i \sin I.I' = p_i \left\{ \cos I \left( \frac{d\Omega}{d\varepsilon} + \frac{d\Omega}{d\varphi} \right) + \frac{d\Omega}{d\varepsilon_i} + \frac{d\Omega}{d\varphi_i} \right\} \\
+ p \left\{ \cos I \left( \frac{d\Omega_i}{d\varepsilon_i} + \frac{d\Omega_i}{d\varphi_i} \right) + \frac{d\Omega_i}{d\varepsilon} + \frac{d\Omega_i}{d\varphi} \right\}.
94. The only parts of the preceding expressions of which the deduction is not perfectly obvious, are the terms involving \( I \) in the values of \( \varepsilon', \varepsilon, \varphi', \varphi \). They are obtained, as has been sufficiently explained, from the expressions in art. 55, to which are to be added the values of \( \frac{\partial \varepsilon}{dt}, \ldots \) ((95.), art. 79.) ; on putting \( v = 0, \omega_0 \) disappears from the latter; and the values of \( \omega_1, \omega_2, \cos \iota, \cos \iota, \tan \frac{\iota}{2}, \tan \frac{\iota}{2} \) are to be introduced (equations (110.), (112.), (116.), (117.).) After some rather troublesome reductions, the expressions above given will be found.
In these equations it will be recollected that the mean longitude of \( m \) is represented by \( \int_0^t ndt + \varepsilon \), and the differentiation with respect to \( a \) in \( \frac{d\Omega}{da} \) is only performed so far as \( a \) appears explicitly. If we wished that the mean longitude should be expressed by \( nt + \varepsilon \), the only change in the equations would be that the differentiation with respect to \( a \) must be total; i.e. must extend to \( a \) as contained in \( n \). A similar remark applies of course to \( \varepsilon_i \).
In actual use it would be more convenient to introduce \( R, R_i \) instead of \( \Omega, \Omega_i \); the latter functions give a rather more symmetrical form to the equations, and are more convenient in general investigations. (The relation between them here is merely \( \Omega = mR, \Omega_i = m_iR_i \); in another part of the paper the symbol \( \Omega \) was used for \(-mR\) (art. 55.).)
95. If the equations of art. 93. were completely integrated, the intrinsic motion of the system would be completely determined; that is, we should know at any instant the dimensions of the two orbits, the mutual inclination of their planes, the position of their axes with respect to the line of nodes, the place of each planet in its orbit, and (by (110.)) the inclination of each orbit to the principal plane.
The position of the system relatively to fixed space would then have to be separately determined as follows:
The three quantities \( \omega_0, \omega_1, \omega_2 \) (see end of art. 92.), of which the values are ((112.), (113.), (116.)) given by the equations
\[ \sigma^2 \sin I.\omega_0 = \left\{ (p + p_i \cos I) \left( \frac{d}{d\varepsilon} + \frac{d}{d\varphi} \right) + (p_i + p \cos I) \left( \frac{d}{d\varepsilon_i} + \frac{d}{d\varphi_i} \right) \right\} (\Omega_i - \Omega) \]
\[ \sigma \omega_1 = \frac{d}{dI} (\Omega_i - \Omega) \]
\[ \sigma \sin I.\omega_2 = \left( \frac{p_i}{p} + \cos I \right) \frac{d\Omega}{dI} + \left( \frac{p_i}{p} + \cos I \right) \frac{d\Omega_i}{dI}, \]
would be given functions of $t$. Then if we call
$J$ the inclination of the principal plane to an arbitrary fixed plane;
$\Omega$ the longitude of the line of intersection of these two planes, reckoned in the fixed plane from a fixed line;
$N$ the angle between this line of intersection and the line of nodes;
we should have (as in art. 83. with a different notation)
$$\begin{align*}
J' &= \omega_0 \cos N - \omega_1 \sin N \\
\Omega' \sin J &= \omega_0 \sin N + \omega_1 \cos N \\
N' &= \omega_2 - \cot J (\omega_0 \sin N + \omega_1 \cos N)
\end{align*}$$
and the integration of this system would give $J$, $\Omega$, $N$ as functions of $t$, and so determine at any instant the position of the principal plane and of the line of nodes, relatively to fixed space.
With respect to the motion of the principal plane, the following may be added. It has already been shown (art. 88, 89.) that the line about which it is at any instant turning, coincides with that in which it is intersected by the plane of the radii vectores; and the values of $\omega_0$, $\omega_1$ (see art. 88, putting $\nu = 0$ in the expressions there given) may be put in the form
$$\sigma^2 \omega_0 = mm_i \sin I \left( \frac{r_1}{r^2} - \frac{r}{r^2} \right) (p \cos \theta \sin \theta_i + p_i \sin \theta \cos \theta_i)$$
$$\sigma \omega_1 = mm_i \sin I \left( \frac{r_1}{r^2} - \frac{r}{r^2} \right) \sin \theta \sin \theta_i.$$
If the latter of these be multiplied by $\sigma$, and then both sides of each squared, and the results added (after putting for $\sigma^2$ on the right its value $p^2 + p_i^2 + 2pp_i \cos I$), we find, observing that $\cos \theta \cos \theta_i + \sin \theta \sin \theta_i \cos I = \cos \chi$,
$$\sigma^2 \sqrt{\omega_0^2 + \omega_1^2} = mm_i \sin I \left( \frac{r_1}{r^2} - \frac{r}{r^2} \right) (p^2 \sin^2 \theta_i + p_i^2 \sin^2 \theta + 2pp_i \sin \theta \sin \theta_i \cos \chi)^{\frac{1}{2}},$$
an expression which may be further transformed as follows. Let $\lambda$, $\lambda_i$ be the latitudes of $m$, $m_i$ (with reference to the principal plane); then $\sin \lambda = \sin \theta \sin \theta_i$, $\sin \lambda_i = \sin \theta_i \sin \theta_i$; hence, since $p = \frac{\sigma \sin \theta_i}{\sin I}$, $p_i = \frac{-\sigma \sin \theta_i}{\sin I}$, we obtain
$$\sigma \sqrt{\omega_0^2 + \omega_1^2} = mm_i \left( \frac{r_1}{r^2} - \frac{r}{r^2} \right) (\sin^2 \lambda + \sin^2 \lambda_i - 2 \sin \lambda \sin \lambda_i \cos \chi)^{\frac{1}{2}}.$$
This gives the absolute angular velocity with which the principal plane is at any instant changing its direction in space; it is evident that (if the supposition $r_i = r$ be excluded) it can never vanish except when both planets are in the line of nodes. The direction of the rotation is determined by the signs of $\omega_0$ and $\omega_1$.
96. The system of differential equations given in art. 93. affords an example of the so-called "elimination of the nodes" in the problem of three bodies. Jacobi, by a very remarkable and ingenious transformation, has effected the elimination in a quite
different manner*. The equations of art. 93. are merely transformations of the original differential equations of the problem, without any integrations; they are however in a form which might perhaps be used advantageously in certain cases for the purposes of physical astronomy. Those of Jacobi are obtained by employing all the four usual integrals of the problem, and are shown to include an additional integration. They have however the disadvantage of substituting the coordinates of two fictitious bodies for those of the actual planets, and would probably be inconvenient for ordinary practical use; though in a theoretical point of view they seem to deserve more attention than they have hitherto received. It would be wrong to take leave of this celebrated problem without referring to another transformation by M. Bertrand†, which, as has been remarked by a recent writer in the same journal, effects six integrations, and therefore represents the furthest advance which has yet been made towards a rigorous solution.
Appendix A.
When the method described in Theorem VII. (art. 49.) is applied to the solution of a system of equations of the form (I.), of which \( n \) integrals, \( a_1 \ldots a_n \), satisfying the conditions \([a_i, a_j] = 0\), are given, the first step is to express the \( n+1 \) partial differential coefficients \( \frac{dX}{dx_i}, \ldots \), and \( \frac{dX}{dt} \); namely, the values of \( y_1, \ldots y_n \) and \(-Z\) in terms of \( x_1, \ldots x_n, a_1, \ldots a_n \) and \( t \). The direct process is then to find \( X \) by integrating the expression \( y_1 dx_1 + y_2 dx_2 + \ldots + y_n dx_n - Z dt \), and afterwards to form the remaining integral equations \( \frac{dX}{da_i} = b_i, \ldots \): when this process is adopted, the inferior limits in the integrations are perfectly arbitrary; in other words, we may add to \( X \) an arbitrary function of \( a_1, a_2, \ldots a_n \), without altering any of the general properties of the final system of integrals.
But it is generally much more convenient to perform the differentiations with respect to \( a_1, \ldots a_n \) first, and integrate afterwards; thus we obtain the remaining equations in the form
\[
b_i = \int \left( \frac{dy_1}{da_i} dx_1 + \frac{dy_2}{da_i} dx_2 + \ldots + \frac{dy_n}{da_i} dx_n - \frac{dZ}{da_i} dt \right).
\]
When this plan is followed, the limits are still arbitrary if it be only required that the equations thus obtained shall be true; but if it be required that they shall form, with the given integrals, a normal solution, it is necessary to take the limits in such a manner that the functions equated to \( b_1, b_2, \ldots b_n \) shall be the partial differential coefficients with respect to \( a_1, a_2, \ldots a_n \), of one and the same function; which will not generally be the case unless care be taken that it should be so.
In practice, the expression for \( dX \) usually consists of several terms, of which each
* Comptes Rendus, 1842, part 2. p. 236, &c. † Liouville's Journal, 1852.
contains one of the variables only. Suppose one of these terms is
\[ \phi(x, a_1, a_2, \ldots a_n)dx, \]
so that, so far as this term is concerned, we have
\[ X = \int_A^x \phi(x, a_1, a_2, \ldots a_n)dx, \]
where \( A \) is an arbitrary function of \( a_1, \ldots a_n \). Consequently
\[ \frac{dX}{da_i} = \int_A^x \frac{d\phi}{da_i}dx - \phi(A, a_1, \ldots a_n)\frac{dA}{da_i}, \]
and we see that we should not in general obtain the differential coefficients with respect to \( a_1, \ldots a_n \) of one and the same function \( X \), by merely integrating \( \frac{d\phi}{da_1}, \frac{d\phi}{da_2}, \ldots \), with respect to \( x \), from the same inferior limit \( A \), chosen at hazard.
But it is evident that we shall attain this end if we adopt the following simple rule:
Integrate \( \frac{d\phi}{da_1}, \ldots \) with respect to \( x \), taking the same inferior limit in each case, namely, either
(1) a value \( A \) of \( x \) which satisfies the equation \( \phi(x, a_1, \ldots a_n) = 0 \), or
(2) any determinate constant (i.e. not a function of \( a_1, \ldots a_n \)).
For example, in the problem of central forces (Part I., art. 28, &c.), we had (see art. 19.)
\[ dX = -hdt + cd\theta + \left(2m(h + \phi(r)) - k^2r^{-2}\right)^{\frac{3}{2}}dr + (k^3 - c^2 \sec^2 \lambda)^{\frac{3}{2}}d\lambda. \]
(where \( r, \theta, \lambda \) are the three variables).
The very troublesome process of differentiating \( X \) with respect to \( h, k \) and \( c \) after first finding \( X \) by integrating the above expression, is avoided by the method adopted in art. 29; namely, by differentiating first, and integrating afterwards. In the integrations with respect to \( r \), the inferior limit is one of the roots of the equation
\[ 2m(h + \phi(r)) - k^2r^{-2} = 0, \]
namely (in the case of elliptic motion), the perihelion distance; and in those with respect to \( \lambda \), the inferior limit is 0; so that the rule above given is observed.
At the time of writing the article referred to, neither the rule itself, nor the necessity of attending to the limits, had occurred to me; it was therefore, strictly speaking, accidental that the final integrals were obtained in a normal form.
In treating the problem of rotation (Section III.), I perceived the necessity of caution as to the limits, if the former order of proceeding were adopted; but preferred avoiding the risk of error altogether, by performing the integrations first, so as to obtain the actual expression for \( V \). The final equations (R.), art. 44, might however be obtained in a more simple way by differentiating first; thus we should have (see equations (45.), (46.)), observing that \( \frac{dk}{dh} = \frac{k}{2h} \), &c. (art. 44.), and putting
\[
(1 - \cos^2 i - \cos^2 j + 2 \cos i \cos j \cos \theta - \cos^2 \theta)^{\frac{1}{2}} = Q,
\]
\[
\frac{dV}{dh} = \frac{k}{2h} \left[ \psi \cos i + \phi \cos j + \int \frac{Q}{\sin \theta} d\theta \right] = t + \tau
\]
\[
\frac{dV}{\cos i} = k \left[ \psi + \int \frac{\cos j \cos \theta - \cos i}{Q \sin \theta} d\theta \right] = \alpha
\]
\[
\frac{dV}{\cos j} = \frac{(C-A)k^3 \cos j}{2AC} \left[ \psi \cos i + \phi \cos j + \int \frac{Q}{\sin \theta} d\theta \right]
\]
\[
+ k \left[ \phi + \int \frac{\cos i \cos \theta - \cos j}{Q \sin \theta} d\theta \right] = \beta.
\]
In order to get rid of the troublesome integration involved in the term \( \int \frac{Q}{\sin \theta} d\theta \), we may (1) eliminate this term between the first and last of these equations, and (2) eliminate \( \psi \) between the first and second. We thus find the two following equations,
\[
\phi + \int \frac{\cos i \cos \theta - \cos j}{Q \sin \theta} d\theta = \beta - \frac{C-A}{AC} k \cos j \cdot (t+\tau)
\]
\[
\cos j \left[ \phi + \int \frac{\cos i \cos \theta - \cos j}{Q \sin \theta} d\theta \right] + \int \frac{\sin \theta d\theta}{Q} = \frac{2h}{k}(t+\tau) - \frac{\alpha \cos i}{k},
\]
which last, combined with the preceding, gives
\[
\int \frac{\sin \theta d\theta}{Q} = \frac{k}{A}(t+\tau) - \frac{\alpha \cos i + \beta \cos j}{k};
\]
and we may take (i.), (ii.) and (iii.) as expressing the solution of the problem.
Now we have \( \cos I = \frac{\cos i - \cos j \cos \theta}{\sin j \sin \theta} \), from which it is easy to find (observing the conditions which determine the sign of \( Q \))
\[
-dI = \frac{\cos i \cos \theta - \cos j}{Q \sin \theta} d\theta, \text{ and similarly,}
\]
\[
-dJ = \frac{\cos j \cos \theta - \cos i}{Q \sin \theta} d\theta;
\]
we have moreover \( d\Theta = \frac{\sin \theta d\theta}{Q} \).
All the integrations may therefore now be performed immediately; and we may take for the inferior limit of \( \theta \) any value which satisfies the equation \( Q=0 \), or
\[
(\cos \theta - \cos i \cos j)^2 - \sin^2 i \sin^2 j = 0;
\]
this is satisfied by \( \theta = i+j \), which evidently corresponds to \( I=0, J=0, \Theta=0 \), and it is manifest that equations (i.), (ii.), (iii.) will thus become identical with equations (R) of art. 44.
I do not regret however having introduced the rather prolix investigation of arts. 39 and 43, because it is interesting to know the actual value of \( V \) (equation (48.)), which the method just given leaves undetermined.
APPENDIX B.
On the subject of the transformation of elements, the following additional remarks will hardly be superfluous. Suppose $\Omega$ is originally a function of the elements $a, b, c,$ &c. with $t$; and let $\alpha, \beta, \gamma,$ &c. be other quantities connected with $a, b, c,$ &c., by equations such as
$$da = A d\alpha + B d\beta + C d\gamma + \ldots + K dt, \quad \ldots \quad \ldots \quad \ldots \quad \ldots \quad \ldots \quad (a.)$$
where $A, B, C, \ldots K$ are given functions of $\alpha, \beta, \gamma, \ldots t$; or by equations such as
$$d\alpha = A_p da + B_p db + \ldots + K_p dt, \quad \ldots \quad \ldots \quad \ldots \quad \ldots \quad \ldots \quad (\alpha.)$$
where $A_p, \&c.$ are given functions of $a, b, \ldots, t$. In either case, if each of the equations be integrable per se, we may consider $a, b, c, \ldots$ as functions of $\alpha, \beta, \gamma, \ldots, t$; and such equations as
$$\frac{d\Omega}{d\alpha} = A \frac{d\Omega}{da} + B \frac{d\Omega}{db} + \ldots \quad \ldots \quad \ldots \quad \ldots \quad \ldots \quad (\Omega.)$$
are both significant and true.
But if the expressions on the right of the equations $(a.)$ be not differentials per se, the equations $(\Omega.)$ are either unmeaning or untrue. For the symbol $\frac{d\Omega}{d\alpha}$ implies one of two things; either, that $\Omega$ is expressed in terms of $\alpha, \beta, \ldots t$ without arbitrary constants (i.e. that the transformation of $\Omega$ can be actually effected without integrating the differential equations of the problem), which is manifestly impossible, unless $(a.), \&c.$ be integrable per se; or else, that the differential equations are to be conceived to have been completely integrated, so that $a, b, \&c.$, and consequently $A_p, B_p, \&c.$, are known as functions of $t$ and arbitrary constants, whereby the right-hand side of $(\alpha.)$ becomes an explicit function of $t$ (and arbitrary constants), so that $\alpha, \beta, \&c.$ may by integration be expressed in the same way, and, by means of $(a.), a, b, \&c.$ may be similarly expressed, and finally, by algebraical elimination, $a, b, \&c.$ become functions of $\alpha, \beta, \&c., t$, and arbitrary constants. On this supposition, $\frac{d\Omega}{d\alpha}$ has a meaning, but the equation $(\Omega.)$ is untrue; for we must have
$$\frac{d\Omega}{d\alpha} = \frac{d\Omega}{da} \frac{da}{d\alpha} + \frac{d\Omega}{db} \frac{db}{d\alpha} + \ldots;$$
and it is manifestly not true that $\frac{da}{d\alpha} = A, \&c.$ in this case, because the equation
$$da = A d\alpha + B d\beta + \ldots + K dt,$$
not being integrable per se, only subsists for those variations of $\alpha, \beta, \&c.$ which actually take place during the instant $dt$; whereas the equation
$$da = \frac{da}{d\alpha} d\alpha + \frac{da}{d\beta} d\beta + \ldots + \frac{da}{dt} dt$$
subsists for arbitrary variations of all the variables. This view of the subject entirely
coincides, in substance, with that taken by Jacobi; but the above mode of stating it may tend to make it clearer, and to call attention to a matter which, so far as I know, is not so much as mentioned in any of the elementary works usually in the hands of students of physical astronomy.
[Addition to Appendix B.]
Received March 1, 1855.
The remark made above, that the symbol $\frac{d\Omega}{da}$ is unmeaning in the case considered, is not of course intended to imply that a meaning may not be given to it; but then such meaning is different from the ordinary signification of the symbol, which is a partial derived function.
The whole matter may be strikingly illustrated by a simple example.
Consider the movement of a rigid body about a fixed point. Adopting the notation of art. 40 (Part I.), we have
$$pdt = -\cos \phi d\theta - \sin \theta \sin \phi d\psi$$
$$qdt = \sin \phi d\theta - \sin \theta \cos \phi d\psi$$
$$rdt = d\phi + \cos \theta d\psi.$$
Let $\alpha, \beta, \gamma$ be three new variables, defined by the equations
$$\alpha = \int_0^t pdt, \quad \beta = \int_0^t qdt, \quad \gamma = \int_0^t rdt,$$
so that $d\alpha = pdt$, &c. Then the above equations give
$$d\theta = -\cos \phi d\alpha + \sin \phi d\beta, \quad d\phi = d\gamma + \cot \theta (\sin \phi d\alpha + \cos \phi d\beta)$$
$$d\psi = -\cosec \theta (\sin \phi d\alpha + \cos \phi d\beta).$$
Here $\alpha, \beta, \gamma$ are the sums of the elementary angles described about the axes in the course of the motion; and no one would maintain that $\theta, \phi, \psi$ are functions of $\alpha, \beta, \gamma$, for the values of the latter variables at any time do not determine the values of the former. If therefore we choose to write such equations as
$$\frac{d\theta}{da} = -\cos \phi, \quad \frac{d\theta}{d\beta} = \sin \phi, \quad \text{&c.},$$
we must admit that $\frac{d\theta}{da}, \frac{d\theta}{d\beta}, \text{&c.}$ are not partial derived functions in the ordinary sense.
At most, $\frac{d\theta}{da}$ is the derived function of that function of $\alpha$ which $\theta$ would become if $\beta$ and $\gamma$ were maintained invariable, i.e. if the motion were restricted to a rotation about the A-axis. Again, if we admit such symbols as $\frac{d^2\theta}{d\beta da}, \frac{d^2\theta}{dad\beta}$, we must interpret them as follows:
$$\frac{d^2\theta}{d\beta da} = \frac{d}{d\beta} \frac{d\theta}{da} = -\frac{d\cos \phi}{d\beta} = \sin \phi \frac{d\phi}{d\beta},$$
but \( \frac{d\phi}{d\beta} = \cot \theta \cos \phi \), and therefore
\[ \frac{d^2\phi}{d\beta d\alpha} = \cot \theta \sin \phi \cos \phi; \]
and in like manner we find the same value for \( \frac{d^2\phi}{dad\beta} \), so that in this particular case the condition \( \frac{d^2\phi}{dad\beta} = \frac{d^2\phi}{d\beta d\alpha} \) is verified.
But if we take \( \frac{d^2\phi}{dyd\alpha} \) and \( \frac{d^2\phi}{d\beta dy} \) in the same way, we find the former \( = \sin \phi \) and the latter \( = 0 \), so that in this case the condition is not verified. The geometrical meaning of this is obvious; analytically it is merely an instance of a general fact, pointed out by Jacobi; namely, that the effect of two successive pseudo-differentiations with respect to two independent variables, is not generally independent of the order of operations.
If \( V \) be the potential of another body, given in position, upon the body considered, then \( V \) is a function of \( \theta, \phi, \psi \), and
\[ dV = \frac{dV}{d\theta} d\theta + \frac{dV}{d\phi} d\phi + \frac{dV}{d\psi} d\psi; \]
and if we substitute for \( d\theta, d\phi, d\psi \) their values in terms of \( d\alpha, d\beta, d\gamma \), we obtain an expression which we may call \( Ld\alpha + Md\beta + Nd\gamma \), \( L, M, N \) being functions of \( \theta, \phi, \psi, \frac{dV}{d\theta}, \frac{dV}{d\phi}, \frac{dV}{d\psi} \), of which, as is well known, the mechanical meanings are the moments of the attraction of the second body about the three axes. Here again no one would maintain that \( V \) is a function of \( \alpha, \beta, \gamma \); and if, as is often done, we say \( \frac{dV}{d\alpha} = L, \&c., \) the above remarks apply in all respects to these equations.
I should have thought it superfluous to dwell so much on these points if it had not appeared that writers on physical astronomy have in some instances either overlooked the distinction between real and pseudo-differentiation, or at least have failed to point it out to their readers. The only discussion of the subject which I have met with is that given by Jacobi, in the correspondence referred to.
It may be added, that in general investigations, where symbols such as \( \frac{dV}{d\alpha}, \&c. \) may be used without defining the nature of \( V \), or the precise meaning of \( \alpha, \beta, \&c. \), serious errors might be committed if it were assumed that the condition \( \frac{d^2V}{dad\beta} = \frac{d^2V}{d\beta d\alpha} \) always subsisted.
**Appendix C.**
The theorems relating to the transformation of coordinates, given in Section VI., may be made more general, and in many cases more useful, as follows:—
If \( x_1, x_2, \ldots, x_n \) be the coordinates employed in the first statement of any dynamical problem, the differential equations are comprehended in the formula
\[
\Sigma_i \left( \frac{dW}{dx'_i} - \frac{dW}{dx_i} \right) \delta x_i = 0. \quad \ldots \quad (D.)
\]
[If there be any forces, such as those arising from a resisting medium, which do not satisfy the natural conditions of integrability, then on the right-hand side of the formula (D.), instead of 0 we shall have an expression such as \( \Sigma_i (X_i \delta x_i) \); but such terms are easily introduced and allowed for separately, and do not affect the following investigation. I shall therefore here assume that they do not exist.]
In the above formula, \( W \) is a given function of \( x_1, \ldots, x_n, x'_1, \ldots, x'_n \), which may also explicitly contain \( t \).
In Section VI. the only case contemplated was that in which \( x_1, \ldots, x_n \) are independent coordinates; in which case the formula (D.) is equivalent to \( n \) separate equations, since \( \delta x_i, \&c. \) are wholly arbitrary and independent.
In practice, however, it is often more convenient to use, at first, a set of coordinates more in number than the independent variables of the problem, and therefore subject to equations of condition.
Let us assume then that the \( n \) coordinates \( x_1, \ldots, x_n \) are subject to \( r \) equations of condition,
\[
L_1 = 0, L_2 = 0, \ldots, L_r = 0,
\]
where \( L_i, \&c. \) may explicitly contain \( t \), besides the \( n \) variables \( x_i, \&c. \).
If we introduce the \( n \) conjugate variables \( y_1, \ldots, y_n \) defined by the equations \( y_i = \frac{dW}{dx'_i} \),
and take \( Z \) a function of \( x_i, \&c., y_i, \&c. \) (with or without \( t \)), defined by the equation
\[
Z = -[W] + [x'_1]y_1 + \ldots + [x'_n]y_n
\]
(the brackets indicating that \( x'_i, \&c. \) are expressed in terms of \( y_i, \&c. \)), then it follows exactly as in art. 18 (Part I.), that the formula (D.) will be changed into the system
\[
\begin{align*}
x'_i &= \frac{dZ}{dy_i} \\
\Sigma_i \left( y'_i + \frac{dZ}{dx_i} \right) \delta x_i &= 0
\end{align*}
\]
\[
\ldots \quad (E.)
\]
[In the most usual problems \( W \) is of the form \( T + U \), where \( T \) is homogeneous and of the second degree in \( x'_i, \&c. \), and \( U \) does not contain \( x'_i, \&c. \) at all. In this case \( Z \) is only \( T - U \) expressed in terms of \( y_i, \&c. \), instead of \( x'_i, \&c. \). But \( T \) is not necessarily homogeneous; in fact it is not so in problems relating to motion relative to the earth, as affected by the earth's rotation.]
Let us now suppose that the system (E.) is to be transformed by the introduction of the \( m \) independent coordinates \( \xi_1, \xi_2, \ldots, \xi_m \), and of the new conjugate variables \( \eta_1, \eta_2, \ldots, \eta_m \); where \( m = n - r \). And let it be required to investigate a theorem by means of which the transformation may be effected without recurring to the original formula (D.).
MDCCCLV.
The definitions of the new coordinates $\xi_1$, &c. will furnish $m$ equations (which may explicitly contain $t$) by means of which $\xi_1$, ... $\xi_m$ may be expressed as functions of $x_1$, ... $x_n$ (with or without $t$); and conversely, by means of these $m$ equations, together with the $r$ equations of condition $L_i = 0$, &c., the $n$ variables $x_1$, $x_2$, ... $x_n$ may be expressed as functions of $\xi_1$, ... $\xi_m$, with or without $t$. When $x_1$, ... $x_n$ are so expressed, let them be represented by $(x'_1)$, ... $(x'_n)$. We shall have then
$$x'_i = \frac{d(x_i)}{dt} + \frac{d(x_i)}{d\xi_1} \xi'_1 + \cdots + \frac{d(x_i)}{d\xi_m} \xi'_m,$$
so that $x'_i$, &c. are expressible (and in only one way) as functions of $\xi_1$, &c., $\xi_1$, &c.
If then the formula (D.) be transformed by expressing $x_1$, &c., $x'_1$, &c. in this manner, it becomes, as is well known,
$$\sum_i \left( \frac{d(W)}{d\xi'_i} \right)' - \frac{d(W)}{d\xi'_i} \delta_{\xi'_i} = 0,$$
where $(W)$ represents the result of transforming $W$ as above; and since $\delta_{\xi'_i}$, &c. are now independent, this formula breaks up into the $m$ separate equations
$$\left( \frac{d(W)}{d\xi'_i} \right)' = \frac{d(W)}{d\xi'_i}. \quad \text{(F.)}$$
Moreover, if we now define $\eta_i$ by the equation $\frac{d(W)}{d\xi'_i} = \eta_i$, and put
$$\Psi = -(W) + (\xi'_1)\eta_1 + \cdots + (\xi'_m)\eta_m,$$
where $(\xi'_i)$, &c. are expressed in terms of $\eta_i$, &c., we know already (art. 18.) that the system (F.) becomes
$$\xi'_i = \frac{d\Psi}{d\eta_i}, \quad \eta'_i = -\frac{d\Psi}{d\xi'_i}. \quad \text{(G.)}$$
Now let $P$ be a function of the $m$ new variables $\xi_1$, ... $\xi_m$, and of the $n$ old variables $y_1$, ... $y_n$ (with or without $t$), defined by the equation
$$P = (x_1)y_1 + (x_2)y_2 + \cdots + (x_n)y_n.$$
Since $\eta_i = \frac{d(W)}{d\xi'_i}$, and since $(W)$ contains $\xi'_i$, &c., only through $x'_i$, &c., we have, observing that $\frac{dW}{dx'_i} = y_i$,
$$\eta_i = y_1 \frac{dx'_1}{d\xi'_i} + y_2 \frac{dx'_2}{d\xi'_i} + \cdots + y_n \frac{dx'_n}{d\xi'_i};$$
but from the equation (x') we have $\frac{dx'_j}{d\xi'_i} = \frac{d(x_j)}{d\xi'_i}$;
consequently
$$\eta_i = y_1 \frac{d(x_1)}{d\xi'_i} + y_2 \frac{d(x_2)}{d\xi'_i} + \cdots + y_n \frac{d(x_n)}{d\xi'_i},$$
an expression evidently equivalent to $\frac{dP}{d\xi'_i}$. Thus $\eta_i$ may be defined by the equation
$$\frac{dP}{d\xi'_i} = \eta_i. \quad \text{(n.)}$$
without recurring to the formula (D.). And since each equation of condition gives, if differentiated totally with respect to \( t \), with the substitution of \( \frac{dZ}{dy_i} \) for \( x'_i \), an equation such as
\[
0 = \frac{dL}{dt} + \frac{dL}{dx_1} \frac{dZ}{dy_1} + \ldots + \frac{dL}{dx_n} \frac{dZ}{dy_n}
\]
(L.)
the \( m \) equations (\( n \)) with the \( r \) equations \( L \) (in which last \( x_i \), &c. must be expressed, after the differentiation, in terms of \( \xi \), &c.), give \( n \) equations by means of which \( y_1, \ldots y_n \) may be expressed in terms of \( \xi_1, \ldots \xi_m \), &c., and can be so expressed only in one way.
Lastly, the value of \( \Psi \) (see equations (G.)), may be obtained as follows:
Since
\[
\Psi = -(W) + (\xi_1)\eta_1 + \ldots + (\xi_m)\eta_m
\]
and
\[
Z = -[W] + [x'_1]y_1 + \ldots + [x'_n]y_n,
\]
and \( (W) \) is only \([W]\) differently expressed, we have, without reference to modes of expression,
\[
\Psi - Z = \Sigma_i(\xi_i\eta_i) - \Sigma_i(x'_iy_i).
\]
On the other hand, since \( P \) can contain \( t \) explicitly only through \((x_i)\), &c., we have
\[
\frac{dP}{dt} = \Sigma_i(y_i \frac{d(x_i)}{dt}),
\]
and also
\[
\Sigma_i(x'_iy_i) = \Sigma_i(y_i \frac{d(x_i)}{dt}) + \Sigma_i(\frac{dP}{d\xi_i}\xi'_i);
\]
hence, observing that \( \frac{dP}{d\xi_i} = \eta_i \), we obtain
\[
\Sigma_i(\xi_i\eta_i) - \Sigma_i(x'_iy_i) = -\frac{dP}{dt};
\]
and therefore, finally,
\[
\Psi = Z - \frac{dP}{dt},
\]
so that \( Z - \frac{dP}{dt} \) must become identical with \( \Psi \), when expressed entirely in terms of the new variables.
These results may be stated in the form of the following theorem. The system (E.) is transformed into the system (G.) by the following substitutions:
1. \( x_1, \ldots x_n \) are expressed in terms of \( \xi_1, \ldots \xi_m \) by means of the \( m \) equations which define the latter variables, together with the \( n-m \) equations of condition
\[
L_1 = 0, \ldots L_r = 0 \quad (r = n - m).
\]
2. \( \eta_1, \ldots \eta_m \) are defined by the \( m \) equations \( \frac{dP}{d\xi_i} = \eta_i \); where the modulus of transformation \( P \) is given by the equation
\[
P = (x_1)y_1 + (x_2)y_2 + \ldots + (x_n)y_n,
\]
\( (x_i) \), &c. being expressed in terms of \( \xi_1, \ldots \xi_m \), so that \( P \) is explicitly a function of \( \xi_1, \ldots \xi_m, y_1, \ldots y_n \), with or without \( t \).
(3) $\Psi$ is defined by the equation $\Psi = Z - \frac{dP}{dt}$, in which (after the explicit differentiation of $P$ with respect to $t$), $x_1$, &c., $y_1$, &c. are to be expressed in terms of the new variables. $y_1$, &c. are thus expressible by the help of the $m$ equations $\frac{dP}{d\xi_i} = n_i$ and the $n-m$ equations $\frac{dL}{dt} + \sum_i \left( \frac{dL}{dx_i} \frac{dZ}{dy_i} \right) = 0$.
If $(x_1)$, &c., do not contain $t$ explicitly, then $\frac{dP}{dt} = 0$, and $\Psi$ is obtained merely by expressing $Z$ in terms of the new variables.
It may be observed that the whole of the above reasoning would apply to the case in which the new variables $\xi_1$, ..., $\xi_m$ are more in number than the independent variables of the problem (or $m > n-r$), with this exception; that the $m$ equations $\frac{dP}{d\xi_i} = n_i$, together with the $r$ equations obtained by differentiating the equations of condition totally with respect to $t$, would be more than sufficient to express $y_1$, ..., $y_n$ in terms of the new variables; consequently $y_1$, &c. might be so expressed in different ways, and therefore, although the value of $\Psi$ obtained by the above rule would certainly be the same as that obtained by recurring to the original formula (D.), the form of $\Psi$ might be different, and therefore the resulting formula erroneous.
There must doubtless exist some rule for choosing $n-m$ combinations of the equations of condition in such a way as to lead to the correct forms of $y_1$, ..., $y_n$ as functions of the new variables; but I have not at present attempted to investigate it, and perhaps it would be hardly worth while. The theorem in the case in which the new coordinates are independent, may, I believe, be practically useful.
ERRATA IN PART I.
Art. 1. equation (4.), for $dx$ read $dx_i$.
Art. 10. In paragraph preceding equation (26.) omit the words "not containing $t$ explicitly."
Art. 18. equation (5), for $y_i$ read $y'_i$.
Art. 19. equation (29.), for $h_i$ read $b_i$.
Art. 24. second line after equation (L.), for "such as $h$, $k$" read "such as $f$, $g$."
Art. 30. The expressions equated to $h$, $k$, $c$, and the three terms in the left-hand column of the table of elements, should each be multiplied by $m$.
Art. 42. near the end, for "according as $\Theta$ is between $0$ and $\pi$, or not" read "according as $\Theta$ is between $\pi$ and $2\pi$, or between $0$ and $\pi$."