ADDENDUM
TO
VOLUME FOURTH.*
EQUATIONS.
Equations. 1. In all the applications of Algebra, it is not the magnitudes concerned that we immediately consider, but merely their proportions. In every class of quantities of the same kind, one being adopted as the unit of comparison, all the rest are referred to this standard, and are represented by the proportions they bear to it. The letters of the alphabet, or other symbols used in Algebra, are not, therefore, properly speaking, the representatives of magnitudes; they denote ratios, or abstract numbers, viewed, as in the fifth book of Euclid, in the most general manner, and independently of any particular system of arithmetic or numeration.
The ancient Geometry follows a different procedure. In that science the attention is in every case confined to the magnitudes under actual consideration. A general property of triangles is established, by showing, that it is true of any particular triangle that comes under the proposed hypothesis. The geometer contemplates particular instances, presenting, for the most part, relations not very complex, and easily kept in view. On this account he carries on his investigations with the greatest clearness, and is in no danger of falling into contradiction or paradox. But his science is little susceptible of general methods. If any process within the compass of the ancient geometry be entitled to that appellation, it is what is called the method of exhaustion. Every geometer perceives that all the demonstrations under this head have the closest analogy. Yet, after a hundred applications, it is still necessary, in any new case, to pursue the reasoning through all its details, without deriving assistance from any general conclusion previously obtained.
Algebra possesses a great advantage over geometry in generalizing its processes. Problems relating to magnitudes of the most different kinds, nevertheless, lead to similar expressions in numbers. Questions in geometry, in mechanics, or concerning mercantile business, are made to depend on the same rules for their solution. It may be said that algebra and the modern analysis accomplish, for all the mathematical sciences, the project, entertained by some ingenious men, of an universal and philosophical language, which, being founded on an exact scrutiny into the nature of things, and on what they possess in common, might greatly facilitate the acquisition and the extension of our knowledge.
The spirit of generalization peculiar to algebra is no where more conspicuous than in the doctrine of equations. Every determinate problem that can occupy the attention of the mathematician, is ultimately reduced to the finding of such numbers as are necessary to determine the unknown quantity or quantities, by means of the equations that subsist between those numbers, and others which are given in the question. A wide field of mathematical investigation is thus brought under a limited number of algebraic expressions.
In treating of equations it will not be necessary to begin with laying down a formal definition. We confine ourselves, in this article, to the consideration of such equations as contain only one unknown quantity. We further suppose, that the elementary operations preparatory to solution are already performed; so that the unknown quantity is clear of radical signs, and is no where found in the denominator of a fraction: likewise that all the separate terms are
* See the word EQUATIONS, p. 175 of this Volume.
Equations. brought to one side of the sign of equality, and arranged in such a manner, that the first term, which must always be positive and have unit for its index, contains the highest power of the unknown quantity or ; the second term contains the next highest power, and so on, the term which does not contain being placed last. This arrangement must always be understood when any term is distinguished by the order it stands in; but it will sometimes be convenient to write the terms in an inverted order, arranging them according to the indices of the unknown quantity.
Equations are divided into different classes or orders, according to the highest power of the unknown quantity found in their terms.
An equation of the first degree, or a simple equation, is one which contains only, without any of its powers, as .
A quadratic equation, or one of the second degree, contains the square of , as , or .
A cubic equation, or one of the third degree, contains the cube, or third power of , as , or .
A biquadratic equation, or one of the fourth degree, contains the fourth power, or biquadrate, of , as , or .
And, in general, an equation of the degree contains the power of , and the powers inferior to the , such as
A root of an equation is a value of the unknown number . Thus, if represent a number, and if its powers, when they are substituted in the equation for produce an equality between the positive and negative terms, then is a root of the equation, and it is a positive root; but if, for , we must substitute , which are the powers of , in order to obtain the like equality, then is a negative root of the equation.
What we have here called roots are more generally named real roots, to distinguish them from those expressions to which the appellation of imaginary or impossible roots has been given. As it will conduce to perspicuity, we shall always use the word root in the sense here defined, unless when imaginary or impossible roots are expressly mentioned.
From the definitions laid down, it follows that the negative roots of the equation,
are the same with the positive roots of the equation,
in which the signs only of all the terms containing the odd powers of are changed. For the same result is obtained, whether we make equal to in the first equation, or to in the second.
2. A great advantage has resulted from the practice introduced by Harriot, of writing all the terms of an equation on one side of the sign of equality. The polynomials formed by all the terms thus brought together are rational and integral functions of the
Equations. unknown quantity; and the question is, to find in what circumstances such expressions are equal to zero. The most likely way of succeeding in this research, is to resolve the functions into their most simple component factors. Harriot supposed that every rational function can be produced by the continued multiplication of binomial factors; and, in this, he has been followed by succeeding algebraists. The modern theory of equations is entirely founded on this supposition, which, although it has not been demonstrated, has yet, in some measure, been verified in the progress of the science, and by the admission of those artificial expressions called imaginary or impossible quantities. But there is a distinction between the real and impossible binomial factors of a rational polynomial. For the first are expressions complete and significant by themselves, without reference to other quantities; whereas one impossible factor necessarily supposes the existence of another, the two related expressions being such, that their multiplication produces one real factor of the second degree. Thus, every pair of impossible factors is equivalent to a real quadratic factor; and, by an unavoidable consequence of the forced supposition made by Harriot, the attention of algebraists has been drawn to the two impossible expressions, instead of being directed to the real one which they compose. In order to place the doctrine of equations and the theory of impossible roots on a solid foundation, it appears necessary to attempt the resolution of rational functions into their component factors by a rigorous analysis, free from arbitrary suppositions.
To resolve the rational function into its component factors, we must begin with inquiring, whether it can be divided without a remainder, by a division such as , or ? If it can, the proposed function will be equal to , where , the quotient of the division, is a function similar to , but of an order one degree lower. In like manner, it may be possible to reduce to a degree still lower, by means of one or more divisors of the same form; and, in certain cases, the first function may be entirely exhausted by successive binomial divisors. When this happens, the divisors will be equal in number to the exponent of the highest power of , and their continued product will be equal to . It is evident, that by multiplying together a proper number of such factors, an algebraic expression may be formed similar to any rational and integral function, and the coefficients of this product will likewise contain as many quantities to be determined at pleasure as there are coefficients in the given function. But we should reason badly if, from this process of composition, we should infer that a product arising from the multiplication of a certain number of simple factors may have any given coefficients, or will coincide with any proposed polynomial of the same degree. This is a point that can be ascertained only by a process of analysis or resolution, and by seeking all the binomial divisors any given function admits of. In fact, the cases are extremely rare in which an algebraic function can be completely exhausted by real binomial divisors. There are many polynomials which
have not a single divisor of this kind; and, in the progress of resolution, we generally arrive at a function which cannot be further divided. When this is the case, it must be tried whether a quadratic divisor, as , will not be successful in lowering the function. But here it must be observed, that such divisors are of two kinds; one, as , which can be resolved into two binomial factors; and one as , which cannot be so resolved without introducing imaginary or impossible expressions. Now, to divide by a divisor of the first kind is the same thing as to divide by the two binomial factors of which it is composed; and, therefore, it is the second kind of quadratic factors only that need be tried, or that can succeed, in lowering a function already deprived of all its simple divisors. After quadratic divisors those of the third degree would naturally come to be considered; but this is unnecessary, because algebraists have found that every rational function may be completely exhausted by simple and quadratic factors.
What has now been said naturally distributes the subject under two heads; one treating of the simple or binomial factors, and the other of the quadratic or trinomial factors, of algebraic equations.
Binomial Factors.
3. The first object of inquiry must be to find the conditions necessary, in order that a binomial quantity, as , or , shall divide a rational polynomial without a remainder. Suppose that is a divisor of the polynomial,
which we shall denote by : then we shall have
wherefore, by subtracting and dividing by , we get
Now, it is known, that the difference between any like powers of two numbers is exactly divisible by the difference of those numbers: hence all the quantities on the right-hand side of the sign of equality form an integral expression. But as does not contain , it cannot be divisible by ; it follows, therefore, that cannot be divisible by , unless ; and it is obvious, that this condition is the only one necessary. Thus, the polynomial will be divisible by , when is a positive root of the equation ; otherwise not.
Again, let the divisor be : then,
and, by proceeding as before,
here again all the divisions on the right-hand side of the sign of equality can be exactly performed: and we must, therefore, conclude that will be divisible by only when , that is, when is a negative root of the equation .
Now being a divisor of , the quotient, which we may denote by , will be a polynomial of dimensions, or one degree lower than : and we shall have
From this equation, it appears that every value of that makes equal to zero, will likewise make equal to zero: consequently, every binomial divisor of the first function will likewise be a divisor of the second. And, if has no roots, and no binomial divisors, neither will have any roots except , nor any binomial divisors except . Suppose that the polynomials and have the common root ; they will likewise have the common divisor ; and if we put for the quotient arising from the division of by , so that ; we shall have
in which equation is a polynomial of dimensions, or two degrees lower than .
It is evident, we may continue to reason in the same manner, either till, after successive divisions, we come at last to a binomial quotient, in which case the original polynomial will be completely resolved into binomial factors; or till we come to a quotient that has no roots, in which case will have no binomial factors except those previously found. We may, therefore, conclude that "a rational polynomial has as many binomial factors as it has roots, and no more; every positive root producing a factor of the form , and every negative root one of the form ; and since the number of binomial factors can never be greater than the dimensions of the polynomial, its roots cannot exceed the same number."
4. There are very few cases in which it can be known immediately and by inspection, that an equation has one or more roots. These cases depend upon the following proposition, viz. "If denote a rational polynomial having , or some integral power of , in every one of its terms, and likewise having the term that contains the greatest power of positive, a value of may be found that will make equal to any positive quantity, as ."
Suppose, first, that all the terms of are positive; then, being the first term, or that in which rises to the highest power, if , and , it is manifest, that
Therefore, while increases from 0 to be equal to , the function increases from 0 to be greater than ; and as the variations of , however irregular
Equations. they may be, are connected by the law of continuity, the function will pass through every gradation of magnitude between 0 and the greatest limit . Consequently, there is a value of between 0 and , that will make equal to .
When the terms of are not all positive, let all the positive terms except be rejected, and all the negative terms be retained, and we shall have equal to, or greater than,
But, being equal to , we have
Now, by equating the negative terms of the first expression to the terms containing the like powers of in the value of , we shall get
&c.
And hence,
&c.
Let be either equal to, or exceed the greatest of these values of ; then we shall have
Wherefore, as before, there is a value of between 0 and , that will make equal to .
From what has now been proved, we derive the following properties of equations.
1. "Every equation of odd dimensions has at least one positive root when the last term is negative, and one negative root when the last term is positive."
If the last term be negative, as in this instance,
according to what has been proved, a value of , viz. , may be found that will satisfy the condition,
then is a positive root of the equation.
When the last term is positive, as in this equation,
change the sign of the last term, and the signs of all the terms that contain the even powers of , then the polynome will become
and a value of , viz. , may be found such that
now transpose , and then change the signs of all the terms, and we shall get
which shows that is a negative root of the equation.
2. "Every equation of even dimensions having its last term negative, has two roots, one positive and one negative."
Let the equation be
and consider the polynomes,
in the latter of which the signs of all the terms containing the odd powers of are changed; then there are two values of , viz. and , such as to answer the conditions,
consequently is a positive, and a negative root of the equation.
3. "A polynome of even dimensions, which has no binomial factors, is always positive, whatever value be substituted for the unknown quantity."
Let the polynome be , or
then the last term, or that term which does not contain , must be positive; for, otherwise, the polynome would have two roots, and two binomial factors, contrary to the hypothesis. Now, if it be possible, let the polynome have a negative value when is substituted for , so that ; therefore, when , is equal to the positive quantity ; and, when , the same function is equal to ; but since passes through all degrees of magnitude between and , while varies from 0 to , it will become equal to zero when has some intermediate value; therefore the polynome has one root between 0 and , and one binomial divisor corresponding to that root contrary to the hypothesis.
It may be observed, that the converse of this proposition is not true; for a polynome of even dimensions, that has such factors as , , , may never become negative, although it is capable of being equal to zero.
5. The properties demonstrated in the last section lead to this general proposition relating to the number of roots in any equation, viz. "In any equation, the number of all the roots is even when the dimensions are even, and odd when the dimensions are odd."
For every equation has as many binomial divisors as it has roots; and if we suppose an odd number of roots in an equation of even dimensions, or an even number in one of odd dimensions, the last quotient, after dividing successively by all the divisors, would be a polynome of odd dimensions, having at least one root, which would likewise be a root of the proposed equation. Therefore the number of all the roots of an equation cannot be even when the dimensions are odd, nor odd when the dimensions are even.
Equations. And again, since every polynome is equal to the continued product of all its binomial divisors, and the quotient last found, after dividing by them all successively, we obtain the following proposition, viz.: "Every rational polynome is equal either to the continued product of as many binomial factors as it has dimensions; or to the continued product of an even or odd number of such factors, according as the dimensions of the polynome are even or odd, and a polynome of even dimensions, which, having no binomial factors, is always positive, whatever value be substituted for the unknown quantity."
6. When several of the binomial factors of an equation are equal to one another, it is said to have so many equal roots. In this case, the equation can be divided a number of times successively by the same binomial divisor. Thus, an equation which is twice divisible by , or, which is the same thing, once by , has two roots equal to ; and, if it can be divided by , it has roots equal to .
The most obvious way of finding the conditions on which the equality of the roots depend would, therefore, be to expand the divisor by the binomial theorem, and then divide the equation by it: for, after the integral quotient is obtained, the required conditions will be found by making the several parts of the remainder separately equal to zero. The number of the conditions found in this manner is equal to the exponent of the divisor; for of so many parts will the remainder of the division consist. But, in a complex operation, it is difficult to ascertain the remainder; and besides, it is not necessary to consider all the equations obtained by this process, because both the number and the value of the equal roots can be found by means of two of them only.
The inconveniences, just mentioned will be avoided by proceeding in the following manner: Let the equation be
then, if it be divisible by , the quotient will be a polynome of dimensions; and we may, therefore, suppose that the expression
is equal to the product,
In these expressions, may have any value whatever; and, therefore, the equality between them will still subsist if we substitute for , being any arbitrary number; therefore the expression
will be equal to the product,
Now, let the several powers of be expanded by the binomial theorem, and put
VOL. IV. PART II.
then the given polynome of dimensions will become
And if the like operations are performed in the polynome of dimensions; and be expanded by the binomial theorem; the product of these two expressions will become
The expression (A) being equal to the product (B), whatever stands for, the coefficients of the like powers of must be equal; and hence, by equating the terms in which is wanting, and likewise the terms that contain the first power of , we get
which proves that is a common divisor of and . If, therefore, by means of the usual process, we seek the greatest common measure of the two polynomials, , , or,
we shall obtain the factor ; and the given polynome will be divisible by ; that is, it will contain the common factor once more than the polynome contains it.
If we proceed farther, and equate the coefficients of in the expressions (A) and (B), we shall get
which shows, that is divisible by . In the same manner, it may be proved, that is divisible by , and so on. It appears, therefore, that the first coefficients of the expression (A) are respectively divisible by , , , &c.; and, consequently, we shall have
when the common root is substituted for .
If the polynome is divisible by ,
Equations. it may be proved in like manner, that will be a common divisor of X and Y.
We may, therefore, lay down the following rule, for finding all the double, triple, &c. divisors of any given polynome X: "Find R, the greatest common measure of X and Y, and resolve it into its elementary factors; then each of these factors will be contained in X once more than in R."
Number of Positive and Negative Roots. 7. If it be required to find how many of the roots of an equation are positive, and how many are negative, we have for this purpose the rule first published in the Geometry of Descartes. This celebrated rule seems to have been discovered by induction; at least, its author gave no demonstration of it, and disputes arose about its true import. It was demonstrated for the first time by Du Gua, in the Mémoires de Paris; but many other demonstrations of it have since appeared, of which that of Segner, in the Mémoires de Berlin 1756, is not only the most simple, but probably the most simple that will ever be invented.
Segner deduced the rule of Descartes from the following analytical proposition, viz.
"If any rational polynome be multiplied by , the changes from one sign to another, from to , and from to , will be at least one more in the product, than in the given polynome; and if it be multiplied by , the successions of the same sign, of to , and of to , will be at least one more."
Let the proposed polynome be
then, according to the usual process, the product of the polynome by will be found by adding these two lines, viz.
the signs of the several terms remaining unchanged in the first line, and being all changed in the second line. It is evident, therefore, that the terms of the product will have the same signs with the respective terms of the proposed polynome, except when a coefficient in the second line is greater than the one above it, and likewise has a contrary sign; the sign of the last term of the product being always the same with the sign of the last term of the second line. Now, beginning on the left hand, pass over the terms of the first line, so long as they have the same signs with the terms of the product. When this ceases to be the case, the signs in the product will be the same as in the second line, and contrary to those in the first line; wherefore descend to the second line, and pass along its terms till the signs in the product are again the same as those in the first line, and then ascend to that line. Continue thus descending and ascending alternately, till all the terms in both lines are taken in. At the conclusion, it is evident, that the descendings are always one more than the ascendings, because the passing from one line to another both begins and ends with descending.
Equations. If we descend from , in the first line, to , in the second line, it is evident, that the signs of and , in the first line, will be the same, both being contrary to the sign of , in the second line. Therefore, in the given polynome, the first and second terms have the same sign. But in the product, the like terms have contrary signs; for the second term of the product has the same sign with in the first line, and the third term of the product has the same sign with in the second line. Thus, it appears that a variation, from one sign to another, is introduced in the product, instead of a continuation of the same sign that takes place in the given polynome; and the same thing will happen at every descending.
In ascending from the second line to the first, there may either be a continuation in the product instead of a variation in the given polynome, or the contrary: but one of these two must take place.
Now, so long as we keep on the first line, the signs in the product are the same with those of the given polynome; and, so long as we keep on the second line, the signs in the product are contrary to those in the polynome. In both cases, therefore, the variations from to , and from to , are the same in the product and in the polynome. Every descending introduces a variation in the product, instead of a continuation that takes place in the polynome; and although it be supposed that every ascending introduces a continuation in the product instead of a variation that exists in the polynome, yet, on the whole, the variations introduced must be one more than the continuations, because the descendings are one more than the ascendings.
Again, if the given polynome be multiplied by , the product will be the sum of these two lines, viz.
Here the terms of both lines have the same signs; and, as before, the signs in the product will be the same with the signs of the proposed polynome, unless when a coefficient in the second line is greater than the one above it, and likewise has a contrary sign; the sign of the last term of the product being always the same with the sign of the last term in the second line. Now, if we pass along all the terms of both lines, descending from the first line to the second, when the signs in the product change from being the same with those in the given polynome, to be contrary to them; and ascending from the second line to the first, when the signs in the product change from being contrary to those in the polynome, to be the same with them; it is evident, that the descendings will be one more than the ascendings, as in the former case.
If we descend from in the first line, to in the second line, the two terms
Equations. and in the first line, will have different signs; for, on account of the descending, has a contrary sign to the term below it, and, consequently, to in the first line. Therefore the second and third terms in the polynome have different signs. But the like terms in the product have the same sign; for the second term in the product has the same sign with in the first line; and the third term of the product has the same sign with in the second line. Thus there is a continuation of the same sign introduced in the product, instead of a variation from one sign to another that takes place in the polynome; and the same thing is true at every descending.
In ascending from the second line to the first, there may either be a variation in the product instead of a continuation that exists in the polynome, or the contrary. But one of these two must take place.
Now, it is evident, that, except at the descendings and ascendings, there is the same number of continuations of the same sign, and the same number of variations from one sign to another, in the product and in the given polynome. Every descending introduces a continuation in the product instead of a variation existing in the polynome. And even if we suppose that every ascending introduces a variation in the product instead of a continuation that takes place in the polynome, yet, on the whole, there will be one continuation more in the product than in the polynome, because the descendings are one more than the ascendings.
In the preceding demonstration it is supposed, that all the ascendings have a contrary effect to the descendings, by which means there is introduced in the product the least possible number of variations from one sign to another in the one case, and the least possible number of continuations of the same sign in the other. But if, in the first case, we suppose that, at one ascending, there is a variation in the product, and a continuation in the polynome, this will add one to the variations in the product, and one to the continuations in the polynome; so that, the variations in the product will now exceed those in the polynome by three, namely, by two more than in the circumstances supposed in the demonstration. And if we extend the like reasoning to two, three, &c. ascendings, the variations in the product will exceed those in the polynome, respectively by five, seven, &c. The like conclusion is evidently true of the second case, mutatis mutandis; and hence the preceding proposition, when it is generalized as much as it can be, may be thus enunciated: "If any rational polynome be multiplied by , the variations from one sign to another in the product will exceed those in the polynome by one, or three, or five, or by some odd number; and if it be multiplied by , the continuations of the same sign in the product will exceed those in the polynome by one, or three, or five, or by some odd number."
Equations. Now, if we conceive that any rational polynome is resolved into its binomial factors; there will be a factor of the form for every positive root, and one of the form for every negative root; and when all the factors are multiplied together in order to reproduce the polynome, it follows, from what has been proved, that the product will contain at least one change from to , or from to , for every factor of the form , or for every positive root; and at least one succession of to , or of to , for every factor of the form , or for every negative root. Hence this rule, viz. "An equation cannot have more positive roots than it has variations from one sign to another; nor more negative roots than it has continuations of the same sign."
In general, this rule merely points out limits which the number of the positive and negative roots of an equation cannot exceed. But it gives no criterion by which we can certainly know that an equation has even one positive or one negative root, much less does it ascertain the exact number of each kind.
But if the proposed equation can be completely resolved into real binomial factors; in which case the total number of its roots will be equal to its dimensions, and, consequently, to the sum of all the variations from one sign to another, and of all the continuations of the same sign; it is evident, that the number of the positive roots will be precisely equal to that of the variations, and the number of the negative roots precisely equal to that of the continuations. In this case, therefore, and in this case only, the rule of Descartes is perfect, ascertaining the exact number of each kind of roots in the proposed equation.
We subjoin some consequences that result from the principles laid down.
"If a polynome of dimensions be multiplied by , or ; and, in the first case, if the number of variations from one sign to another be augmented by the odd number ; or, in the second case, if the number of continuations of the same sign be augmented by ; then the total number of the roots, positive and negative, of the proposed polynome, cannot be greater than ."
For, when the multiplier is , let denote the number of the variations from one sign to another, in the proposed polynome ; then will be the total number of variations in the product ; consequently, the total number of continuations in will be equal to , or . But a polynome cannot have more negative roots than it has continuations of the same sign; wherefore, the number of the negative roots of cannot be greater than . Now, the two polynomials and have the same negative roots; and hence the number of the negative roots of cannot exceed . But the number of the positive roots of cannot exceed ; consequently, the total number of the roots of cannot be greater than ; that is, than . And the proposition may be demonstrated in a similar manner when the multiplier is .
"If one, or several consecutive terms, of an equa-
Equations tion be wanting; and, if the next terms on each side of those wanting have the same sign, the equation cannot have as many roots as it has dimensions."
Let the equation be , and denoting the two parts on each side of the terms wanting. Having multiplied by , the product will be ; and it is evident that we may consider , , , as separate polynomials; hence, in each of the polynomials and there will be at least one more variation from one sign to another, than there is in and . Again, in the polynomials , there will be a continuation of the same sign, in passing from to ; because the last term of is supposed to have the same sign with the first term of . On the other hand, because the last term of has a contrary sign to the last term of ; and the first term of , the same sign with the first term of , it follows that, in the polynomials , there will be a variation from one sign to another, in passing from to . Therefore, on the whole, there will be at least three variations from one sign to another in , more than there is in : Consequently, by the last proposition, the number of all the roots of the proposed equation must be at least two less than its dimensions.
Number of Real Roots in an Equation. 8. An important inquiry is, to find how many roots, that is, real roots, there are in any proposed equation. Much has been written on this subject, but not very successfully. No general method has been found that is practically useful. Many criteria have been contrived, by means of which we can certainly discover that roots are wanting in an equation; although we cannot infer the existence of the roots when the same criteria fail. But great value cannot be attached to such rules; since they are neither sufficient guides in practice, nor have much tendency to throw light on the theory.
Waring first, and nearly about the same time Lagrange, proposed a method which is successful in finding the conditions necessary in order that an equation have as many roots as it has dimensions; and which, in all cases, points out a limit that the number of the roots cannot exceed. This is effected by an auxiliary equation, and merely by the signs of its coefficients, without requiring the computation of any of its roots. This procedure answers very well for equations of the third and fourth degrees; and it has even been extended by Waring to those of the fifth degree; but, in this last case, the calculation is very long, and would be altogether impracticable in the higher orders of equations. It is also not a little probable that this rule employs more conditions than are absolutely necessary for determining the point in question; there being great reason to think that some of them are implied in the rest, and are deducible from them. The method here alluded to depends upon the theory of trinomial divisors; and, as it is much referred to by algebraists of the present day, we shall, in a subsequent part of this article, briefly explain the principles on which it is founded.
There is also another way of finding the number of real roots in an equation, which is general for all orders, and requires the solution of such equations
only as are of lower dimensions than the one proposed. As to practical utility, indeed, this method is of little avail in equations passing the third and fourth degrees, or, at most, the fifth degree; but it is, nevertheless, not without interest; both because it is founded on the principles essential to the inquiry, and because it leads to some useful properties. Algebraists differ from one another in their exposition of this method. Some derive it from the theory of Harriot, namely, that every rational polynomial is the product of as many binomial factors as it has dimensions; in which manner of proceeding the impossible roots are the occasion of uncertainty and embarrassment. Others, again, deduce it from the variations of magnitude which a rational polynomial undergoes when the unknown quantity is made to pass through all possible degrees of increasing and decreasing. This last mode of investigation seems greatly to deserve the preference, being in reality the only one that is entirely unexceptionable, and requires no principles foreign to the research.
Suppose an equation, , which we may denote by : substitute in place of , and put
then the function will be transformed into
If we use the notation of the differential calculus, the same transformation will be thus represented,
which has the advantage of pointing out in what manner the several functions, , , &c. are derived from one another, and from the first function , or .
Let , &c. denote the real roots of the equation , or , arranged according to the order of their magnitude, that is, greater than , greater than , and so on. In like manner, observing the same order of arrangement, let ,
&c. represent the roots of , or ; and
for the sake of simplicity, suppose that the equation has no equal roots.
The relations, which the variations of the polynomial bear to the variations of , depend upon the functions , , &c. and principally upon the first of these. If be positive, will decrease as decreases; if be negative, will increase as decreases; and if pass from being positive to become negative, or the contrary, then continuing to decrease, will change from decreasing to increasing, or the contrary; that is, it will attain a minimum or a maximum value. What is here said is the foundation of the method taught in the differential calculus, for finding the maxima and minima of algebraic quantities.
Now, when has a value great enough, the poly-
Equations. nome will have the same sign with its first term, that is, it will be positive; and it will continue positive so long as is greater than , the greatest root of the equation ; after which it will become negative. Hence, while decreases to the limit , the polynome , which is positive when is sufficiently great, will continually decrease; and when , will pass from decreasing to increasing, or it will have a minimum value. Now, if this minimum be positive, has not decreased to zero, and the given equation will have no root greater than . If , then, because the two equations, and , take place at the same time, the given equation will have two roots equal to . (Sect. 6.) Lastly, if be negative, the polynome has decreased from being positive to be negative; and therefore it has passed through zero, and the given equation will have one root, viz. greater than .
As continues to decrease from to , the polynome being negative, will continually increase. At the limit , is first equal to zero, and then becomes positive; and will therefore change from increasing to decreasing, or will attain a maximum value. If this maximum be negative, the polynome has not increased to zero, and the given equation will have no root between and : if , it will have two roots equal to ; and if be positive, , in increasing from the negative quantity to the positive quantity , must have passed through zero, and the given equation will have one root, viz. , between and .
In like manner, continuing to decrease from to , the polynome will decrease from the maximum to the minimum : if be positive, the proposed equation will have no root between and ; if , it will have two roots equal to ; and if be negative, it will have one root, viz. , between the limits and .
As the function must become a minimum or a maximum, or must pass from decreasing to increasing, or the contrary, between every two contiguous roots of the equation ; and as the limits where the changes take place are determined by the roots of the equation ; it follows that there must be at least one root of this last equation between every two contiguous roots of the first. Hence the equation cannot have as many roots as dimensions, unless the equation likewise have as many roots as dimensions; and, in general, we have this rule, which determines a limit that the number of the roots of an equation cannot surpass, although it may fall short of it: "The roots of an equation cannot exceed in number those of the equation , by more than one."
But if we can find the roots of the equation , which is always one degree lower than the proposed equation, we can thence discover exactly both the number and the limits of the roots of this last. For let be substituted in the polynome , and let the results be arranged in order, viz.
Equations. if these quantities are alternately negative and positive; the first, third, fifth, &c. which are all minima, having the sign minus; and the second, fourth, &c. which are all maxima, having the sign plus; then the proposed equation will have just one root more than the equation . When some of the conditions fail, the roots of the proposed equation will fall short of the number specified. If one maximum have the sign minus, or one minimum the sign plus, two roots will be wanting in the proposed equation; and, in general, as many roots will disappear, as there are consecutive minima and maxima that have the same sign deducting one; unless the minima and maxima precede the greatest root, or come after the least root, in which cases there will be as many roots wanting as there are minima and maxima that have the same sign.
Since the series of functions, are derived similarly from one another, we may prove, as has been done with respect to the two first, that the roots of any one are contained between the roots of that which follows it. Hence, if the given equation have as many roots as dimensions, every equation in the series will likewise have as many roots as dimensions; and if there be roots wanting in any one, there will be at least as many wanting in every equation preceding it in the series.
The connected equations necessarily terminate in one of the first degree, which gives a limit between the two roots of the quadratic immediately before it; in like manner, the roots of the quadratic are the limits of the roots of the cubic preceding it; and, in this manner, by going through all the successive equations, we shall finally arrive at the limits of the roots of the proposed equation. This process has been called La Methode des Cascades; but the length of the calculations render it useless in practice.
The procedure explained above would enable us to find the number of roots in an equation of any order, if we were in possession of rules for solving equations of the inferior degrees. For want of such rules, the practical advantage that can be derived from it is very limited. Mathematicians have, therefore, turned their attention to determine the point in question in a way that should not require the resolution of equations. They have sought to investigate rational functions of the coefficients, which, by means of the signs they are affected with in every particular case, might indicate the number of roots the equation possesses. Of this nature is the method which Du Gua has given in the Memoires de Paris, 1741, for finding the conditions necessary in order that an equation have as many roots as dimensions. By a process analogous to that of Du Gua, M. Cauchy, in an excellent Memoir, published in the sixteenth volume of the Journal de l'Ecole Polytechnique, has shown not only that the total number of the roots may, in every case, be discovered, but likewise, that the numbers of the positive and negative roots may be separately ascertained. The principles of both these methods are to be found in the theory explained above; but, as many considerations of some intricacy are involved in them, a particular account of them would exceed the limits of this article.
In what goes before, we have supposed that all the roots of the equation are unequal; and, in order to complete the theory, it remains to notice the consequences that follow when the case is otherwise. Suppose, then, that : And, in the first place, if be a root of the equation , there will, in reality, be no exception to the general conclusion; because, in this case, it is known that the polynome will be divisible by . (Sect. 6.) Now, the case just mentioned being set aside, if be an even number, the polynome , or , will be equal to zero when ; but it will not change its sign when , from being less, comes to be greater than . Hence the polynome will neither attain a maximum nor a minimum value at the same limit; and it will have no root, either between and the next greater root of the equation , or between and the next less root of the same equation. It appears, therefore, that, when is even, the number of the roots of the equation , and their limits, will depend entirely upon the equation . Again, when is an odd number, the polynome will be equal to zero when , and it will likewise change its sign when is taken on contrary sides of that limit: Consequently, when , the polynome will be a maximum or a minimum; and the nature of its roots will depend upon the equation . It is evident that we may extend the same conclusions to any two adjacent equations in the series,
provided the one which stands lower in the series is reducible to the form ; and that is not a common divisor of both. We may likewise draw this general inference from the principles that have been explained, viz. "If, in the series of connected equations, any one be found which is divisible by , or , at the same time that is not a divisor of the equation immediately preceding, there will be at least roots wanting in this last equation, and in all that stand before it in the series."
The following not inelegant proposition is a consequence of what has just been proved: "The number of the roots of an equation of dimensions, in which or , consecutive terms, are wanting, cannot be greater than ."
Let the equation be represented by
supposing that , or terms, are wanting between and . Therefore, if the first term of contain , the last term of will contain , or . Now, in the series of equations, we shall at length arrive at one from which all the quantities of are exterminated; which equation, if we use the notation of the Differential Calculus, is equivalent to
and it is divisible by , or : And, as the one immediately preceding it in the series, viz.
is not divisible by , it follows from what has been shown, that there will be at least roots wanting in this last equation, and in all those that stand before it; consequently, the proposed equation cannot have more than roots.
From this we learn, that it is not always possible, at least by any operations with real quantities, to transform an equation into another in which any proposed number of the intermediate terms shall be wanting. For the terms to be taken away may be such, that the transformed equation could not have the same number of real roots as the one given; but it is impossible, without introducing imaginary quantities, to transform an equation with a certain number of real roots into another with a different number of such roots.
9. In what goes before, we have sought for the roots and binomial divisors in the nature of the polynome. We are now to take an inverted view of the subject, and to consider a rational polynome as produced by the continued multiplication of as many binomial factors as it has dimensions; from which source there arises an interesting set of properties.
If we take the words, root and binomial factor, strictly in the sense in which we have hitherto used them, and as denoting real quantities only, nothing is more certain than that all polynomials cannot be generated by binomial factors. But it will afterwards be proved, that every rational polynome can be completely exhausted by binomial and trinomial divisors; and if we admit the resolution of every trinomial divisor into two imaginary factors, we shall arrive, with all the rigour of which the investigation is capable, at the genesis of equations supposed by Harriot, which represents them as entirely composed of binomial factors, possible or impossible. Besides, in extending to all equations the conclusions obtained from the manner of generating them, it may be observed, that the properties so obtained, being ultimately expressed in functions of the coefficients from which the roots and generating factors have disappeared, are in a manner independent of the method of investigation. Such is the structure of the language of algebra, that the conclusions to which it leads, although deduced by reasoning from a hypothesis not strictly general, are nevertheless true in all cases, when they are finally disengaged from what is peculiar in the analysis.
Suppose a polynome, as
which is produced by the multiplication of the factors,
then, by actually multiplying the factors, and equating the like terms of the equivalent expressions, we shall get
Hence, it appears that the coefficient of the second term of the polynome, or , is equal to the sum of all the roots with their signs changed; the coefficient of the third term, or , to the sum of all the products of every two roots; the coefficient of the fourth term, or , to the sum of all the products of every three roots with their signs changed, and so on, the signs of the roots being always changed in the products of an odd number; and, finally, the last term is the product of all the roots with their signs changed or not, according as their number is odd or even.
It is evident, that the ultimate product of the binomial factors will always be the same, in whatever order they are multiplied; and hence the coefficients of the polynome will consist of the same products, however the roots be interchanged among one another. Expressions of the kind just mentioned, which have constantly the same value, whatever change is made in the order of the quantities they contain, are called invariable functions and symmetrical functions. The coefficients of an equation are the most simple symmetrical functions of the roots, from which it may be required, on the one hand, to deduce all other functions of the like kind, and, on the other, to go back to the roots themselves. Most inquiries relating to equations are connected with one or other of these two problems; of which the first, like most direct methods, is attended with little difficulty, and has been completely solved; while the other, past equations of the fourth degree, has eluded all the attempts of algebraists.
After the coefficients of the polynome, the next most simple symmetrical functions of the roots are the sums of the squares, cubes, &c. In the universal arithmetic of Sir Isaac Newton, a very elegant rule is given for computing the sum of any proposed powers of the roots; and as this rule is a fundamental point in the theory of equations, we subjoin an elementary investigation of it.
Of the binomial factors before set down, let the first be left out, and, having multiplied the rest together, let the product be,
in which expression is the sum of all the roots except the first ; is the sum of the products of every two of them, and so on. Now, multiply by , and the product will be equivalent to the given polynome: hence we get
Again, multiply these formulae in order by , , , &c.; then
and, by adding and subtracting alternately, we get
in which expression is the sum of all the products of dimensions of the roots leaving out the first .
In like manner, if we leave out the factor , and multiply all the rest, and proceed as before, we shall get
the symbol being the sum of the products of dimensions of all the roots except the second .
And, if we next leave out the factor , and follow a like procedure, we shall get
where represents the sum of the products of dimensions of all the roots except the third .
If we proceed similarly till every one of the factors is left out in its turn, and then add all the results, we shall get
in which expression is written for the sum of the powers of the roots; , for the sum of the powers, and so on.
Every product in any one of the aggregate quantities, , is found in , which is the sum of the products of dimensions of all the roots: and, hence, it is easy to perceive that the sum of all the aggregates must be a multiple of . Take any product in : then that product will not be contained in of the quantities ; because, in so many of them, one or other of the letters of the product
Equations. will be wanting; but the same product will be contained once in every one of the remaining quantities, because, in every one of these, all the letters of the product will be contained. Every product in is, therefore, repeated times in the sum of the quantities , , , &c.: consequently,
Substitute this value in the formula obtained above, and, after transposing and cancelling , which appears with contrary signs, we shall get
This is the rule of Sir Isaac Newton, and contains all his particular formulæ, as will readily appear by putting 1, 2, 3, &c. successively for .
The preceding formula will enable us to compute, in succession, the sums of all the positive powers of the roots, both when is less, and when it is greater than the dimensions of the equation. But, in applying the formula in the latter case, we must observe that all the coefficients of the polynome after are wanting, or equal to nothing.
If, in the first step of the preceding investigation, we take the coefficients that follow , we shall get
And, by first dividing by , , , &c. in order, and then subtracting and adding alternately, we shall obtain
In a similar manner, we get
Therefore, by adding all these formulæ, and substituting for the sum of , , &c. the value of it already found, we shall finally obtain
the symbols , , &c. being put for the sums of the negative powers of the roots according to the indices underwritten. This formula will enable us to compute the sums of the negative powers of the roots.
If, in the formula for the sums of the positive
powers of the roots, we make successively equal to Equations. 1, 2, 3, &c. we shall get
and from this we learn that the quantities , , , &c. may be found by means of this expression, viz.
for if we multiply the series on the right-hand side of the sign of equality, by the denominator of the fraction on the other side, and then equate the coefficients of the product to the like coefficients of the numerator, we shall obtain the very formulæ set down above. Hence the sums of the powers of the roots expressed in terms of the coefficients of the polynome, will be found by developing the fraction in a series. In effecting the development different analytical methods may be followed; and the quantities sought will thus be obtained by different rules, or exhibited in expressions of different forms, such as those given by Waring, Vandermonde, Euler, and La Grange.
And in like manner if, in the formula for the sums of the negative powers of the roots, we make successively equal to , , , &c. we shall get
from which it appears that the values of all the quantities , , , &c. will be obtained by means of this expression, viz.
Two kinds of quantities only can enter into any rational and symmetrical function of the roots of an equation; and these are, the sums of the like powers of the roots, and the sums of such products as, &c. which arise from multiplying different
powers of the roots, two and two, three and three, &c. We shall now shortly point out in what manner the latter sums are deduced from the sums of the like powers, for the computation of which rules have already been given; by which means we shall be enabled to find the value of any proposed function of the kind above mentioned.
Equations. Let it be required to find the sum of all the products, such as , that arise from combining two powers of the roots in all possible ways; which sum may be denoted by the symbol . Now it is evident that the product, , will contain two sorts of terms only, namely, powers of the roots, such as , and the products of which the sum is sought; therefore
Next let it be required to find , or the sum of all the products of three powers of the root. Now will contain three sorts of terms, namely, products, such as and , in which two roots only are combined, and the products of which the sum is required; therefore
but, according to the last case,
wherefore
In like manner, when four different powers of the roots are multiplied together, we get
and we have only to apply the preceding case, in order to obtain the expression of the quantity sought in terms of the sums of the like powers of the roots.
According to the procedure just explained, the case where any number of powers are multiplied together, is reduced to the simpler case where the powers multiplied are one less. There would be no great difficulty in deducing a general formula for the sum when the products contain any proposed number of different powers; but this would lead to calculations incompatible with the length of this article; and it may be doubted, whether the use of such a formula is preferable in any cases likely to occur in practice, to the application of the principles here laid down.
The theory of symmetrical functions is of the most
VOL. IV. PART II.
extensive use in every branch of the doctrine of Equations. equations. Thus, if it be required to form an equation, the roots of which shall be any combinations of the roots of a given equation; it is manifest, that the coefficients of the equation sought will be symmetrical functions of the roots of the given equation; and hence they may be found, by calculating these functions in terms of the coefficients of the given equation.
The theory of symmetrical functions is also of use in approximating to the roots of numerical equations. Sir Isaac Newton seems to have had this application in view, in giving his rule for computing the sums of the like powers of the roots. He observes, that the powers of a great number increase in a much higher ratio than the same powers of less numbers; and hence, the power of the greatest root of an equation will approach nearer to the sum of the powers of all the roots, as is greater. Wherefore, neglecting the distinction between positive and negative roots, if we calculate , and then extract its root, we shall have an approximation to the root of the equation greatest in point of magnitude; and the approximation will be so much more accurate as is greater.
But there is a more convenient way of approximating to the greatest and least roots of an equation, by means of symmetrical functions. For, since
we have
Now, being the greatest root, the fraction on the right-hand side will approach to unit when is sufficiently large, in which case will be nearly
equal to . Hence, if we compute a series of consecutive sums, viz. ; the values
will approach nearer and nearer to the greatest root of the equation.
In like manner, if we take the sums of the negative powers of the roots, we shall have
from which it appears that will approximate so much more to , the least root of the equation, as is greater.
10. We proceed next to consider the trinomial divisors of a given polynome; and, in order to avoid reference to other treatises, we shall begin with a short investigation of a preliminary point.
We have this identical expression,
consequently,
and, again,
Now, using the letters H and G as the characteristics of the particular functions under consideration, let
or, by expanding the binomial quantities in series,
then, by means of these notations, the preceding expression will be thus written, viz.
This equation is identical; that is, when the expressions on both sides of the sign of equality are expanded in series of terms containing the powers of , they will consist of the same quantities with the same signs. It is evident, therefore, that the equation will still be identical, if we change into : for, by this change, the simple quantities of the developed expressions will not be affected; and no alteration will be produced, except in the signs of the odd powers of , which will now be contrary to what they were before. We therefore have
in which equation it is to be observed that the functional expressions are not, as in the former instance, susceptible of an abridged algebraic notation, at least without introducing a new sign; but they can be exhibited in series, viz.
Now put , , ; and let
denote an arc, depending, in a certain manner, not yet discovered, upon the arc and the index ; then, in consequence of the equation obtained above, we shall have
Again, multiply both sides of the same equation last referred to by ; then
but, since the equation alluded to is general for all the values of , we may write for ; and thus we get
therefore, by comparing the two values of ,
and finally, by substituting the values of the functions in terms of the arcs, , , , we shall obtain
Now, if we make successively equal to 1, 2, 3, &c. the results will be,
&c.
and generally, .
Thus it appears that
or, if we take the expanded expressions of the functions,
in which formulae, , .
The functions here designated by the letters H and G may be expressed by means of the imaginary sign; for we have
And, in the case of , the formulæ obtained above are equivalent to the expressions known in analysis since the time of Dr Moivre, viz.
But the mode of investigation we have followed is rigorous; and it has the advantage of leading to the true import of the imaginary sign, and of putting in a clear light its real effect in analytical operations. The real use of this sign may be shortly described by saying that it performs for even and odd functions the same office that the negative sign does for ordinary functions; in other words, when, by means of the ordinary operations of analysis, it has been proved that an even or odd function of an indeterminate quantity is equal to zero, it is by means of the impossible sign that the same equation is extended to the case when the square of the indeterminate quantity is negative. Every function of the indeterminate quantity may be thus represented, viz.
and the substitution of in place of , has no other effect than to change the preceding expression into the one following, viz.
and from this it is obvious, that the same operations which, in the one case, lead us to the equations and , will, in the other, necessarily conduct us to the equations and . It is to be observed, too, that the truth of the two latter equations is involved in that of the former. For the former equations cannot be generally true for all values of , unless they are identical, or consist of equal quantities with opposite signs that mutually destroy one another; in which case the latter equations will also be identical. The sign of impossibility, as it has been called, is, therefore, one as truly significant as any other in analysis. It has, indeed, no consistent meaning when we consider it as only affecting , or the indeterminate quantity to which it is joined; but it becomes perfectly intelligible when we contemplate the real changes produced by it in the functions of even and odd dimensions, in which its conclusions are always ultimately expressed. When the true import and real effect of the imaginary sign are clearly apprehended, the truth of its conclusion is no longer doubtful or mysterious, but follows as a necessary consequence of a fundamental principle of analytical language. Proceeding on this principle we may even lay aside the imaginary character; and, in every particular case, with the assistance of a proper notation, arrive, by the ordinary operations, at the same conclusion to which it leads, as has been done in the preceding instance. It is to be observed further, that the imaginary arithmetic is not merely a short method of calculation convenient in practice,
and that may be dispensed with; it is strictly a necessary branch of analysis, without which, or some equivalent mode of investigation, that science would be extremely imperfect. The equations and , are unchangeable by any operations with the signs commonly received, by the use of which alone it is impossible to deduce, in a direct manner, the related equations and : although the latter are equally true, of as frequent occurrence, and as extensive application, as the former. Without the impossible sign the operations of algebra would, therefore, be defective; since there are analytical truths that could not be investigated in a direct manner by means of the elementary signs usually admitted. It is to supply this defect that the Imaginary Arithmetic has been introduced, and has grown up to be an extensive branch of analysis; advancing at first by slow steps, because the true import of the character it employs, and the real effect of its operations, were neither clearly perceived nor fully understood. But, having premised what is conducive to our present purpose, we proceed to the investigation of the trinomial divisors of rational functions.
11. Every polynome of odd dimensions, having at least one binomial factor, it may, by dividing by that factor, be reduced to another polynome one degree lower. And hence, in this part of our subject, we may confine our attention to polynomials of even dimensions. We may also suppose that the even polynomials, under consideration have no double, triple, &c. factors of any kind; since, in case any such are present, they can be found separately and eliminated by division.
Suppose, then, that represents any polynome of even dimensions; let be substituted in place of ; and, by using the notation of the differential calculus, the given polynome will be transformed into
Since is an even polynome, the equation will be one of odd dimensions, having at least one root. Let be the sole root of , when it has but one; and the greatest root, when it has several; then, because , the transformed function will become
It readily appears, from what was formerly proved (Sect. 8), that , the greatest root of , exceeds the greatest root of any of the equations, , , &c.: and, because, in any equation, the substitution of a value greater than the greatest root must give a positive
result, all the quantities &c. will
be positive. With regard to it may be either positive or negative, but not equal to zero; since this last case can happen only when the polynome has equal roots. The original polynome will, therefore, assume this form, viz.
in which expression &c. represent any positive quantities.
The most interesting proposition in the branch of the subject under consideration, is to prove that every polynome of even dimensions has a quadratic divisor, either of the form , which admits two real binomial factors, or of the form , which has two imaginary factors. By the preceding transformation this proposition is brought under two cases, according as is affected with the sign minus or plus; the quadratic divisor being always of the form in the first case; and always of the form in the other case; a distinction that agrees with what was before proved, Sect. 8.
Now the first of these cases is attended with no difficulty. For two values of , one negative and one positive, may be found that will satisfy the equation, Sect. 4.
Of these values, it is obvious that the negative one will be always greater than the positive one; and they may, therefore, be represented by and ; wherefore, the polynome
will be divisible by each of the binomial factors,
and likewise by the quadratic factor,
produced by their multiplication.
But the same mode of reasoning will not apply when has the sign plus; in which case the demonstration must be deduced from other principles.
12. If we put
the transformed polynome, supposing to have the sign plus, will become
Let be a quadratic divisor of this polynome, and put , or ; then, by substituting for , and writing all the terms of the transformed function in two lines, one containing all the even, and the other all the odd, powers of ; the polynome will be equal to
By the same substitution of for , the divisor
is changed into the binomial quantity ; which will be a divisor of each of the preceding lines, if , when it is substituted for , render each of them equal to zero, Sect. 3. Hence we obtain the two following equations, viz.
If two numbers, and , can be found that will satisfy these equations, it is evident that will be a divisor of each of the two lines that compose the transformed function ; consequently, it will be a divisor of the sum of both lines, or of the function itself; that is, will be a divisor of the proposed polynome . We are now to prove that two such numbers may be found.
Substitute for in the equations (C), being a quantity to be afterwards determined; and, in order to shorten expressions, put
And the two equations (C) will be thus written, viz.
In these equations and are always supposed to represent positive numbers, in which case the equation cannot take place when is greater than ; for then all the terms of would be positive.
Considering as a function of , the part of it that does not contain is evidently
which is always positive. The highest power of contained in the same function is ; and we shall obtain all the terms of that contain this power by putting for in the expression,
which terms are therefore as follows, viz.
Now, in the expression obtained in Sect. 10, viz.
if we put , and divide both sides by
from which formula it follows, that the polynome on the right-hand side of the sign of equality will be equal to nothing, where , being any integer number less than , zero not included. Wherefore the first, third, &c. roots of the polynome will be expressed by the formula
representing any odd number less than ; and the second, fourth, &c. roots by the formula
being any even number less than . And it is evident that the polynome will be negative for every value of that lies between any odd root and the next even root, that is, for every value between these limits, viz.
Thus, an indefinite number of values of may be found that will make the polynome negative.
Having assumed such a value of , let any positive number whatever be substituted for , and will be converted into a rational function of ; the greatest power of , or , being odd, and having a negative coefficient; and the term which does not contain being positive. Wherefore, at least, one positive value of may be found that will satisfy the equation ; and, as has already been observed, this value of will be such as to make a positive quantity. It is possible indeed that, in the equation , there may be several values of for every assumed value of ; but we here confine our attention to the least positive value, which is distinguished by this circumstance, that it vanishes with the absolute term of the equation, or with ; whereas, when is equal to zero, all the other roots of the equation have finite values depending upon the given coefficients.
Now, if we suppose to increase from zero to infinity, and assume two values, and , very near one another, according to what has been proved, we shall have the corresponding values, and , such, that the equation will be satisfied by substituting both and , and likewise and . Hence, because , and , we get
Again, if we substitute first and , and then and , in the function , we shall get
But, by comparing the functions and , the following properties will readily be discovered, viz.
whence,
Consequently,
and, if we observe that , and substitute the value of found above, we shall get
in which expression all the quantities are essentially positive, except , which is always negative, as may be thus proved.
The quantity remaining invariable, if we make , the function will be positive; for it is equal to
and the same function will continue positive, while increases from zero to the least root of the equation . At this limit, is first equal to zero, and then becomes negative; it must, therefore, be decreasing, and consequently is negative. It
may indeed happen, that, for particular values of , the coefficients of may be such, that and shall be both equal to zero at the same time; but, in such cases, it will readily appear, that and
will likewise be equal to zero. Wherefore will be negative; at least, if it become equal to zero for any particular values of and , it cannot become positive. It follows, therefore, that the function itself will be invariably negative, while and increase together from zero to be infinitely great.
Now assume a series of values of increasing from zero without limit, viz.
and having substituted these in the function , find, by means of the equation , the corresponding values of , viz.
then, by substituting these values in , we shall obtain a series of results all negative, and increasing from zero without limit, viz.
and whatever be the magnitude of the positive quantity , it must be contained between two consecutive terms of this last series, viz. between and . But as the values of may be assumed as near one another as we please, it follows that and may be made to approach to one another and to , within any required degree of accuracy. Thus, two values of and may be found that will satisfy both the equations,
and having found these values, we shall obtain the quadratic divisor of the proposed polynome , viz. , or
In the preceding demonstration, it is supposed, that increases without limit, as becomes indefinitely great; which may be thus proved: The values of and will coincide nearly with the terms containing the highest powers of and , when these quantities are very great; and ultimately the functions may be considered as equal to those terms alone. In such circumstances, therefore, the values of the functions will be found by writing for ; whence we get
and if we put , or ; then,
Now, remaining invariable, will increase as increases; and the least value of that will satisfy the equation , corresponds to the least value of that will make the polynome in the expression of equal to zero; which value, according to what was before shown, is
But, if we put , we shall get
or, because ; , and ;
which proves the point assumed in the demonstration.
By a similar mode of reasoning, we may likewise prove the former case of the proposition, when is negative. In this case, the quadratic divisor is ; and if we proceed as before, or, which is the same thing, if we change the signs of and in the equations (C) already obtained, and put
we shall get
Now, by pursuing the steps of the foregoing analysis, we may prove, first, that, for every assumed value of , a negative value of may be found, which will satisfy the equation ; and, secondly, that, when the values which satisfy the equation are substituted in the function , the results will be invariably positive: whence it follows that a positive value of , and a negative value of , may be found that will satisfy both the equations, whatever be the magnitude of . The analogy between the two cases is thus placed in a strong light; and a little reflection will even bring us to this conclusion, that in reality the one case is a necessary consequence of the other. For since and depend only upon , and the given coefficients of the polynome, they will be functions of ; wherefore, in the equations of the first case, viz.
being negative, and positive, we may suppose and , these values being such as to render each of the equations identical: and then the quadratic divisor will become
But, because the foregoing equations become identical by the substitution of the values mentioned, it is a necessary consequence that the equations of the second case, viz.
in which the signs of , , and , are contrary to what they were in the former equations, will likewise be identical, when and ; and the quadratic divisor, , will now become
Thus when the quadratic divisor of the first case is expressed in terms of , we have only to change the sign of that quantity, in order to have the quadratic divisor of the second case. It is not difficult to perceive, that what has now been proved is nothing more than another application of the principle employed in Sect. 10; a principle which is the real foundation
Equations. of the imaginary arithmetic, with the processes of which the preceding investigations are intimately connected. None but real quantities have occurred in the analysis we have pursued, because we have sought to investigate which is always rational; whereas, if we had proposed to find , we should inevitably have been led to the real quantity in the one case, and to the impossible quantity , in the other. These few observations are made for the purpose of throwing light upon a part of analysis, which is certainly obscure in its principles, although there is no question that it is a useful and even a necessary branch of the art of calculation. A fuller elucidation of the subject would be unsuitable to this place; but enough has been said to show that we must seek in the principles of analysis itself for the explanation of the operations it employs; and we may, with great probability, conclude, that no satisfactory account of the imaginary calculus will ever be obtained by having recourse to fanciful geometrical constructions, or to the analogy between the circle and the hyperbola, or to the metaphysical proposition, that all processes with general symbols, whether significant or not, are equally entitled to be considered as demonstrative.
13. Having now proved, in a rigorous manner, that every polynome of even dimensions has at least one quadratic divisor of the one kind or the other, it follows, that it may be reduced by division to another polynome two degrees lower: in like manner, this last polynome will admit of being lowered two degrees more; and by repeating the same process, the first polynome will at length be completely exhausted by quadratic divisors. If, therefore, we recollect, that every polynome of odd dimensions has one binomial divisor, we shall arrive at this general conclusion, "That every rational polynome can be completely exhausted by binomial and trinomial divisors; and, consequently, that it is equal to the product of a certain number of factors of the two first degrees."
It appears also that the binomial factors of any polynome are such only as arise from the resolution of the quadratic divisors; and they are, therefore, either real or imaginary. And thus we finally obtain the following proposition, which was assumed by Harriot, and is the foundation of the received theory of equations, namely, "Every rational polynome has as many binomial factors, and as many roots, real and imaginary, as it has dimensions."
The necessity of confirming, by a general demonstration, the assumed theory of the impossible roots of equations, was early felt; and, accordingly, this point has engaged the attention of all the great mathematicians to whom analysis is indebted for the progress it has made in the course of the last and the present centuries. An account of their several researches would greatly exceed the limits of this article; but the reader will find all the information he can wish for in two long notes (9 and 10) of the Traité des Equations Numériques, by La Grange, in which the author, with his usual elegance, has explained and commented upon the various modes of investigation that have been proposed. It will be
sufficient to observe here, that all the demonstrations Equations. that have appeared are either calculations with impossible quantities, or they proceed upon the assumption, that every equation has as many roots as dimensions, and thus involve the very thing to be proved.
14. The general cases in which mathematicians have been successful in resolving rational functions into their trinomial factors, are confined to the theorem of Cotes, and to a more general proposition of a similar kind, for which we are indebted to De Moivre. These instances are of great importance in analysis, and we shall therefore subjoin an investigation of them, because they are deduced in a very direct manner from the method we have followed.
Suppose, as before, that , or , is a rational polynome of dimensions, and one of its quadratic divisors; put , substitute for , and write the transformed function in two lines, one containing all the even, and the other all the odd powers of ; then the polynome will be equal to
By the same substitution of for , the divisor will become ; and, as before, the conditions that shall divide each of the foregoing lines, will be expressed by the following equations, viz.
In these formulæ substitute the expanded values of , , &c.; and class together all the homogeneous terms of the same order, that is, all the terms in which the exponents of and amount to the same sum, then we shall have
Equations. Now, put , ; and, by what was proved in Sect. 10, the two foregoing equations will become
And the quadratic divisor will be changed into
When , and or , the preceding equations coincide with these, viz.
which express the condition that the given polynome has two or more factors equal to ; at which limits a quadratic divisor changes from being of the form to be of the form , or the contrary. Thus we learn that, in the equations (E), must always have a finite value, and then the denominator of the second equation may be neglected.
Let the preceding investigation be applied to find the quadratic factors of . In this case the two equations (E) will become
Whence
Now, excluding the cases when and , the last equation will be satisfied when , or , the numerators of the fractions representing all the odd and even numbers less than the common denominator; but the second equation will be satisfied only when :
wherefore all the quadratic factors of the function will be comprehended in the formula
When is even number, the quadratic factors will amount to ; and if to them we add the simple factors and , we shall have the complete resolution of the function. When is odd, the
number of quadratic factors is , to which must be added the binomial factor .
By proceeding in a similar manner in the case of the function , we shall have the equations
Excluding the cases when and , the second and third equations will be both satisfied, when , the numerator of the fraction representing any odd number less than . Wherefore all the quadratic factors will be comprehended in the formula
When is even, the number of quadratic factors is , and they exhibit the complete resolution of the function. When is odd, the number of quadratic factors is , to which the binomial factor must be added.
Let us next take the more general function
And, in the first place, when is greater than unit, the function is equal to
and the quadratic factors may be found by the cases already considered.
When is less than unit, let , and the function to be resolved will be
By means of the equations (E) we get
And hence
But, ; and ; wherefore the two last equations will become
Equations. and these, supposing different from unit, can be satisfied only by making , or .
Now, , being any integer number whatever, zero included; and hence , which formula comprehends all the values of that will satisfy the above equations. Wherefore all the factors sought will be contained in this general expression, viz.
in which, if for we substitute all the integer numbers less than , zero included, we shall obtain the quadratic factors of the proposed function.
15. The quadratic divisors and , have hitherto been considered separately; but they may be both represented by , which will coincide with the one or the other according as is positive or negative. And, if we now proceed as before, we shall get the following equations which express the conditions necessary, in order that the polynome of any proposed dimensions, as , shall be divisible by , viz.
By eliminating we shall obtain an equation, viz.
in which is the unknown quantity. As the process of elimination is independent of the particular values of the coefficients of , the degree of the resulting equation will be the same when the polynome has as many real roots as dimensions, and when the case is otherwise. But when is equal to the product of real binomial factors, the multiplication of every two of them will form a quadratic factor. The number of such factors will, therefore,
be equal to , which expresses all the combinations made with things taken two and two.
Consequently, there will be just so many different values of that will satisfy the equation , which
will, therefore, have its exponent equal to .
It thus appears that the equation rises in its dimensions very rapidly above the given polynome, on which account little advantage is derived from this procedure.
Again, by eliminating from the same two equations we shall obtain one, viz.
in which is the unknown quantity. This equation, which has already been alluded to (Sect. 8), rises to the same dimensions with the former equation ; but it is possessed of some useful properties, derived chiefly from the consideration, that every positive root gives a quadratic factor of the form
in the polynome , and every negative root, a quadratic factor of the form in the same polynome. Equations.
The quadruple of is equal to the square of the difference of the two binomial factors of ; whence it follows that the quadruples of the several roots of the equation are equal to the squares of the differences of the roots of . If, therefore, we put for the roots of , the roots of will be
and from this it is manifest, that the coefficients of the same equation will be known symmetrical functions of the quantities or of the roots of . The rules formerly explained may, therefore, be employed for calculating the coefficients of ; and this method of forming the equation is not only more convenient than the process of eliminating; but it likewise has the advantage of enabling us to find any one coefficient separately without computing the rest. Thus, if we put
and expand this product, and in place of the symmetrical functions of which it is composed, substitute their values in terms of the given coefficients of , we shall obtain the value of ; and the last term of the equation will be equal to
the upper sign taking place when , the dimensions of the equation , is even, and the lower sign when the same number is odd.
If we suppose the given equation to be possessed of as many real roots as dimensions, or to have real binomial factors, the product of every two of these will be a quadratic factor , in which is positive; wherefore, the roots of will be all real and all positive. On the other hand, when the given equation has not as many real roots as dimensions, it will be divisible by one or more quadratic factors not resolvable into real binomial factors, and in which is negative; consequently, the equation will have one or more negative roots. It is, therefore, a property of the auxiliary equation , that when the roots are all real, they are all positive; and when they are not all real, some of them are negative. Now the rule of Descartes will enable us to find whether the roots are all positive or not; and by this means we shall discover whether the roots of the given equation are all real or not. From what has been said we may lay down this rule: "The proposed equation will have all its roots real, when the auxiliary equation has as many variations from one sign to another as it has dimensions, or when its terms are alternately positive and negative; otherwise the proposed equation will have one or more quadratic factors of the form , but
Equations. the number of such factors cannot exceed the continuations of the same sign in the auxiliary equation."
Again, in the equation , the polynome is equal to a certain number of binomial factors of the forms and , multiplied into a supplementary polynome of even dimensions, which, not being capable of having a negative value, will have its last term positive (Sect. 5). It is manifest, therefore, that the last term of will be positive or negative, according as the number of factors of the form is even or odd, that is, according as the equation has an even or odd number of real and positive roots. But every two real roots in the equation give one real and positive root in the subsidiary equation : wherefore, if denote the number of real roots in the former equation, the number of real and positive roots in the latter will be equal to ; and the last term of the subsidiary equation will be positive or negative, according as is an even or an odd number.
In a cubic equation , is either one or three. In the first case, the equation will have no positive roots, and the last term will be positive; in the second case, it will have three real and positive roots, and the last term will be negative. Now, the dimensions of being odd, the function will be negative in the first case, and positive in the second. Wherefore the given cubic equation will have one real root, or three, according as the function , that is,
or , is negative or positive.
In a biquadratic equation , is equal to zero, or two, or four. In the first case, the equation has no positive roots, in the third, it has six; and in both cases the last term is positive. In the second case, the same equation has only one real and positive root, and the last term is negative.
The dimensions of , equal to , being even,
the function will be positive in the first and third cases, and negative in the second case. Wherefore the proposed biquadratic equation will have only two real roots when the function , that is,
, or, , is negative; and when the same function is positive, the proposed equation will have four real roots, if the terms of the auxiliary equation be alternately positive and negative; otherwise it will have no real roots.
In an equation of the fifth degree, is equal to one, or three, or five. In the first and third cases, the last term of will be positive, for there are either no positive roots or ten; in the second case the last term is negative, the number of positive
roots being three. The dimensions of , equal to , being even, the function will be positive in the first and third cases, and negative in the second. Wherefore the given equation of the fifth degree will have three real roots when the function is negative; and when the same function is positive, it will have five real roots, if the terms of the auxiliary equation be alternately positive and negative; otherwise it will have but one.
Resolution of Algebraic Equations.
16. When the coefficients of an equation are given in numbers, we may investigate the numerical value of any one root separately, by first seeking the limits between which it lies, and then narrowing those limits to any required degree of approximation. But this process is not what is meant by the general solution of algebraic equations, which supposes that the coefficients are denoted by general symbols, and consists in finding such a function of those quantities as shall, by the multiplicity of its values, represent all the roots. An algebraic expression is susceptible of many values, by means of the different radical quantities it contains; but, these radical quantities being themselves the roots of an equation, it follows that the general formula for the solution of any proposed equation can be nothing more than a function of the given coefficients combined with the roots of another equation.
The solution of quadratic equations has been known since the origin of algebra; it is found in the work of Diophantus, the first treatise on the science extant, if it be not the very first that was written. The Italian mathematicians, who are the founders of the modern algebra, discovered the solution of cubic and biquadratic equations. The rules they invented for this purpose are, however, rather the result of particular artifices, than deductions from any profound views of the structure of the equations they considered. In the course of the last and the present centuries, the general solution of equations has been the subject of almost innumerable researches by all the mathematicians of the first rank; but their labours have not been successful in advancing this branch of the science beyond the steps made by the first algebraists.
The rules usually given for the solution of cubic and biquadratic equations are to be found in all the elementary books, and it would be superfluous to repeat them here. An account of the attempts that have been made to obtain a general theory for solving algebraic equations would greatly exceed the limits we must prescribe to ourselves. What has most impeded the progress of algebraists in their researches on this subject, is the difficulty of treating it by a perfect analysis, or of arriving at general conclusions by a process of reasoning founded solely on the principles of the inquiry, and disengaged from particular artifices of calculation, and from particular suppositions. In what follows, we shall endeavour to lay before our readers the general principles on which is founded all that has been successfully accomplished in this theory.
Let the three roots of a cubic equation be represented by ; and having interchanged these letters among one another, in all possible ways, we shall get the six permutations following, viz.
The combinations that stand first on the left are formed by prefixing the same letter to the permutations made with the other two; and those on each line are derived from one another by making the last letter of one stand first in that which follows, while the other two letters preserve the same order.
Now let ; and let the letters of first combination of each line be prefixed in order to the three terms of ; then we shall get
and if we multiply and by successively, we shall further obtain
The six quantities comprehend all the values that can be formed by combining with , the three letters taken in any order whatever; and it is obvious that the cubes of all these six quantities, being each equal either to or , have no more than two values.
And because and have only one value each, any symmetrical functions of them, as and , will have determinate values, which remain the same, however the letters be interchanged among one another. The quantities and must, therefore, be symmetrical functions of ; and, consequently, they can be found in terms of the coefficients of the given equation.
By actually involving to the third power, we get
Now , when is any root of different from unit; therefore, by adding the three last expressions, we get
Again, by actually multiplying
and, because ,
By means of the preceding formulæ, we can compute the values of and ; and these values being the coefficients of a quadratic equation having its roots equal to and , we can thence find and , and and . Now and being known, we have
wherefore,
To apply the foregoing investigation, we shall take a cubic equation, , which is so prepared as to want the second term, then (Sect. 9)
consequently ; , and . Hence
Wherefore, by substituting these values in the expressions of the roots, we get
The preceding investigation, as well as all other methods that have been proposed for cubic equations, leads to the same result with the rule invented by Cardan; and, like that rule, it becomes, in some cases, insufficient for arithmetical computation, on account of the imaginary quantities that appear in the expressions of the roots. What is now mentioned is not an accidental circumstance, but a necessary consequence of the method of investigation pursued, and of the introduction of the imaginary roots of the equation . When are real quantities, the values of and will be both imaginary, because they involve and , or and . In this case, therefore, although the
three roots of the proposed equation are all real, yet the algebraic expressions of them are all imaginary,
Equations. and useless for the purpose of numerical calculation; and the former circumstance is precisely the reason of the latter. On the other hand, when one root is real and the other two imaginary, the impossible quantities destroy one another in the expressions of and , which are, therefore, real quantities; and in this case, the algebraic formulae answer for finding the numerical values of the roots. The distinction here pointed out depends on the radical , which is real or imaginary, according as the equation has one or three real roots, because is always positive in the first case, and negative in the second.
Much labour and thought have been bestowed in order to free the formulae for the roots of cubic equations, from the imaginary expressions that render them unfit for arithmetical computation. In particular instances the difficulty disappears; namely, when the radical quantities are perfect cubes, in which cases the impossible parts of the cube roots destroy one another, so as to leave none but real quantities in the expressions of the roots of the equation. And by expanding the radical quantities we may, in all cases, obtain the roots of a cubic equation in series of an infinite number of terms free from the imaginary sign. But when it is required to transform the formulae for the case of a cubic equation with three real roots, into finite expressions free from impossible quantities, and to do so without employing any other than the received notations of algebra, all attempts to solve the problem have led to equations in the same circumstances with the one proposed, and have ended in bringing back the same difficulty; in so much that equations of the description mentioned are said to be in the irreducible case.
It is, however, possible to transform the formulae for the roots of a cubic equation in the irreducible case into real expressions, although not so as to fulfil all the conditions above mentioned. Let ;
then : wherefore the equation , will become
By the preceding formula, the value of in this equation will be
or, according to the notation of Sect. 10, making
By substituting this value of we get
which equation being true for all values of and , must be identical, or, when expanded, must consist of a series of quantities that mutually destroy one another. Now the equation will still be identical, when is changed into : so that we shall have
and this proves that the equation
is solved by the formula
As the investigation in Sect. 10 is equally true, whether be a whole or a fractional number, we may apply it to find the value of the symbol .
For this purpose, let
then ; and, according as we take one or other of the angles that have the same sines and cosines, we shall obtain three different values of , or of , viz.
By putting , the equation (2) will assume the same form as at first, namely,
and because , and ; if we determine the angles by means of their tangents, instead of their sines and cosines, we shall get
and the three roots of the equation will be
Every cubic equation falls under one or other of the formulae (1) and (2), except when , or , which takes place when an equation changes from one class to another; and in this case we have
The several rules that have now been given, therefore, include every possible case.
The difficulty attending the irreducible case arises from a real distinction between the two subordinate classes of cubic equations, and is insurmountable by the ordinary operations of algebra. There is no permanent distinction of equations belonging to the same order, when we consider their roots as positive
Equations. or negative: because, in any proposed equation, all the roots, or as many of them as we please, can be changed from positive to negative, by the simple artifice of increasing or diminishing them all by a given quantity. But the case is otherwise when we consider the roots of an equation in their character of real or imaginary quantities. No transformation can change an equation with one real root into another with three real roots, without involving the operations of the impossible arithmetic. If, therefore, we lay down this condition, namely, that the formulae for the roots of equations must be in a shape fit for numerical calculation; we may conclude that, in fact, there is no resolution of equations except what consists in reducing all those of the same class to some one of that class, the most simple and convenient in its form, that can be found. If we examine the preceding investigation, it will appear that it is merely an attempt to reduce all cubic equations to the form ; and this readily succeeds, without impossible operations, when the proposed equation and that with which it is compared have their roots of a similar description; and it as surely fails when the case is otherwise.
In geometry, where the relations of the magnitudes under consideration are never lost sight of, there is no tendency to refer the solution of a problem to a class to which it does not belong. The ancient geometer could never be in danger of applying the problem for finding two mean proportionals to a case that can be constructed only by the trisection of an angle. The modern analyst, dismissing the original magnitudes of his problem, and reducing all possible relations to equations in abstract numbers, is apt to overlook distinctions, and sometimes to waste his labour, in seeking to accomplish what a due separation of cases would show to be impossible. There is the same distinction between the class of cubic equations with one real root, and that with three real roots, that there is between the two geometrical problems alluded to above; and the algebraist who attempts, by means of the ordinary operations of his art, to transform Cardan's formula so as to make it apply to the irreducible case, is precisely in the same situation with the geometer who should set about trisecting an angle by finding two mean proportionals.
The power and force of the algebraic method does not consist in breaking down real distinctions, but in connecting, by sure and general principles, many truths which, in geometry, are joined only by vague analogies, and even have no affinity at all. This advantage is derived chiefly from the doctrine of negative quantities, and from the impossible arithmetic. By means of the first, a formula which is obtained by considering only one state of the data of a problem, applies, necessarily and by the very structure of analytical language, to the same problem in all possible conditions of the data. On the other hand, when the relations of the data vary, the geometer is obliged to subdivide his problem into cases, or into other subordinate problems; and although it may be perceived that great similitude prevails among all the subdivisions, yet it is impossible to reduce the analogy between them to determinate rules, as is done in
Equations. algebra. But, in the whole compass of geometry, there is nothing that bears any resemblance to the imaginary arithmetic. When the geometer has fixed the determination of his problem, or ascertained the limits within which it is possible, he has drawn a line that must be the boundary of his investigation. Now, it is to truths lying beyond this line that the meaning of the comprehensive expressions of the imaginary arithmetic must be referred. It is not to be understood that a problem can be solved by algebra, which is impossible in geometry; but the analytical formulae, at the same time that they mark the limits of the problem, go beyond them, and point out connected truths, that require only certain changes to be made in the algebraic expressions; in like manner, as all the possible cases of the same problem are derived from one only, by means of the variations of the signs.
If , represent the four roots of a biquadratic equation; and if we prefix the same letter to all the permutations made with the other three, we shall get the six combinations following, viz.
In the first line, the letters , are made to circulate, by placing immediately after the immovable letter that which stands last in the combination preceding; and, in the second line, the moveable letters have, respectively, an inverted order to what they have in the first line.
Let ; and let the four letters taken in the several orders of the six combinations be prefixed to the terms of ; the results of the first line being , and those of the second line ; then
Now, in the equation , is either equal to , or to ; and whether we take the one value or the other, it is apparent that .
Again, from every one of the six foregoing combinations, four others are derived by circulating the letters continually from the last place to the first; and, in this manner, we obtain twenty-four different permutations, which are all that can be made with four letters. Thus, if we take , and move the letters as directed, we shall get these four combinations, viz.
And if we multiply by continually, observing to retain the three first powers of , and to make , we shall get
Equations, so that , are the functions formed by prefixing to , the letters of the four combinations; and it is obvious that these functions have all the same square, equal to .
Wherefore, if the four letters, taken in all possible orders, be prefixed to the terms of , the squares of the twenty-four resulting functions will be equal to one or other of the six quantities, ; and since it has been proved that ; it follows that the twenty-four squares have no more than three different values, equal to .
And, because , can have no more than one value each, any symmetrical functions of them, viz.
will have determinate values independent of the order of the letters . The same functions will therefore be symmetrical expressions of the roots of the given biquadratic equation, and they will be known in terms of the coefficients of that equation.
Supposing , we get
and hence,
the symbol being used here, as in Sect. 9, to denote the sum of the products of every two of the roots. Wherefore, if we put
then
and hence,
But it will readily appear that
Now, by substituting these values, we get
Again, if we multiply the expressions of , Equations, we shall get
and finally, by means of the formulæ in Sect. 9,
If now we substitute the values computed by the preceding formulæ, in the cubic equation,
we shall obtain the values of , and consequently of , by solving that equation: and, when are known, we have
wherefore, because , we get
And finally, by making ,
In applying these formulæ, it is necessary to observe, that, as the quantities , are found by extracting the square root, they may each have either the sign plus or the sign minus prefixed. But all ambiguity from this cause will be taken away, if it be observed, that the expressions of , will always give the same results, provided the signs of , be so determined as to satisfy the equation,
For, if we suppose that the signs of , are so determined as to satisfy the equation mentioned, they cannot be varied so as still to satisfy the same
Equations. equation, unless two of them be changed together; for, if one sign only be changed, or if all the three be changed together, the product will have an opposite sign to what it had before, and the equation will no longer be satisfied. But the expressions of , give the same set of values when the signs of any two of the letters , are changed together; so that, in order to have the true values of the quantities sought, no other rule for the signs of , is necessary than that they must be such as to satisfy the equations alluded to.
To apply the preceding investigation we may take the equation,
which wants the second term. Then,
and , are the roots of the cubic equation,
Having solved this equation, and found the values of , the signs of these quantities must next be determined so as to satisfy the equation,
and then we have these formulæ for computing the roots of the proposed equation, viz.
These formulæ coincide with the method of solving biquadratic equations first proposed by Euler in his Algebra. But, in order to take away the ambiguity arising from the double sign of the square root, that celebrated mathematician uses two sets of expressions for the roots of the equation, viz.
of which one set is the same with the formulæ given above, and the other is obtained by changing into ; the first set being directed to be used when is positive, and the other set
when the same quantity is negative. This procedure is not so simple as that we have followed, which requires only one set of formulæ. It has even been the occasion of leading into error, in as much as it makes the signs of , depend entirely upon the sign of the given quantity ; whereas, it is indispensable that, regard being had to the nature of the quantities , their signs shall be determined so as to satisfy the equation . This inadvertence of Euler has escaped the observation of most of the authors who have treated of biquadratic equations, and was first noticed by M. Bret in the second volume of the Correspondance sur l'Ecole Polytechnique.
It may not be improper to notice briefly some of the other rules for biquadratic equations. These are chiefly two; the method of Descartes, which resolves the given equation into two quadratic factors; and the oldest method of all, invented by Louis Ferrari, a pupil of Cardan, which proceeds by transforming the given equation, so as to make it equal to the difference of two complete squares, and then extracting the square roots. However different from one another these two methods may at first seem, they are at bottom the same; and they are so far connected with that already investigated, that all the three lead to the same cubic equation.
Suppose that , are the roots of the biquadratic equation,
then , and , are two quadratic factors, the product of which is equal to the given equation. Now,
wherefore, if we put , , the two factors will become
and if we multiply them, and equate the coefficients of the product to the coefficients of the given equation, we shall get
And it is to be observed that, on account of the two first of these equations, and are both real quantities when is a real quantity; so that, provided a real value of can be found, the given equation is always resolved, by this method, into two quadratic factors free from imaginary expressions.
Now, by combining the equations just found, we shall get
The first of these equations is a cubic, of which the root is ; and it is precisely the same with the cubic of the former method. As the last term of this equation is essentially positive, it follows, that there is always one positive value of , and one real value of ; wherefore, in consequence of what has been proved, the values of and , derived from the positive value of , are in every case real quantities, which is, no doubt, an advantage in the practical application of the method.
If we wish to follow the process of Louis Ferrari, we may assume , so as to render the expression
identical with the given equation; and as this expression is no more than the product of the two quadratic factors of the last method, the quantities to be determined will be found by the formulæ already given.
The theory of permutations, which is successful in solving cubic and biquadratic equations, applies likewise to those of the fifth and higher orders. But, to use the words of Lagrange, "Passé le quatrième degré, la méthode, quoiqu' applicable en général, ne conduit plus qu'à des équations résolvantes de degrés supérieurs à celui de la proposée." Thus, in the case of equations of the fifth degree, the theory leads to a biquadratic equation of which the coefficients are to be found by resolving an equation of the sixth order.
There is, however, no doubt that the doctrine of permutations contains the principles from which we are to expect the resolution of equations of the higher orders, if the problem be possible. It may be alleged, with great probability, that the theory succeeds in the less complicated cases, because, when the number of the roots is small, their permutations are soon exhausted, and we speedily arrive at those combinations of them which remain invariable, whatever be the order of the quantities combined. But when the number of the roots is greater than four, their permutations are very numerous, and, at the same time, the functions produced by combining them are very complicated; on which accounts it is difficult to conduct the investigation so as to arrive at a satisfactory conclusion, either accomplishing the intended purpose, or proving that the undertaking is impossible.
In the twelfth volume of the Italian Society, and in a work published at Modena in 1813, M. Paolo Ruffini has proved, that no function of five letters can exist that is susceptible of only three or four different values when the letters are interchanged among one another in all possible ways. M. Cauchy, in the sixteenth volume of the Journal de l'Ecole Polytechnique, has demonstrated, that a function of letters, unless it have no more than two different values, cannot have a number of different values less than the prime number next below . On these grounds, it
has been inferred, that the resolution of equations of the fifth degree is in reality an impossible problem. (Lacroix, Compt. des Elem. d'Algebre, p. 61.) And, if it be admitted that, in the process of resolution, no equations can occur except such as have symmetrical functions of the five letters for their coefficients, the inference founded on the labours of the eminent mathematicians we have mentioned would be indisputable. But it is not impossible that the resolution of equations of a high order must be effected by gradually depressing an equation at first of great dimensions; and in this procedure we may arrive at equations, the coefficients of which, although functions of the roots of the proposed equation, are not symmetrical functions, but partial expressions susceptible of several values, according as the order of the letters that denote the roots is made to vary. On this supposition, the resolution of equations above the fourth order, by means of equations inferior in degree, would not be inconsistent with what has been proved.
17. A method for solving equations of one order may be generalized so as to extend to a certain class in all orders. Thus De Moivre has found a species of equations of every degree that have their roots similar to those of cubics, and which are solved by the formula
differing in no respect from the expression for resolving cubics, except that is written in place of 3.
An equation may be depressed to a lower order when it is known that the roots have a given relation to one another. An instance of this has already occurred in the case of equal roots; for, the equal roots having been first found, the equation can be lowered by division. Reciprocal equations furnish another example of depression to a lower order, on account of a relation subsisting among the roots. A reciprocal equation is one of even dimensions, such that half the roots are respectively the reciprocals of the other half, in which case no alteration is pro-
duced in the equation when is substituted for . In equations of this kind, the same coefficients occur in the same order, and with the same signs, reckoning from either end; a description that likewise applies to some equations of odd dimensions, which, however, do not constitute a new class, being merely reciprocal equations, as defined above, multiplied by the factor . A reciprocal equation may always be depressed to half the dimensions, by transforming it so that the new unknown quantity shall be equal to . It is sufficient to have mentioned these cases, which are fully treated of in all the elementary books.
Equations with only two terms, as , are the most extensive class that have been resolved by a general method. The successful application of analysis to this class of equations is extremely interest-
Equations. ing both in itself, and likewise because it is connected with the division of the circle into equal parts, and has occasioned the discovery of some curious and unexpected results respecting that problem. For these reasons, it appears proper to lay before our readers a short view of this branch of the doctrine of algebraic equations.
We have already shown, that, admitting the theory of angular sections, every equation with only two terms, as , may be completely resolved into its binomial and trinomial factors; and hence all its roots, possible and impossible, may be computed by means of the trigonometrical tables in common use. If we put
, and denote by any number less than , we
have found that the equation is divisible by the quadratic factor , and, consequently, that it has the two impossible roots,
and, because , and , the same two roots may be otherwise more symmetrically represented, thus,
Therefore, when is odd, the equation has one real root equal to 1; and when is even, it has two real roots equal to ; and in both cases the remaining roots are all impossible, and are found from the formula,
by making equal to all the integral numbers less than in the one case, and less than in the other. Nothing, therefore, can be more simple than the computation of the roots of such equations by means of the trigonometrical tables. But in seeking a general solution, it is required to investigate the roots without resorting to the properties of the circle, unless in so far as this may be necessary for solving similar equations inferior in degree to the one proposed. In this view the resolution of the equation , is equivalent to the division of the circle into equal parts, granting the like division for all numbers less than . And in order to render the investigation of the problem as simple as possible, it may be further observed, that it will be sufficient to consider the case when the exponent is a prime number; because, from this case, the other, when it is a composite number, can be readily deduced.
It will be proper to premise here a property of the roots of equations with only two terms, to which we shall have occasion continually to refer. The property in question depends upon this theorem, namely: When is any number, not a multiple of the prime number , the remainders of the terms of the series,
when each is divided by , are all different from one
VOL. IV. PART II.
another; and, consequently, without regard to the Equations. order, they will coincide with the numbers 1, 2, 3, &c. less than . If, therefore, we take any one of the impossible roots of the equation , viz.
all its powers with indices less than , viz.
&c.
will be different from one another; and likewise they will coincide, without regard to the order, with the like powers of any other impossible root of the same equation: because, whatever number stands for, the arcs are all different from one another, and, neglecting whole circumferences, constitute the same series of terms although in different orders. Wherefore, being a prime number, if be one of the impossible roots of the equation , all the roots will be represented by the terms of the geometrical progression,
for every one of these terms satisfy the given equation, and it has been shown that they are all different from one another.
When is a composite number, the same property does not belong to all the roots of the equation , but only to some of them. It belongs generally to the root
when is either equal to unit, or to any number that has no common divisor with ; in which cases, all the powers of are roots of the equation , and all different from one another, when the exponents are different and less than .
If the equation be divided by the binomial factor , we shall get
and this being a reciprocal equation, it can be further depressed to half the dimensions. In this manner we obtain the solution of , which is reduced to a cubic; but, by the same procedure, the equation next in order, viz. , can be lowered only to the fifth degree, for equations of which class there is no rule. Nevertheless, this last equation has been solved by Vandermonde, to whom, and to Lagrange, we are mainly indebted for disengaging the resolution of equations from the complicated operations of algebra, and for substituting, in their place, reasonings founded on the doctrine of combinations. The author has not explained particularly the process by which his solution was obtained; he gives it as a result of his theory, which, although it fails in general for equations above the fourth degree, succeeds in this instance on account of particular relations between the roots. Similar relations subsist between the roots of any other binomial
Equations. equation when the exponent is a prime number; and, in consequence, a like mode of investigation will apply, as indeed the author has expressly said. But this procedure would unavoidably be attended in every new instance with very long calculations; and it appears hardly possible to arrive in this way at any general method that would apply to all equations of the class in a regular manner, and without considerations drawn from each particular case.
M. Gauss, in a work entitled Disquisitiones Arithmeticae, replete with original and important matter, applied a property of prime numbers to the solution of binomial equations, which removed every difficulty, and led to a theory that unites simplicity and generality. If we suppose that is a prime number, and resolve into its component factors, so that
, being prime numbers, M. Gauss has proved that the solution of the equation , or, which is the same thing, the division of the circle into equal parts, can be effected by solving successively equations of dimensions, equations of dimensions, equations of dimensions, &c. Thus, if , then, because , the roots of can be found, or a polygon of 13 sides can be inscribed in a circle, by solving a cubic and two quadratic equations in succession. In certain cases, when a prime number comes under the form , as 17, 257, &c., the division of the circle will require the solution of equations no higher than the second order; whence this unexpected consequence has resulted from the theory of M. Gauss, that the inscription of a polygon of 17, or 257 sides in a circle, which are problems that have always been understood to transcend the limits of the elementary geometry, can, nevertheless, be constructed by the operations admitted in that science.
A work replete with so many interesting discoveries as the Disquisitiones Arithmeticae, could not fail to excite the attention of mathematicians. Legendre, in republishing his Essay on the Theory of Numbers, has added to it an exposition of M. Gauss's theory of binomial equations; and the same theory is the subject of the 14th note in the second edition of Lagrange's Treatise on Numerical Equations. No part of the mathematics could pass through the hands of men of so much ability without receiving great improvement. Lagrange has shown, that it is not necessary to go through the several intermediate equations that make so essential a part in the investigation of M. Gauss; and, by this means, he has reduced the solution of equations with two terms to the utmost simplicity of which it is capable. But, in one respect, it must be admitted that the procedure of the illustrious geometer is imperfect. Although it arrives, by a short investigation, at the partial quantities that by their additions form the expressions of the roots sought, it leaves indeterminate the order in which they are to be combined. M. Gauss has avoided ambiguity in this respect by deducing from one of the quantities all the other parts of the same expression; but, amidst a multiplicity of
different systems of values that may be deduced from the partial quantities, Lagrange has given no clue to guide to the true one. Equations.
In laying before our readers some account of this interesting branch of the theory of algebraic equations, we shall view the subject in a light somewhat different from that in which it has hitherto been placed. Instead of seeking directly the roots of binomial equations, we shall apply the principles of M. Gauss's theory immediately to the division of the circle into equal parts, by taking the arcs of the circumference in that order, to which the method owes all its success. This procedure is attended with some advantages. In the first place, the algebraic expressions of the quantities sought, represented by
, are more simple than those of the imaginary roots of the corresponding binomial equation;
and, in the second place, the same expressions, having always real values, are better fitted for application than the roots of binomial equations which require to be further reduced to prepare them for calculation.
Before entering on the principal problem, it is necessary to say something of that property of numbers on which the whole theory depends. Supposing to be any prime number, Euler has distinguished by the name of a Primitive Root any number less than , such that, if we take the series of all its powers with indices less than , and in each power reject the multiples of it contains, the several remainders are all different from one another, and, consequently, paying no regard to the order, they will coincide with the numbers 1, 2, 3, &c. less than . It has been proved that, for every prime number, there are as many primitive roots as there are numbers less than , which have no common divisor with it. The existence of such numbers in every case is therefore demonstrated; but no direct method of finding them has yet been published with which we are acquainted.
We gladly seize the present occasion of laying down a rule for finding the primitive roots of a prime number. But first we must premise, that when any proposed number is said to satisfy the
equation , it is always understood that the multiples of the prime number are rejected; and the meaning is, that, when the given number is substituted for , the whole result is divisible by without any remainder.
Now, let be a prime number, and the prime divisors of , so that ; then every primitive root will satisfy the first of the following equations without satisfying any of the rest, viz.
And, on the other hand, every number, not a primitive root, which satisfies the first equation, will, at the same time, satisfy one, or more, or all, of the other equations.
But the numbers which satisfy the first equation are exclusively those which are not found among the remainders of the series of square numbers divided by . Wherefore, setting aside the first equation, if we seek among the non-residual numbers for such as satisfy none of the remaining equations, the numbers so found will be the primitive roots sought.
When one primitive root is found by this method, all the rest may be directly obtained from it. For, if , represent all the numbers less than and prime to it; then, being one of the primitive roots, all the roots will be equal to the series of powers,
rejecting always the multiples of .
The demonstration of these properties would lead us aside from our present purpose; and we shall be content with adding some examples for the sake of illustration.
Let ; then , and ; so that in this case, the only equation of exclusion is , which admits only one solution, viz. . Therefore all the non-residual numbers except 10 are the primitive roots; namely, 2, 6, 7, 8. We may extend this conclusion to every case when is a prime number, as 7, 23, 47, &c.; in all which instances all the non-residuals, except , are the primitive roots.
Next, let ; then ; and there are no equations of exclusion. In this case, therefore, all the non-residuals, without exception, are primitive roots; and the same thing is true of every prime number of the form , such as 5, 257, &c.
Let ; then ; and the only equation of exclusion is
which admits only two solutions, viz. and . In this instance, therefore, all the non-residual numbers, except 5 and 8, are the primitive roots.
Let ; then ; and we have two equations of exclusion, viz.
The non-residual numbers are
3, 6, 11, 12, 13, 15, 17, 21, 22, 23, 24, 26, 27, 29, 30. Of these numbers the first, viz. 3, is a primitive root, since it satisfies neither of the two equations; and as the numbers less than 30, and prime to it, are 1, 7, 11, 13, 17, 19, 23, 29; all the primitive roots of 31 are as follows: viz. , , , , , , , . With respect to the other non-residual numbers, it will be found on trial, that the first equation is satisfied by 6 and 26; the second by 15, 23, 27, 29; and both equations by 30.
We are now prepared to enter upon the solution of the problem for dividing the circle into as many equal parts as there are units in the prime number . If we conceive a polygon of sides, to be inscribed in a circle, it will be admitted that the centre of gravity of the polygon coincides with the centre of the circle. Wherefore, if perpendiculars be drawn to any diameter of the circle from all the angles of the polygon, it follows, from the nature of the centre of gravity, that the sum of the cosines lying on one side of the centre of the circle will be equal to the sum of the cosines lying on the other
side. Let ; and put for the arc intercepted between the diameter and any angle of the polygon, then we shall have this equation, viz.
, which is no more than the analytical expression of the geometrical property just mentioned. Now, suppose that the diameter passes through one of the angles of the polygon; then , and the equation becomes
Let be one of the primitive roots of the prime number ; then rejecting multiples of , and paying no regard to the order, the terms of the geometrical progression,
will be equal to the several numbers less than . Wherefore, in the two series of arcs,
every arc in the geometrical progression will either be equal to some one in the arithmetical progression, or will differ from it by a whole circumference, or circumferences. Hence the cosines of the first series of arcs may be substituted in the last equation for the cosines of the other series; and thus we have
Again, by Fermat's theorem, . is a multiple of ; and because no primitive root of a prime number is the remainder of a square divided by that number, we have is a multiple of ; and, consequently, is a multiple of
. It follows, therefore, that is equal to a multiple of the circumference of the circle; and hence,
From this it appears that the cosines in the last equation may be distributed into two equal sums; one containing the cosines of all arcs from to inclusively, and the other the remaining cosines; consequently,
and because ,
Let ; and put
then all the powers of with indices less than will be different from one another, and all of them roots of the equation , the solution of which requires the division of the circle into only , or , equal parts.
In what follows, we shall have continual occasion to consider the expression
; and it will, therefore, be convenient to adopt an abridged mode of writing it. Now, the expression will be wholly known, and can be constructed when the two indices and are given; and we may therefore denote it by the symbol , placing always the index of before the other. We shall invariably make the index of positive, and suppose it reduced below by means of the formula (A). In like manner we shall suppose that the index of is always reduced below by suppressing the multiples of ; and we shall write it sometimes positive and sometimes negative, observing that the negative indices may be always rendered positive by supplying the proper multiples of ; thus, , &c.
According to the notation just explained, we have
And because , the symbols , , , will represent the series of cosines in the equation (1); so that we have
The following formula is no more than a corollary from the preceding notation, viz.
By means of the trigonometrical formula in common use, any powers and products of the cosines of the arc and its multiples may be reduced to a series of terms, containing the like cosines multiplied by given coefficients. Wherefore, because , and likewise, because the cosines of all arcs greater than , , , &c. may be reduced to the cosines of arcs less than , it follows that every rational and integral function of , , , &c. may be brought under this form of expression, viz.
Now, if we suppose the function we are considering to be such, that it retains the same value when any of the multiple arcs , , &c. is substituted for , the transformed expression will be possessed of the same property. But, if we actually substitute the arcs , , &c. for in the foregoing expression, it will become successively
each line containing the same cosines, although in a different order, because the series of arcs is the same when whole circumferences, or the multiples of are rejected; and all these expressions cannot have the same value unless ; that is, unless the expression be of this form, viz.
which, in consequence of what was before proved, is equal to . It is, therefore, demonstrated that every rational and integral function of , , , &c., which remains unchanged when any of the multiple arcs , , &c. is substituted for , has, for its value, an expression without cosines, and depending only upon the nature of the function.
If we introduce the arcs in geometrical, instead of those in arithmetical progression, it is obvious that the substitution of the multiple arcs , , &c., for , is equivalent to the changing of into , , , &c.; and hence any rational and integral function of the cosines of and its multiples, which remains invariable when is changed into , , , &c. is a quantity independent of the cosines, or has its value expressed by a function from which the cosines are eliminated.
What has now been proved will enable us to appreciate the advantage arising from the introduction of the arcs in geometrical, in place of those in arithmetical progression, in which principally consists the improvement that this theory owes to M. Gauss. The solution of the problem turns upon finding those functions of , , , &c. which have determinate values independent of the cosines; which functions, it has been proved, remain invariable when any of the multiple arcs , , &c. is substituted for . Now, although the substitution of any multiple arc, in place of the arc itself, always reproduces the same series of cosines, yet the order is irregular, and varies with every different multiple arc; and this circumstance makes it difficult to investigate what
Equations. change the substitution will effect in a given function. On the other hand, by introducing the arcs in geometrical progression, the same order is still preserved, whatever substitution be made; and, by this means, every facility possible is obtained for investigating the functions sought.
The following properties are deducible from what has been proved. First, if &c. be any numbers, none of which is equal to zero, or a multiple of , and such that their sum is equal to , or to a multiple of ; the product
will be independent of the cosines of and its multiples, or will be an expression containing only the powers of multiplied by numeral coefficients.
For by the formula (B) we have
Therefore, by multiplying and observing that , because is a multiple of , we get
which shows that the product in question is not altered when is changed into . Consequently, according to what was before proved, the product is independent of the cosines.
It follows, as a corollary, that the product
is independent of the cosines.
Next, if &c. be any numbers, and ; and if neither nor any of the numbers &c. be a multiple of , we shall have
, the quantity being independent of the cosines, and containing only the powers of multiplied by numeral coefficients.
For, by the property already demonstrated, and its corollary, we have
and being quantities independent of the cosines. Therefore, by exterminating , we get
The foregoing properties are the foundations of the theory. But it is not enough to establish the principles by a general demonstration: it is also necessary to be able to compute the numerical values that occur in the application to particular problems. Therefore, supposing that and are two numbers, and , none of the three numbers , being a multiple of , it is proposed to find the value of in the equation.
For this purpose, set down the several terms of in their order; and below them write the terms of , placing first any term, as , and the rest in their order, in this manner,
Now, let every term in the lower line be multiplied into that which stands above it; and, separating the factor , which is common to each product, let the symbol represent the sum of all the products; then
If we repeat this operation, so as to make every term of the lower line stand first in succession, it is evident that, by this means, every term of will be multiplied by all the terms of ; so that the sum of all the results will be the product sought. We therefore obtain
Let , and ; then because the product of the cosines of two arcs is equal to half the sum of the cosines of the sum and difference of the two arcs, we shall have
In the first place, when , , ; therefore,
But ; and hence ; or ;
and, according to the value assumed for , the equation cannot take place when is not a multiple of ; wherefore
Now, if we put , we shall get
Wherefore, on account of the formula (B), we finally get
Next, when is not equal to zero, let and denote the numbers derived from by means of the equations
then, by substituting and for and , we shall get
and on account of the formula (B),
Now, collecting all the parts in the expression of , we shall get these formulae, viz.
As nothing changes in the expression of except the indices and , it may be denoted by the abridged symbol , in which it is obvious that may be substituted for ; so that
When is equal to , and , the product in question becomes , which has been proved to be a quantity independent of the cosines. In this case, therefore, we shall have
being a quantity from which the cosines are eliminated, and which is now to be investigated.
If, in the foregoing case, we suppose and , we shall get
but here, because , and its powers disappear from the expression of , and we have
and, by expanding the products of the cosines, as before,
When ; therefore
But no alteration is made in equat. (1) when we substitute, instead of the arc , any one of its multiples, or, which is the same thing, change into , , &c.; because such substitution, or change, continually reproduces the same cosines. Thus it appears that the sum of the cosines in , is equal
to ; and we have
For every other value of , and are, neither of them, equal to zero, nor to a multiple of ; wherefore, according to what has just been said, the sum of the cosines in each of the two parts of
, is equal to ; and thus, when is not equal to zero, we have
By substituting the values of and , we get
But, as was already proved,
wherefore,
Now, if we put , we have finally
When is an even number, it is obvious that
therefore it follows as a corollary, that, in this case,
By applying the equat. (2) first to the indices and , and then to the indices and , or to and taken negatively, we deduce
: and, by multiplying, we shall get, on account of equat. (3), this remarkable formula, viz.
By successive applications of the equat. (2), we get
&c.
By combining these equations, and writing for , we deduce
&c.
Wherefore, when is an even number,
and, by squaring and observing that, by equation (3),
When is an odd number, we have in like manner,
but, by equation (3),
; wherefore
Again, from the preceding expressions we get
and, by equation (4),
In like manner,
&c.
These formulæ need only be continued till we obtain the value of the function when is
even, and of when is odd; the re-
maining functions , &c. or, which is the same thing, , &c. being derived from the preceding values merely by changing the signs of the different indices of . Thus, if we write for , we shall have
&c.
Now, being any number less than , it has been shown that
and hence if we attend to the nature of functions, , , , &c. we shall readily get
or, by arranging the terms differently, and because
and it is to be observed that, when is even, the
last term is the single quantity , which has no corresponding part. Now, this quantity is entirely known. For, since , we have
Equations. ; but has been so assumed, that none of its powers with indices less than are equal to unit; and, therefore, , and . Again, by equation (3), ; wherefore we have
On the whole, the preceding analysis brings us to the following formulæ, which contain the solution of the problem, viz.
when is even by equation (5),
when is odd, by equation (6),
and by equation (2), .
Finally, by substituting the values of , , &c. , , &c. in the expression of , we get
the series of terms must be continued till the last index of and is when is odd, and
when is even; and, in this last case, the
quantity , must be added, prefixing to the same sign that is given to it in the value of .
The solution of the problem is thus reduced to the computation of the functions , , &c. which requires no more than the substitution of 1 for , and of 2, 3, 4, &c. successively for , in the expression of , equation (2). The half of these functions that have negative indices are deduced
from the other half, merely by changing the signs of the several indices of , or by means of equation (4). All the cosines sought are found by substituting , &c. successively for . Although the function is susceptible of different values, represented by ; yet the same cosines are deduced from any one of these values. By this means all ambiguity is avoided with regard to the system of values that represent the cosines; but the numerical value that must be attached to each particular cosine remains quite indeterminate, because may equally stand for
&c. The adaptation of the numerical quantities to the geometrical cosines must be made out by means of their relative magnitudes; the largest number answering to the greatest cosine. But when the value of one cosine is fixed, the rest are unambiguously determined by means of their indices.
In the formula for all the terms in which two quantities are combined have real values, although their forms are imaginary. But it is not difficult to transform them into equivalent quantities without the imaginary sign.
It is manifest that the functions and are of this form, viz.
denoting given coefficients. But, we have generally
wherefore, by combining the two expressions of and , we shall readily get
But, on account of equation (4), we may assume
and, by substituting these values in the last expressions, we get
by which means the arc is determined without ambiguity, since both its sine and cosine are ascertained. In like manner are determined the several arcs in the formulae,
Again, because , we may assume
And if these values, and the similar values of the functions (1, 2), (1, 3), &c. be substituted in the value of
, we shall readily deduce, when is an even number,
When is an odd number, we must separate the function from the rest, by supposing
and then, by means of equation (6), we shall easily obtain
The two last formulae determine the arc ; and we likewise have
and, by putting ,
Finally, by substituting the different values exhibited above in the formula for , we shall get
the series of terms being continued till all the arcs are taken in when is odd; and till they are all taken in except the last when is even, in which case also the quantity must be added.
By the preceding analysis the division of the circle into equal parts is accomplished, when is a prime number, by dividing a given arc into or equal parts. And this conclusion agrees with the general
proposition of M. Gauss. For the th part of a given arc is found by bisecting as often as is divisible by 2, trisecting as often as it is divisible by 3, and so on. When is a power of two, as in the case of the polygon of 17 sides, the solution is effected by repeated bisections, and thus comes under the elementary geometry. Supposing the division of the circle to be accomplished, we must further resolve the quadratic equation
in order to find the roots of the binomial equation .
The following examples are subjoined for the sake of illustrating the method of calculation. And, in the first place, we may take the case of equivalent to finding the roots of the equation , which was first solved by Vandermonde, and has been considered both by Lagrange and Legendre. Here,
and, as 2 is a primitive root of 11, we may suppose . In order to find the numbers and , write down the series 1, 2, 3, &c. as far as or 5; and, above each number, write the power of equal to it when the multiples of 11 are rejected, taking always the least remainder, whether positive or negative: thus,
In this arrangement of the powers of , it is evident that, denoting any index, is the next on the right hand, and the next on the left hand: we have, therefore,
Now, substitute these numbers in the expression of , equat. (2), and likewise put ; then,
In order to find (1, 2) and (1, 3) we have only to substitute 2 and 3 for in the expression of ; hence
which values will, in this case, be rendered some-
Equations. what more simple by combining them with the equation : and thus we get
The functions and are found by subtracting the indices of in the values of and from 5, which is equivalent to changing the signs of the indices: therefore,
And it will be found, by actually multiplying, that and .
These values being found, we have, according to the foregoing method,
and hence
wherefore we have
If, in this expression, we make , and substitute the numerical values of , and of and its powers, in the quantities under the radical sign, the result will coincide with the formula of Vandermonde, and with the calculation of Lagrange.
The expression just found being imaginary, if it be required to reduce it to a form fit for calculation, we must begin with substituting the values of and its powers in and : then
Now, , and
; also , and : wherefore,
Again,
consequently,
Hence
By making successively equal to 0, 1, 2, 3, 4, the formula will give all the ten cosines of a polygon
of 11 sides inscribed in a circle; because
termines also the order of the arcs to which the numerical quantities belong; so that when the value of one cosine is fixed, the values of all the rest are likewise ascertained.
This last formula coincides with the calculation of Legendre.
The next example shall be the case of .
Then, , , , and
; and, 3 being one of the primitive roots of 17, we may take . Now, arranging the powers of as in the last example, we have
and hence,
By substituting these numbers in the expression of A, and likewise by putting , we get
In order to have the functions (1, 2), (1, 3), (1, 4), nothing more is necessary than to substitute 2, 3, 4 for in the expression of A: then, observing that , , , we readily get
These values being found, we next have
therefore, making ,
In order to reduce this expression, we shall put
And, because , we get , and . Wherefore, by squaring,
Now, in the formula for , viz. , if we change into , no alteration will be produced, except that will change its sign; for, it is obvious, that
Hence, we readily deduce these two equations, viz.
If we suppose , then
wherefore,
And, when , then
wherefore,
Next, suppose , then
wherefore,
and, finally, making , we get
wherefore,
These formulae enable us to find the numerical values of all the cosines sought; observing always that is indeterminate, and varies with the primitive root from which the solution is deduced. (c. c.)
TABLE
OF THE
ARTICLES AND TREATISES
CONTAINED IN THIS VOLUME.
EDRISI, or ALDRISI.
EDUCATION.
EDWARDS (JONATHAN).
ELBA.
ELECTRICITY.
ELLIPTOGRAPH.
EMBANKMENT.
EMIGRATION.
ENTOMOLOGY.
ENTRE-DUERO-E-MINHO.
EQUATIONS (Addendum, p. 669).
ESSEXSHIRE.
EXCHANGE.
FAROE ISLANDS.
FERMANAGH, COUNTY.
FERMAT (PETER DE).
FICHTE (JOHN THEOPHILUS).
FILANGIERI (GAETAN).
FISHERIES.
FLUENTS, or INTEGRALS.
FLUIDS, ELEVATION OF.
FOOD.
FORSTER (JOHN GEORGE ADAM).
FOX (THE RIGHT HONOURABLE CHARLES JAMES).
FUNDING SYSTEM.
GALVANISM.
GALWAY, COUNTY.
GAS-LIGHTS.
GOVERNMENT.
| GRANADA, NEW. | HEREFORDSHIRE. |
| GREAT BRITAIN. | HERTFORD COLLEGE. |
| GREECE. | HERTFORDSHIRE. |
| GUATIMALA, GOETIMALA, or GUALTIMALA. | HEYNE (CHRISTIAN GOTTLOR). |
| GUIANA, or GUYANA. | HIMALAYA MOUNTAINS. |
| GUYTON DE MORVEAU (BARON LOUIS BERNARD). | HOLLAND. (See NETHERLANDS, KINGDOM OF). |
| HADDINGTONSHIRE. | HOLLAND, NEW. (See SOUTH WALES, NEW). |
| HAMPSHIRE. | HOME (JOHN). |
| HELIGOLAND. | HORTICULTURE. |
| HERCULANEUM. |
ERRATA.
Dissertation Second, p. 66, line 10, for "the quantities of matter are as the mean distances," read "the qualities of matter are as the orbits of the mean distances."
Page 316, col. 1, line 31, for "canal below," read "canal between."
— 318, col. 1, second line from bottom, for "the fluid will there," read "will therefore."
— col. 2, line 40, for "case where," read "case when."
— line 56, read H.
— 319, col. 1, line 40, for "outside of the force," read "outside of the four."
DIRECTIONS FOR PLACING THE PLATES.
| PLATE LXXIV. LXXV. LXXVI. LXXVII. LXXVIII. | to face page | 74 |
| — LXXIX. | 98 | |
| — LXXX. | 322 | |
| — LXXX.* | 444 | |
| — LXXXI. LXXXII. LXXXIII. LXXXIV. | 462 | |
| MAP OF EUROPE, | 200 | |
INDEX
Information ... the quantities of matter ... the ...
Page 218, col. 1, line 21, ... " " " " "
Page 218, col. 1, line 21, ... " " " " "
Page 218, col. 1, line 21, ... " " " " "
Page 218, col. 1, line 21, ... " " " " "
Page 218, col. 1, line 21, ... " " " " "
DIRECTIONS FOR MAKING THE PLATE
- 127. ...
- 128. ...
- 129. ...
- 130. ...
- 131. ...
- 132. ...
- 133. ...
- 134. ...
- 135. ...
MAP OF EUROPE