Sir William Rowan Hamilton
[Proceedings of the Royal Irish Academy, Nov. 11, 1844,
vol. 3 (1847), 1-16]
In the theory which Sir William Hamilton submitted to the Academy in November, 1843, the name quaternion was employed to denote a certain quadrinomial expression, of which one term was called (by analogy to the language of ordinary algebra) the real part, while the three other terms made up together a trinomial, which (by the same analogy) was called the imaginary part of the quaternion: the square of the former part (or term) being always a positive, but the square of the latter part (or trinomial) being always a negative quantity. More particularly, this imaginary trinomial was of the form ix + jy + kz, in which x, y, z were three real and independent coefficients, or constituents, and were, in several applications of the theory, constructed or represented by three rectangular coordinates; while i, j, k were certain imaginary units, or symbols, subject to the following laws of combination as regards their squares and products,
but were entirely free from any linear relation among themselves; in such a manner, that to establish an equation between two such imaginary trinomials was to equate each of the three constituents, xyz, of the one to the corresponding constituent of the other; and to equate two quaternions was (in general) to establish FOUR separate and distinct equations between real quantities. Operations on such quaternions were performed, as far as possible, according to the analogies of ordinary algebra; the distributive property of multiplication, and another, which may be called the associative property of that operation, being, for example, retained: with one important departure, however, from the received rules of calculation, arising from the abandonment of the commutative property of multiplication, as not in general holding good for the mixture of the new imaginaries; since the product ji (for example) has, by its definition, a different sign from ij. And several constructions and conclusions, especially as respected the geometry of the sphere, were drawn from these principles, of which some have since been printed among the Proceedings of the Academy for the date already referred to.
The author has not seen cause, in his subsequent reflections on the subject, to abandon any of the principles which have been thus briefly recapitulated; but he conceives that he has been enabled to present some of them in a clearer view, as regards their bearings on geometrical questions; and also to improve the algebraical method of applying them, or what may be called the CALCULUS OF QUATERNIONS.
Thus he has found it useful, in many applications, to dismiss the separate consideration of the three real constituents, x, y, z, of the imaginary trinomial ix + jy + kz, and to denote that trinomial by some single letter (taken often from the Greek alphabet). And on account of the facility with which this so called imaginary expression, or square root of a negative quantity, is constructed by a right line having direction in space, and having x, y, z for its three rectangular components, or projections on three rectangular axes, he has been induced to call the trinomial expression itself, as well as the line which it represents, a VECTOR. A quaternion may thus be said to consist generally of a real part and a vector. The fixing a special attention on this part, or element, of a quaternion, by giving it a special name, and denoting it in many calculations by a single and special sign, appears to the author to have been an improvement in his method of dealing with the subject: although the general notion of treating the constituents of the imaginary part as coordinates had occurred to him in his first researches.
Regarded from a geometrical point of view, this algebraically imaginary part of a quaternion has thus so natural and simple a signification or representation in space, that the difficulty is transferred to the algebraically real part; and we are tempted to ask what this last can denote in geometry, or what in space might have suggested it.
By the fundamental equations of definition for the squares and products of the symbols i, j, k, it is easy to see that any (so-called) real and positive quantity is to any vector whatever, as that vector is to a certain real and negative quantity; this being indeed only another mode of saying that, in this theory, every vector has a negative square. Again, the product of any two rectangular vectors is a third vector at right angles to both the factors (but having one or other of two opposite directions, according to the order in which those factors are taken); a relation which may be expressed by saying, that the fourth proportional to the real unit and to any two rectangular vectors is a third vector rectangular to both; or, conversely, that the fourth proportional to any three rectangular vectors is a quantity distinct from every vector, and of the kind called real in this theory, as contrasted with the kind called imaginary.
Now, in fact, what originally led the author of the present communication to conceive (in 1843) his theory of quaternions (though he had, at a date earlier by several years, speculated on triplets and sets of numbers, as an extension of the theory of couples, or of the ordinary imaginaries of algebra, and also as an additional illustration of his view respecting the Science of Pure Time), was a desire to form to himself a distinct conception, and to find a manageable algebraical expression, of a fourth proportional to three rectangular lines, when the DIRECTIONS of those lines were taken into account; as Mr. Warren and Mr. Peacock had shewn how to conceive and express the fourth proportional to any three lines having direction, but situated in one common plane. And it has since appeared to Sir William Hamilton that the subject of quaternions may be illustrated by considering more closely, though briefly, this question of the determination of a fourth proportional to three rectangular directions in space, rather in a geometrical than in an algebraical point of view.
Adopting the known results above referred to, for proportions between
lines having direction in a single plane (though varying a little the
known manner of speaking on the subject), it may be said that, in the
horizontal plane, ``West is to South as South is to East,'' and
generally as any direction is to one less advanced than itself in
azimuth by ninety degrees. Let it be now assumed, as an extension of
this view, that in some analogous sense there exists a fourth
proportional to the three rectangular directions, West, South, and Up;
and let this be called, provisionally, Forward, by contrast to
the opposite direction, Backward, which must be assumed to be
(in the same general sense) a fourth proportional to the directions of
West, South and Down. We shall then have, inversely, Forward to Up as
South to West, and therefore, as West to North: if we admit, as it
seems natural and almost necessary to do, that (for directions, as for
lengths) the inverses of equal ratios are equal; and that ratios equal
to the same ratio are equal to each other. But again, Up is to South
as South to Down, and also as North to Up: and we can scarcely avoid
admitting, or defining, that (in the present comparison of directions)
ratios similarly compounded of equal ratios are to be considered as
being themselves equal ratios. Compounding, therefore, on the one
hand, the ratios of Forward to Up, and of Up to South; and on the
other hand the respectively equal (or similar) ratios of West to
North, and of North to Up, we are conducted to admit that Forward is
to South as West to Up. By a reasoning exactly similar, we find that
Forward is to West as Up to South; and generally that if X, Y, Z
denote any three rectangular directions such that A:X::Y:Z, A here
denoting what we have expressed by the word Forward, then also
A:Y::Z:X (and of course, for the same reason A:Z::X:Y); so that
the three directions may be all changed
together by advancing them in a ternary cycle, according to
the formula just written, without disturbing the proportionality
assumed. But also, by the principle respecting proportions of
directions in one plane, we may cause any two of the three
rectangular directions XYZ to revolve together round the third, as
round an axis, without altering their ratio to each other. And by
combining these two principles, it is not difficult to see that
because Forward has been supposed to be to Up as South to
West, therefore the same (as yet unknown) direction
``Forward'' must be supposed to be to any direction X
whatever, as any direction Y, perpendicular to X,
is to that third direction Z which is perpendicular to
both X and Y, and which is obtained from Y by a right-handed
(and not by a left-handed) rotation, through a right angle, round X;
in the same manner as (and because) the direction West was so chosen
as to be to the right of South, with reference to Up as an axis of
rotation. Conversely we must suppose that if any three
rectangular directions,
, be arranged, as to order of
rotation, in the manner just now stated, then Z:Y::X:A; or in other
words, we must admit, if we reason in this way at all, that the
direction called already Forward, will be the fourth proportional to
. And if we vary the order, so as to have Z to the left,
and not to the right of Y, with reference to X, then will the
fourth proportional to
become the direction which we have
lately called Backward, as being the opposite to that named Forward.
Again, since Forward is to Up as South to West, that is in a ratio compounded of the ratios of South to East and of East to West, or in one compounded of the ratios of West to South, and of any direction to its own opposite; or, finally, in a ratio compounded of the ratios of Up to Forward and of Forward to Backward, that is, in the ratio of Up to Backward, we see that the third proportional to the directions Forward and Up is in the direction Backward: and by an exactly similar reasoning, with the help of the conclusions recently obtained, we see that if X be ANY direction in tridimensional space, then A:X::X:B; B here denoting, for shortness, the direction which has been above called Backward.
The geometrical study of the relations between directions in space,
combined with a few very simple and guiding principles respecting the
composition of relations generally, might therefore have led to the
conception or assumption of a certain pair of contrasted
directions, namely, those which we have called Forward and Backward,
and denoted by the letters A and B. And these are such that if we
conceive a quantitative element to be combined with each, and give the
name of POSITIVE UNITY to the unit of magnitude measured in the
direction of Forward, but that of Negative Unity to the
same magnitude measured backward; and if we extend to this
positive unity and to lines having direction in space the received
definitions of multiplication, that ``Positive Unity is to Multiplier
as Multiplicand is to Product,'' and that ``the product of two equal
factors is the square of either;'' we may then consistently and
naturally be led to assert the same results as those already
enunciated from the theory of quaternions respecting the product of
two vectors, in the two principal cases, first, where those two
vectors are rectangular, and second, where they are coincident with
each other. And thus may we justify, or at least interpret and
explain, the fundamental definitions (A) (B) (C) of this theory, by
regarding the symbols as denoting three vector-units having
three rectangular directions in space.
But, farther, we derive from this view of the whole subject an
illustration (if not a confirmation) of the remarkable conclusion that
the so-called real and positive unit +1 is not (in this
theory) to be confounded with any vector unit whatever, but is
to be regarded as of a kind essentially distinct from every vector.
For this positive unit +1 is in the direction above called Forward,
and denoted by A. Now if this could coincide with a direction X
in tridimensional space, then, whatever this latter direction might be
supposed to be, we could always, by the general formula A:X::Y:Z
(where X is arbitrary), deduce the inadmissible proportion
X:X::Y:Z, in which the two directions in one ratio are identical,
but those in the other are rectangular to each other. If then we
resolve to retain the assumption of the existence of a fourth
proportional A to three rectangular directions in space, as subject
to be reasoned on at all in the way already described, and as
determined in direction by its contrast to its own opposite B
(corresponding to an opposite order of rotation in the system
), we must think of these two opposite directions A and B
as merely laid down upon a scale, but must abstain from
attributing to this SCALE any one direction rather than another
in tridimensional space, as having such or such a zenith distance, or
such or such an azimuth, rather than such or such another. And the
progression on this scale from negative to positive infinity,
obtained by combining a quantitative element with the contrast between
two opposite directions, corresponds less to the conception of
space itself (though we have seen that considerations of space
might have suggested it) than to the conception of time; the
variety which it admits is not tri- by uni-
dimensional; and it would, in the language of some philosophical
systems, be said to appertain rather to the notion of
intensive than of extensive magnitude. Though answering
precisely to the progression of the quantities called real in
algebra, it has, when viewed from the geometrical side, somewhat the
same sort of imaginariness, and yet (it is believed) of
utility, as compared with lines in space, which the square root of an
ordinary negative has, when compared with positive and negative
quantities. This analogy becomes still more complete when we observe
that (in this theory) the fourth proportional to any direction X in
space, and either of the two directions A or B upon the scale, is
the direction opposite to X; so that, if a vector-unit in
any determined direction X had been taken for positive unity,
then each of the two scalar units in the directions A and
B (in common, it is true, with every vector-unit perpendicular to
X) might have been called, by the general nomenclature of
multiplication, a square root of negative one.
It is, however, a peculiarity of the calculus of quaternions, at least as lately modified by the author, and one which seems to him important, that it select no one direction in space as eminent above another, but treats them as all equally related to that extra-spatial, or simply SCALAR direction, which has been recently called ``Forward.'' In this respect it differs in its processes from the Cartesian method of coordinates, and seems often to admit of being more simply and directly applied to the treatment of geometrical problems, because it requires no previous selection of axes, rectangular or other. The author is, indeed, aware that the cooperation of other and better analysts will be necessary in order to bring the method of quaternions to anything approaching to perfection. But he hopes that an instance or two of the facility with which some questions at least allow themselves to be treated by this method, even in its present state, may not be without interest to the Academy. And he conceives that two examples in particular, one relating to the composition of translations, and the other to the composition of rotations in space, may usefully be selected for statement on the present occasion.
As preliminary illustrations of the operations employed, it may be remarked that for any system of lines having direction in space, it is required by many analogies (and is, for lines in one plane included among the definitions or results of the theories of Mr. Warren and Mr. Peacock), that the sum should be regarded as being equal to that one line which constructs or represents the total effect of all the different rectilinear motions which are expressed by the different summands. Vectors are therefore to be added to each other by a certain geometrical composition, exactly analogous to the composition of motions, or of forces, and following the same known rules. Scalars, on the other hand (that is to say, the so-called real parts of any proposed quaternions), admitting only of a progression in quantity, and of a change of sign, without any other changes of direction, are to be added among themselves by the known rules of algebra, for the addition of positive and negative numbers. The addition of a scalar and a vector to each other can be no otherwise performed, or rather indicated, than by writing their symbols with the + sign interposed; each being, as we have seen, in some sense, imaginary with respect to the other. These operations of addition are all of the commutative, and also of the associative kind; that is to say, the order of all the summands may be changed, and any group of them may be collected or associated into one partial sum.
Scalars are multiplied, as well as added, by the known rules of ordinary algebra, for the multiplication of real numbers, positive or negative; because the positive unity of the system has been assumed to be itself a scalar, and not a vector unit.
For the same reason, to multiply any vector by any scalar a,
is in general to change its length in a known ratio, and to preserve
or reverse its direction, according as a is > or < 0; the
product is therefore a new vector, which may be denoted by .
The same new vector is obtained, under the form
, when we
multiply the scalar a by the vector
. If
and
be two quaternion factors, of which a and b are the
scalar parts, and
,
the vectors, then with a view to
preserving the distributive character of multiplication, it is natural
to define that the product may be distributed into the four following
parts:
And if the multiplicand vector be decomposed into two parts,
or summands, one
and in the direction of the multiplier
, or in the direction exactly opposite thereto, and the other
, and in a direction perpendicular to the former (so that
and
are the projections of
on
itself, and on the plane perpendicular to
), then it may be
farther defined that the multiplication of any one
vector
by any other vector
may be
accomplished by the formula
in which, by what has been shewn, the partial product
is to be considered as equal to a scalar, namely, the
product of the lengths of
and
, taken with the sign
- or +, according as the direction of
coincides with, or
is opposite to that of
; while the other partial product
is a vector, of which the length is the product of
the lengths of
and
, while its direction is
perpendicular to both of their's, being obtained from that of
, by making it revolve right-handedly through a right angle
round
as an axis. These definitions, which are compatible
with the formulæ (A), (B), (C), and may serve to replace them,
will be found sufficient to prove generally, and perhaps with somewhat
greater geometrical clearness than those formulæ, the distributive
and associative properties of quaternion multiplication, which have
already been stated to exist. They give easily the following
corollaries, which are of very frequent use in this calculus:
A and B denoting here the lengths of the lines
and
, and (A,B) the angle between them; while
is a vector-unit perpendicular to their plane, and such that
a right-handed rotation, equal to the angle (A,B), performed round
, would bring the direction of
to coincide with that
of
. For example, when
, then B = A,
(A,B) = 0, and
so that the length A of any vector , in this theory, may be
expressed under the form
More generally we have the equation
when the lines and
are coincident or opposite in
direction; while, on the contrary, the condition for their being at
right angles to each other is expressed by the formula
These simple principles suffice to give, in a new way, algebraical
solutions of many geometrical problems, of various degrees of
difficulty and importance. Thus, if it be required, as an easy
instance, to determine the length of the resultant of several
successive rectilinear motions, or the magnitude of the statical sum
of several forces acting together at one point, as a function of the
amounts of those successive motions, or of those component forces, and
of their inclinations to each other, we have only to denote the
components by the vectors , and
their sum by
, the corresponding magnitudes being
, and A; and the equation
will give, by being squared,
that is, by the foregoing principles (after changing all the signs),
a known result, it is true, but one which can scarcely be derived in any other way by so very short a process of calculation. For it is not quite so easy, on the algebraical side of the question, to see that
however easy this may be, as it is to see that
although the geometrical interpretation of the first of these two formulæ is of course more obvious than that of the latter, to those who are familiar with the method of coordinates, and not with the method of quaternions.
Again, let us consider the more difficult problem of the composition
of any number os successive rotations of a body, or, at first, of any
one line thereof, round several successive axes, through any angles,
small or large. Let the axis of the first of these rotations have the
direction of the vector-unit , (
), and let the
amount of the positive rotation round this axis be denoted by a,
which letter here represents still a scalar or real number. Let
be the revolving line, considered in its original position;
the same line, after it has revolved through the angle a
round the axis
. The part, or component, of
, which is
in the direction of this axis, is that which was denoted lately by
; and the formula (a), when multiplied by
, gives, as an expression for this part,
because it has been supposed that . This part of
remains unaltered by the rotation. The other part, or
component of
, is, in like manner, by (b),
and this part is to be multiplied by , in order to find the
part of
, which is perpendicular to
, but in the plane
of
and
. Again, multiplying by
, we cause
to turn through a right angle in the positive direction
round
, and obtain, for the result of this rotation,
an expression which is the half of that marked (b), and which is to be
multiplied by , in order to arrive at the remaining part of
the sought line
, namely, the part which is perpendicular to
the plane of
and
. Collecting, therefore, the three
parts, or terms, which have been thus separately obtained, we find,
that is,
the operations here indicated being thus sure to make no change in the
part , which is in the direction of the axis of rotation, but
to cause the other part
to revolve round that axis
through an angle = a. Again, let the same line
revolve
round a new axis of rotation denoted by a new vector unit
,
through a new angle a', into a new position
; we shall have,
in like manner,
and so on, for any number n of rotations. Let the last position of
be denoted by
; and since it can easily be proved, by
the theory of multiplication of quaternions, that the continued
products which present themselves admit of being thus transformed:
in which is a new vector unit, and
a new real angle,
we find that the result of all the n rotations is of the form
It conducts, therefore to the same final position which would have
been attained from the initial position , by a single
rotation
, round the single axis
; the
amount and axis of this resultant rotation being determined by either
of the two equations of transformation (k), and being independent of
the direction of the line
which was operated on, so that they
are the same for all lines of the body.
If the present results be combined with the theorem marked (R), in the account, printed in the Proceedings of the Academy, of the remarks made by the Author in November, 1843, it will at once be seen that if the several axes of rotation be considered as terminating in the points of a spherical polygon, and if the angles of rotation be equal respectively to the doubles of the angles of this polygon (and be taken with proper signs or directions, determined by those angles), then the total effects of all these rotations will vanish; or, in other words, the body will at last be brought back to the position from which it set out.
Finally, it may be mentioned that the author is in possession of a general method for expressing by quaternions the tangent planes and normals to curved surfaces, and that in applying this method to find the cone of tangents enveloping a given sphere, and drawn from a given point, the geometrical impossibility of the problem, when the point is an internal one, is expressed by the square of a vector becoming in this case positive.
Links:
D.R. Wilkins