111
From Mathsoc wiki
Ma111 Algebra
Lecturer: Dr. Sinead Ryan
Website: Link
Set Theory
This was just a grounding with terminology in order to prepare for Group theory. Naive Set Theory was thus chosen over Axiomatic Set Theory.
Basic set theory
- Set: A set is a collection of objects.
- Subset: a set A is a subset of a set S if every element of A is an element of S
- Notation: Union, intersection, difference, Cartesian product.
Relations, ordering
- Relations: Have a non-empty set A, and non-empty subset R of A × A. If (a,b) is in R, then a is related to b, written a~b.
There are 4 properties of relations.
- 1)If a in A, then a~a (reflexive)
- 2)If a~b, then b~a (symmetric)
- 2') If a~b, and b~a, then a=b (anti-symmetric)
- 3)If a~b, and b~c, then a~c (transitive)
- Partial Ordering: A relation that is reflexive, anti-symmetric and transitive is a partial ordering, and is written a~b as "a is less than or equal to b".
- Linear Ordering: A linear ordering is a partial ordering such that if a, b are in A, then either a is less than or equal to b, or b is less than or equal to b.
- Hausdorff Maximal Principle: In any partially ordered set, every linearly ordered subset is contained in a maximal linearly ordered subset (i.e. if the subset was made any bigger, it would cease to be linearly ordered). This is equivalent to Zorn's Lemma, and so to the Axiom of Choice.
- Equivalence Relation: A relation that is reflexive, symmetric and transitive is an equivalence relation.
- Equivalence Class: The equivalence class of an element a is defined by
Mappings
- Mapping: A mapping is a relationship or rule that assigns to each element of one set, a uniquely determined element of another set.
Types of Mapping:
- Injective (one-to-one): A mapping is one to one when distinct elements of the first set are mapped to distinct elements of the second.
- Surjective (onto): A mapping is onto when for every element of the second set, there is at least one element of the first set that maps to it.
- Bijective: A mapping is a bijection when it is both one-to-one and onto; every element of the first set is mapped to every element of the second set, and these mappings are distinct.
- Composition of mappings: Mappings can be composed. Various relationships were stated or proved in class.
Integers, LIP, Euclidean Algorithm, gcd, prime stuff, congruence
- ... -2,-1,0,1,2....
- LIP (Least Integer Principle): A non-empty set of positive integers contains a least integer, which is unique.
- gcd (Greatest Common Divisor): A positive integer c is the greatest common divisor of a and b if: 1) c|a and c|b, and 2) any divisor of a and b is a divisor of c. Note here the notation c|a signifying "c divides a". Also, if gcd(a,b) exists it can be written in the form c = ma+ nb, for some integers m,n.
- Euclidean Division Algorithm
- Relatively prime: a,b relatively prime if gcd(a, b) = 1. In other words, 1 is the only number that divides both a and b.
- Prime number: One that is divided only by +/-1 and +/- itself.
- Prime stuff: Sirloin.
- Unique Factorisation Theorem: Every integer greater than two can be factored as a product of primes in a unique way.
- Congruence Modulo: Two integers are said to be "congruent modulo n" ("congruent mod n") if they both have the same remainder when divided by n. For example, 5 is congruent to 9 mod 2, as they both have remainder 1 when divided by 2.
Group Theory
Basic group properties
Definition: A non-empty set of elements G forms a group if on it there is defined a binary operation
such that the following four properties are satisfied:
- If
are in G, then
is in G (closure)
- If
are in G, then
(associativity)
- There is an element
in G such that
(identity)
- For every a in G, there is an
such that
(inverse)
More group properties
- Abelian: A group is Abelian if its binary operation is commutative. Another way,
- Finite Group: A group with a finite number of elements is called a finite group. A finite group may be expressed using a [Cayley Table]. A Cayley Table is a table of all possible combinations of elements in a group under the defined operation.
- Order: The order of a group or subgroup is the number of elements it contains, denoted
.
- Some Notation: For a in G, write
and so on. Note that that the use of power notation does not mean the group operation is multiplication; in a group under addition
- Cyclic: A cyclic group,G, is one such that if it has order n,
for some a an element of G. In other words G is generated by some element of G
- Order of an element: The order of an element of a group, a, is the order of the cyclic subgroup it generates, denoted
.
Examples Of Groups
: The integers mod n form a group under addition mod n. E.g.
: The symmetry group
of permutations of a set of n objects forms a group under composition of permutations.
E.g.
Subgroups, Cosets, Lagrange's Theorem
- Subgroup: A non-empty subset
of a group
is a subgroup if, under the binary operation in
,
is itself a group (i.e., it is closed, contains an identity element, and all elements in it have inverses. Associativity is generally assumed.)
- The centre of a group is an abelian subgroup in that the centre of a group
is the set
of all elements in
which commute with all the elements of
.
- Normal Subgroup: A subgroup
of a group
is a normal subgroup if for every
and
,
.
- Coset: If
is a group,
a subgroup of
, and
, then a left coset of
in
is defined by
. Intuitively, a right coset is of the form
.
- Quotient Group: If
a normal subgroup of
, then its cosets form a group under multiplication of cosets. This is called a quotient group, denoted
.
- Lagrange's Theorem:
a finite group,
a subgroup, then the order of
divides the order of
.
Permutations, transpositions, cycles
- Permutation: A permutation is a bijective mapping from a set to itself (Laytalk: Rearrangement). These can be written as transpositions, n-cycles, and can be either odd or even.
- Determining whether it is odd or even appears to be confusing as different people write permutations from left to right, or from right to left, (Sinead Ryan reads permutations from left to right, whereas Durbin reads from right to left). The nicest method I've found is in Anton's Elementary Linear Algebra,which involves inversions.
- An inversion is when a larger integer precedes a smaller one in a permutation. Reading from left to right, you count the number of inversions for each integer and add these up - if the sum is even, the permuation is even. If the sum is odd, the permuation is odd.
- e.g.
would be even as it has
inversions required to put it into the standard form of
.
Isomorphisms and Homomorphisms
- Isomorphism: An isomorphism from a group
to a group
is a map
which has the following properties:
is a bijection, and
, where the first * denotes the binary operation in
, and the second * denotes the binary operation in
.
Thus, the mapping preserves the structure of the groups, and they are said to be isomorphic. Essentially, for algebraic purposes, there is no real difference between them.
- Homomorphism: A mapping
from a group
to a group
is a homomorphism if for all

,
,where the first * denotes the binary operation in
, and the second * denotes the binary operation in
.
- Kernel: If
a homomorphism from
to
, then the kernel of
is
. In other words, the kernel contains all elements of the first group that are mapped to the identity element of the second group.
- The Fundamental Homomorphism Theorem
The Fundamental Homomorphism Theorem features a diagram in the shape of a triangle.
Attempts in class to explain how one might possibly actually use this were futile and went on for ages, with lots of pointing at a similar diagram.
In the diagram shown,
is the Kernel of
. This means the picture actually depicts the First Isomorphism Theorem. The Fundamental Homomorphism Theorem is technically stated for
,
a normal subgroup with
a subset of the Kernel.
Rings, fields
- Rings: Are circular. Newton has some. Sauron lost his. Johnny Cash had a burning one.
A ring is a set R equipped with two binary operations called addition and multiplication, such that:
- (R, +) is an abelian group with the identity element 0:
- (a + b) + c = a + (b + c)
- 0 + a = a + 0 = a
- a + b = b + a
- For every a in R, there exists an element denoted −a, such that a + −a = −a + a = 0
- (R, ·) is a monoid with identity element 1:
- (a·b)·c = a·(b·c)
- 1·a = a·1 = a
- Multiplication distributive law|distributes over addition:
- a·(b + c) = (a·b) + (a·c)
- (a + b)·c = (a·c) + (b·c)
- Fields: Contain cows.
A field is a commutative division ring.
Vector Spaces
Take a vector space... define another basis... Take a vector space.... define another basis...
I'm gonna take you to another dimension!
Properties, subspaces
- Vector Space: A vector space
is a set of objects on which 2 operations are defined: 1) vector addition and 2) scalar multiplication, and which satisfy the following axioms for
and
scalars from the field
over which the vector space is defined:
1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
- Subspace: A subset
of a vector space
is called a subspace of
if
is itself a vector space under the addition and scalar multiplication defined on
. In practice, to check if a subset is a subspace, you just need to check it is closed under addition of vectors and multiplication by scalars.
Linear combinations, span, linear independence, basis, dimension
- Linear Combination: A vector
is a linear combination of vectors
if there exist scalars
such that
- Span: If S is a set of vectors
, then the span of S is the set of all linear combinations of the vectors
- Linear Independence: Consider
and the equation
.
If the only solution to this equation is:
, then the vectors
are said to be linearly independent.
- Basis: A set of vectors from a vector space form a basis for that vector space if 1) they span the vector space, and 2) they are linearly independent.
- Dimension:
a vector space,
a basis. If
contains a finite number of elements, then
is finite dimensional and its dimension is the the number of vectors in
. Otherwise,
is infinite dimensional.
- Lots of Interesting Theorems: See Chapter 5, General Vector Spaces, of Elementary Linear Algebra, by Anton. I might type some of them up eventually.
Changing bases, transition matrices
- Coordinate Vector:
a basis for vector space
, and
. Then
and the scalars
are the coordinates of
relative to
. We write the coordinate vector
.
- Transition Matrix:
a finite dimensional vector space and
both bases for
.
Then the transition matrix from
to
is:
where the
column is the coordinates of
relative to
. Also,
is invertible and
is the transition matrix from
to
.
Inner Product Spaces
- Inner Product Space: A vector space
over a field
is an inner product space if for
there is defined an operation known as an "inner product" denoted
or
, an element of
, such that: 1)
, 2)
with equality iff
, 3)
.
As an example, the Euclidean Inner Product or dot product.
- Norm: The norm of
, an inner product space, is denoted
and equals
. It is analogous to the length of a vector.
- Properties of Norm:
iff
- Distance: The distance between
is given by
- Cauchy-Schwarz Inequality: If
, then
- Orthogonality:
, then
orthogonal to
if
.
- Orthogonal complement:
a subspace of
, the orthogonal complement of
is
. In other words, the orthogonal complement of a subspace consists of all vectors orthogonal to every element in the subspace.
- Orthonormal set: A set of vectors is an orthonormal set if each has norm=1, and if each is orthogonal to all the others. In addition, this means the vectors are linearly independent. Also, any finite dimensional inner product space has an orthonormal set as a basis.
Cross product
As demonstrated by MP Hammer.
Linear transformations, induced matrix
- Linear Transformation: A transformation (mapping) from one vector space
to another
is linear if for all
, and
a scalar, 1)
, and 2)
... in other words, a map is linear iff it preserves linear combinations.
- Induced matrix: Every linear map is uniquely specified by its effect on any basis. Consider
, basis for vector space
, and
, basis for vector space
. Then for any linear map
, there exist unique scalars
such that
. The
form the induced matrix of the linear map
with respect to the two bases.
Linear Algebra
Matrix multiplication
See Leaving Certificate.
Systems of linear eqns, augmented matrix, EROs, Gauss and Gauss-Jordan Elimination
- Linear Equation: A linear equations is an equation in which the unknowns appear to the first power only, and only as linear combinations. A system of linear equations is a collection of 2 or more linear equations. Essentially, these are just simultaneous equations, except now we will solve them using matrices.
- Augmented Matrix: Consider n linear equations, m unknowns:
...
Then one can form the augmented matrix
:
- Solving Linear Systems: Having formed the augmented matrix, one can then solve the system of linear equations by performing elementary row operations (EROs) on the matrix. These are: 1) scaling a row by a non-zero constant, 2) Swapping one row with another, and 3)Adding c times one row to another row. These EROs do not change the solution and are invertible.
The purpose of the EROs is to transform the matrix into either row-echelon form (Gaussian elimination) or reduced row-echelon form (Gauss-Jordan Elimination).
- In row-echelon form 1) any row of all zeros is at the bottom, 2)if a row is non-zero the first entry is a 1 (known as a leading 1) and 3), in any 2 non-zero rows, the leading 1 of the lower row is to the right of the leading 1 of the upper row.
- In reduced row-echelon form the above 3 properties hold but with the additional stipulation that if a column contains a leading 1, all other entries in the column are zero.
- Gaussian Elimination: In Gaussian Elimination, you reduce the matrix to row-echelon form, then use the coefficients in the matrix to solve for the
by back-substitution.
- Gauss-Jordan Elimination: In Gauss-Jordan Elimination you reduce the matrix to reduced row-echelon form, then use the leading ones to read off the values of the
.
Diagonal, triangular, symmetric matrices
- Diagonal matrix: A diagonal matrix is one with non-zero entries only on the main diagonal.
- e.g. of Diagonal Mx:
- e.g. of Diagonal Mx:
- Triangular matrix: An upper triangular matrix is one with non-zero entries only on the main diagonal and above it. A lower triangular matrix is one with non-zero entries only on the main diagonal and below.
- e.g of Lower
Mx.:
- e.g of Lower
- Symmetric Matrix: A Symmetric matrix is a square matrix, A, that is equal to its transpose.
- Skew-symmetric matrix: A Skew-Symmetric matrix is a square matrix A, that is equal to the negative of its inverse.
Finding inverse by EROs
Consider an invertible nxn matrix A. Then it is row equivalent to the identity matrix, and so can be transformed into the identity matrix by a series of elementary row operations. This is the same thing as multiplying the matrix by a series of elementary matrices. Hence, Ek....E2E1A = 1. Now we can multiply across by A-1 to get Ek....E2E11 = A-1. Hence, performing the same elementary row operations on the identity matrix transforms it into A-1. In practice form a matrix [ A | 1 ] and perform EROs on it until A has been transformed into the identity, then you will have performed the following transformation [ 1 | A-1 ].
Diagonal, triangular, symmetric matrices
- Diagonal matrix: A diagonal matrix is one with non-zero entries only on the main diagonal.
- Triangular matrix: An upper triangular matrix is one with non-zero entries only on the main diagonal and above it. A lower triangular matrix is one with non-zero entries only on the main diagonal and below.
- Symmetric Matrix: A Symmetric matrix is a square matrix, A, that is equal to its transpose.
- Skew-symmetric matrix: A Skew-Symmetric matrix is a square matrix A, that is equal to the negative of its inverse.
LU Decomposition
The idea is to decompose a given matrix A into a product of two matrices, one lower triangular and one upper triangular, so that A = LU. The relevant background theory is that if a matrix can be reduced to row echelon form U by elementary row operations (excluding row interchanges - this is important), then you can write U as a product of elementary matrices: Ek....E2E1A = U. Now these Ei are invertible, and all lower triangular. So we can multiply across by the inverses to get: A = E1-1E2-1....Ek-1U
As the inverse of a lower triangular matrix is a lower triangular matrix, and the product of lower triangular matrices is a lower triangular matrix, we get: A = LU.
In practice, you reduce A to row echelon form U by elementary row operations (but no row interchanges), and then can form the matrix L by inspection. The diagonal entries of L are one divided by the multiplying factor that introduced the corresponding one on the diagonal of U, and the non-diagonal entries are the negative of the multiplying factor used to introduce the corresponding zero in U.
For Ax = b, we have LUx = b. Let Ux = y, solve Ly = b for y, and then solve Ux = y for x.
Row space, column space, null space
- Row/Column Vectors: For an mxn matrix A, the vectors formed from its rows are the row vectors of A. The vectors formed from the its columns are the column vectors of A.
- Row space: The subspace of Rn spanned by the row vectors of A is the row space of A, R(A).
- Column space: The subspace of Rm spanned by the column vectors of A is the column space of A, C(A).
- Nullspace: The subspace of Rn spanned by the solution vectors x of Ax=0 is the nullspace of A.
Useful Facts About These Subspaces
- Elementary row operations do not change the nullspace or rowspace of a matrix. They can, however, changed the column space.
- If A and B are row equivalent matrices, then a set of column vectors that are linearly independent in A iff the same set of column vectors in B are linearly independent. Similarly, a set column vectors of A form a basis for the column space iff the same vectors in B do.
- If a matrix R is in row-echelon form, the row vectors with the leading ones form a basis for the row space of R, and the column vectors that contain these leading ones form a basis for the column space of R.
- The row space and column space of a matrix A have the same dimension. This is known as the rank of A, rank(A).
- The dimension of the nullspace is called the nullity of A, denoted nullity(A).
- If a matrix A has n columns, then rank(A) + nullity(A) = n.
- Fundamental System of Linear Systems: Consider m equations, n unknowns, written Ax=b, with augmented matrix Ag. Then Ax=b has a solution iff rank(A)=rank(Ag). It has a unique solution iff rank(A)=rank(Ag)=n. It has infinite solutions iff rank(A)=rank(Ag)<n.
- The nullspace of A and the rowspace of A are orthogonal complements in Rn with respect to the Euclidean Inner Product. This also implies the nullspace of AT and the column space of A are orthogonal complements in Rm.
Determinants, Cramer’s Rule
- Determinant: A determinant is a function that maps every
matrix to a real number. It is defined as the sum of the signed elementary products of the matrix. (Not a very helpful definition really).
- Cramer's Rule: Cramer's rule states that for a linear system
,
,
where
is the matrix resulting from replacing the
column of
with the column vector
Eigenvalues, eigenvectors
- Eigenvector: An eigenvector is a vector whose direction is unchanged when multiplied by a matrix. Thus for a vector
and matrix
,
with
a scalar.
Now, rewriting this gives
, where
is the identity matrix. We want this equation to have non-trivial solution, i.e. for
to not be zero. In other words, we want the matrix
to not be invertible, or that
- this is known as the characteristic equation of A.
Solving it gives the values of
for which
, these are known as the eigenvalues of
.
Subbing the eigenvalues back into
allows you to determine the corresponding eigenvector for each eigenvalue.
nxn matrices
- For an nxn matrix, the following are equivalent:
- A is invertible
- Ax=0 has only the trivial solution, x=0
- The reduced row-echelon form of A is the identity
- A is expressible as a product of elementary matrices
- Ax=b is consistent for all nx1 matrices b, and has exactly one solution for each b
- det(A) is non-zero
- The column vectors of A are linearly independent and span Rn; so they form a basis for Rn
- The same is true for the row vectors of A
- The rank of A is n, and the nullity is zero
- The orthogonal complement of the nullspace is Rn, that of the rowspace is {0}
- ATA is invertible
- 0 is not an eigenvalue of A
Note: the phrase "are equivalent" here means that if one of these statements holds for a particular nxn matrix A, then so do all the others... it does not mean they hold for all nxn matrices...
Diagonalising matrices, orthogonal diagonalisation, Gram-Schmidt Process
- Diagonalisation: An nxn matrix
that has n linearly independent eigenvectors is diagonalisable; that is, there exists an nxn matrix
such that
,
a diagonal matrix. We say that
diagonalises
.
In practice, one forms the matrix
(called the modal matrix) consisting of the eigenvectors of
, and then
is a diagonal matrix with the eigenvectors of
on the diagonal.
- Orthogonal Matrix: An orthogonal matrix is one such that
. Orthogonal matrices preserve norms and dot products;
. They are also used to diagonalise symmetric matrices.
- Orthogonal Diagonalisation: If A is a symmetric (
) matrix, then
can be orthogonally diagonalised. That is, there exists a matrix
such that
.
In practice, one determines the eigenvalues and eigenvectors of A, then uses Gram-Schmidt Orthogonalisation to orthogonalise the eigenvectors (note that distinct eigenvectors corresponding to distinct eigenvalues of a symmetric matrix are already orthogonal). The next set is to form an orthonormal set by dividing each newly-orthogonal vector by its norm. Then one forms the matrix
with these orthonormal vectors as columns. This
is such that
, and
has the eigenvalues of
on the diagonal.
- Gram-Schmidt Orthogonalisation: This is a process that orthogonalises vectors by subtracting off the projection of all previous vectors onto each vector in question. Given a set of vectors
one can construct an orthogonal set
using the following procedure:
and so on.
QR Decomposition
- QR Decomposition: If A is an nxm matrix with linearly independent column vectors, then A can be factored as A = QR, with Q an mxn matrix with orthonormal column vectors, and R an nxn invertible upper triangular matrix. This can be shown by construction; Sinead Ryan made some mistakes doing it in class so learn it out of a book so that you get the correct version.
The basic idea is to use the Gram-Schmidt process to orthogonalise the column vectors of A and so to obtain an orthonormal set of qi, with which you form the matrix Q. Then you can form a matrix R, whose entries are obtained from a series of dot products of ui and qi. I won't attempt to notate it here in depth. Go read Anton.
Least squares
The least squares problem involves a linear system
that is not consistent... a solution
cannot be found that accurately fits with A and b. Instead a "best fit" x can be found.
Essentially, one can think of it as follows: Let W denote the column space of A. Then as the vector x ranges over all
, the vector Ax varies over all of W (as Ax is a linear combination of the columns of A). We wish to find a vector x such that Ax is as close as possible to b.
Rather intuitively, the vector Ax that is closest to b is the orthogonal projection of b on W.
Thus, we have Ax=projWb. Now, note that b - projWb = b - Ax is orthogonal to the column space W. This means that b - Ax is in the nullspace of AT.
Hence,
Or,
.
This is known as the normal system associated with Ax = b. A solution x of this system is known as a least squares solution of Ax = b. If A has linearly independent columns, then the normal system has a unique solution.
Band and block matrices
- Band Matrix: A Band matrix is a matrix A with elements that are non-zero only on the main diagonal & adjacent diagionals.
- Block Matrix: A Block matrix A element of R(mxn) is a qxr block matrix if can be written as a qxr collection of blocks.
Jordan form
- Jordan Form: An nxn matrix A that does not have n linearly indepedent eigenvectors cannot be diagonalised. However, it can be transformed into an almost diagonal form, known as Jordan form, or canonical form, J. This matrix J has the eigenvalues of A on the main diagonal, and some ones on the superdiagonal (just above the main diagonal).
Matrix Norms
- Matrix Norm: a mapping
is a norm of an mxn matrix
if for
and
there is a vector norm, and 1)
,
2)
, and 3)
. The norm is denoted
- Frobenius norm:
.
In words, the square root of the sum of the squares of each value.
- p-norms:
.The p-norm for vectors is
For matrices, it is common to use
which works out as just the maximum absolute column sum (i.e. for a column, add up the absolute value of each entry. Do this for all columns and take the maximum), or
, which is the maximum absolute row sum.
- Condition number: The condition number of a matrix A is given by
for a given norm. If the condition number is large compared to the size of the entries in A, then the matrix is said to be badly conditioned, and a small change in the value of one of the entries can cause large changes in the solution set of the matrix.
Cayley-Hamilton Theorem And Applications
- Cayley-Hamilton Theorem: The Cayley-Hamilton Theorem states that every square matrix
satisfies its own characteristic polynomial.
Applications
- Reducing Order of a Polynomial:
Given a polynomial
, you can reduce its order using the C-H Theorem. Denote the characteristic polynomial of nxn matrix
by
.
Then
,
where
is the quotient formed from dividing
into
by polynomial division, and
is the remainder from the same operation.
Now, write this for the matrix
.
.
But as
satisfies its own characteristic polynomial,
. Hence,
. Note that
is always of degree n-1 or less.
- Determining Analytic Functions of a Matrix: Consider a scalar function
analytic in some area of the complex plane. Then in that region we can express the function as a polynomial:
Consider now the nxn matrix
with characteristic polynomial
and eigenvalues
Then we can write
And so
since
And as
is of degree n-1 or less, this means
and since we know the
this defines a set of simultaneous equations which we can solve for the
Then
So
and as we now know the
we can work out
.
- Determining Exponential of a Matrix:
Hermitian Matrices
- A Hermitian Matrix (or self-adjoint matrix) is a square matrix which is equal to its own conjugate transpose.
- i.e
.
- i.e
- A skew-hermitian matrix is one which is equal to the negative of its conjugate transpose ie
.
- NB Sinead Ryan denoted the operation of transposing the conjugate of a matrix
as
but this can be quite confusing because
usually denotes a hermitian matrix itself, not the operation.
External Links
Gilbert Strang's Video Lectures Well-recommended. Short on proofs but full of concepts.
Sinead Ryan's 111 Website (06/07)
O'Dunlaing's 111 Website (05/06)
David Wilkins' 111 Website (96/97)
References
My notes from Sinead Ryan's lectures and my own vague understanding of algebra.
Modern Algebra by John Durbin, 5th Edition
Elementary Linear Algebra, by Howard Anton, 9th Edition
Linear Algebra and Its Applications by Gilbert Strang, 8th Edition

