113
From Mathsoc wiki
| Course Name: | 1111 Linear Algebra |
| Lecturer: | Dr. Vladimir Dotsenko |
| Lecturer's Page: | http://www.maths.tcd.ie/~vdots/ |
| Course Page: | http://www.maths.tcd.ie/~vdots/indexLinearAlgebra.html |
| Course Description: | http://www.maths.tcd.ie/pub/official/Courses08-09/113.html |
- Notes on course 111 (abstract + linear algebra), 2006-2007 are here
- A page with some worked examples of general Linear Algebra questions can be found here
- Strang's Linear Algebra is a great book for revision or just looking at things from a different (slightly more applied) perspective.
Contents |
Matrices
Inroduction
- A finite set of linear equations in the variables
is defined as a System of Linear Equations. The general form for a System of Linear Equations is given by:
|
|---|
|
|
|
- A Solution Set of a System of Linear Equations is given by
where
.
- Lemma: Given any System of Linear Equations, exactly one of the following is true:
- The system has no solutions,
- The system has exactly one solution set, or
- The system has infinitely many solution sets.
- Given a System of Linear Equations, the corresponding Coefficient Matrix is given by:
- Given a System of Linear Equations, the corresponding Augmented or Extended Matrix is given by:
- Given any Matrix, there are three operations that we may perform on the system or on the augmented matrix that will not alter the solution set of the matrix. They are called Elementary Row Operations and are:
- Multiplying a row by a non zero constant,
- Interchanging two rows, and
- Adding a multiple of one row to another.
- In any non-zero row of a matrix, if we multiply the row by the inverse of the first non-zero term, we call the first 1 in the row the leading 1.
- For any matrix to be in Row echelon form the following properties must hold:
- All non-zero rows must have a Leading 1,
- For any row, call it
, the leading 1 in the row beneath
must be further to the right than the Leading 1 in
, and
- All rows consisting entirely of zero elements are grouped at the bottom, and essentially can be ignored.
- As well as the above properties, for any matrix to be in Reduced Row Echelon Form, for each column containing a Leading 1, the Leading 1 must be the only element in that column.
- By bringing the augmented matrix of a system of linear equations to reduced row echelon form and by comparing it to the original system, we can see that the variables corresponding to the Leading 1's can be solved. This is called Gauss-Jordan Elimination.
- By bringing the augmented matrix of a system of linear equations to row echelon form, we solve the system by solving the lowermost equations, and working upwards, substituting where applicable. This is called Gaussian Elimination with Back Substitution. As with Gauss-Jordan Elimination, the solution set may be in parametric form.
- A Homogeneous System of Linear Equations is given by:
|
|---|
|
|
|
- The Trivial Solution is where every element of the solutions set is equal to 0, i.e.
- Theorem: A Homogeneous System of Linear Equations with more variables than equations has infinitely many solutions.
Matrix Operations
- Given any matrix
we define the main diagonal to be the line containing the elements
.
- Given any two matrices
and
, (noting the same sizes), we define the sum of the matrices by adding the corresponding entries of the two matrices, i.e.:
- Given any matrix
a constant number
, the product
is given by
- Given any two matrices
and
, (noting the common dimension of
), we define the
entry of the product
as follows
- The transpose of a matrix
is denoted by
and has the following property (note the reversal of
and
):
- Lemma: For any invertible matrices
and
with
where
:
- The trace of a matrix
is defined as
- Theorem: For all matrices and constants where defined, the following laws hold:
- Commutativity for addition,
- Associativity for addition and multiplication, and
- Distributive Laws, on both the left and the right sides.
Types of Matrices
- Any matrix that consists entirely of zeroes is called the Zero Matrix and is denoted by
. It acts as an additive identity.
- Any square matrix consisting of all elements on the main diagonal being 1 and zeroes elsewhere is called the Identity Matrix and is denoted by
. It acts as a multiplicative identity.
- For any square matrix, denoted
, if there exists a matrix
such that
, where
is the Identity Matrix, then
is said to be invertible, and
is the inverse of
, more commonly denoted by
.
- Lemma: Inverses are unique.
- Theorem: For any number of invertible matrices, the inverse of the product is equal to the product of the inverses in reverse order.
- Lemma: For any square matrix
, the identity
and
:
-
-
times
-
if
is invertible
-
-
- Lemma: For any invertible matrix
and
where
:
- Lemma: For any number of matrices, the transpose of the product is equal to the product of the transposes in reverse order.
- An elementary matrix is any square matrix from which the identity matrix may be derived from exactly one elementary row operation. An elementary matrix is usually denoted by
.
If
is an elementary matrix, and
is any square matrix, then
is the matrix
under the elementary operation that is performed on
to make it elementary.
- Theorem: The following statements are equivalent:
-
is an invertible matrix.
-
only has the trivial solution for any column matrix
.
- The reduced row-echelon form of
is the identity matrix.
-
can be expressed as a product of elementary Matrices.
-
is consistent for every column matrix
.
-
has exactly one solution for every column matrix
.
Vectors
Introduction
- Let
,
. These vectors have starting points in the origin. The vector that connects the endpoint of
to the endpoint of
is given by
. The vectors
and
are said to be equal iff
for all
.
- Addition and scalar multiplication are defined by
and
where
.
Also, note that for vectors and multiplicative scalars, the following properties hold:
- Commutativity for addition,
- Associativity for addition and multiplication,
- Additive and multiplicative identities exist, and
- Distributive Laws, on both the left and the right sides.
- Given
, the Norm of the vector
is given by
- Let
,
. The distance between these vectors is given by
- Lemma: Given
and
,
.
- Given any two vectors in
or
, denoted as
and
the dot product of the two vectors is a scalar defined as:
= | | if and
|
|---|---|---|
| 0 | if or
|
Where
is the angle formed between the two vectors.
- Theorem: Given any
,
,
- Theorem: Given any
, and
The following hold:
-
-
-
-
, and
- If
and
are vectors in
and
, then the vector component of
parallel to
is given by
and the vector component of
orthogonal or perpendicular to
is given by
- Lemma: If
and
are vectors in
and
, then
- Lemma: Given a line
and a point
, the straight line distance from the line to the point is given by
.
Vector Spaces
- We define
to be the set of all the column vectors (matrices) that are of the form:
where
- Lemma: Given a vector space
, the additive identity vector, namely the zero vector, is unique.
- For any vectors
and constant
, we have
. Combining the above two makes what's called a linear combination:
if
, and
, then
.
- For any set of vectors,
, if there exists a set of constants,
, that are not entirely equal to
, such that
, we say that the set of vectors are linearly dependent. If such a set of constants do not exist, then we say that the set of vectors is linearly independent.
- For any set of vectors,
, we define the rank of the set to be the maximum number of linearly independent vectors in
, and we denote it by
.
- Theorem: Elementary row operations on a matrix do not change the rank of that matrix.
- A set of vectors
is said to be complete if any vector
can be represented as a linear combination of that set, i.e.
for a set of constants,
.
- A set of vectors
is called a basis if it is both linearly independent and complete.
- Lemma: Given any basis in
denoted by
and any vector
, there exists exactly one expression of
as a linear combination of the basis.
- Lemma: Any set of more than
vectors in
is linearly dependent.
- Theorem: For a set of vectors
if there exists an
such that
and if
can be expressed as a linear combination of the linearly independent set
, then
.
- The Kronecker Capelli Theorem: Given a system of linear equations described by the coefficient matrix
and the augmented matrix
, the system of equations is consistent iff
.
- For a vector space
, a subset
of
is said to be a Vector Subspace of
if it is closed under addition and scalar multiplication.
- The span of a set of vectors
in
is denoted
and is defined as the smallest subspace of
that contains all these vectors.
- Given a vector space
, the dimension of
is denoted
and is defined as the number of vectors that form a basis for
. If
is finite,
is said to be finite dimensional. V is called infinite dimensional otherwise.
- Lemma: Given a vector space
,
is well defined, i.e., any bases of
have the same number of elements.
- A set of vectors
is called a field over
if addition and multiplication of vectors is defined on this set, and if it has the following properties:
- Addition and multiplication are both associative,
- Addition is also commutative,
- The left and right distributive laws hold over addition/multiplication,
- Additive and multiplicative identities exist, and
- Additive inverses exist.
Explicitly, the following properties must hold for all
and
:
-
,
-
,
- There exists a vector
such that
,
- For all
there exists a
such that
,
-
-
- There exists a vector
such that
Lines and Points
- Given any plane in
the normal is defined as a non-zero vector that is perpendicular to the plane.
- Lemma: Given any plane in
if the normal is given by
and the normal intersects the plane at
then the general equation of the plane, called the point-normal form of the equation of the plane, is given by
which may also be written as
where d is a constant.
- Lemma: Given any plane in
if the normal is given by
and the normal intersects the plane at
, the equation of the plane may be written in terms of the vectors from the origin to
, denoted by
and to any point in that plane, denoted
. The vector form of the plane is thus given by
.
- In
, given the point
and the vector
, the parametric equations of all lines passing through the point
that are parallel to the vector
are given by:\\
|
|---|
|
|
where t\in\mathbb{R}
- The parametric vector equation of the line passing through
and parallel to
, where
is the vector given by
and
is the vector to any point on the line, is given by
where
.
- Lemma: Given a line
and a point
, the straight line distance from the line to the point is given by:
.
The Cross Product
- Given any two vectors in
denoted
and
the cross product of
and
is a vector defined as
- Theorem: Given any three vectors in
denoted
and
, and scalar
:
-
if
is orthogonal to
-
if
is orthogonal to
-
, Lagrange's Identity
-
-
-
-
-
-
-
- The basis for
is given by the three vectors
,
and
are called the unit vectors for
. Note that the Cross Product may be written as:
- Lemma: Given two vectors in
denoted
and
,
, where
is the angle between the vectors.
- Theorem: Given three vectors
, the scalar triple product is defined as
and is given by
- Lemma: Given two vectors
, then the area of the parallelogram determined by the two vectors is given by:
- Lemma: Given three vectors
, the volume of the parallelepiped determined by there three vectors is given by:
Linear Transformations
- A Linear Transformation is a function that maps a set of linear equations from
. Often, this function can be expressed by multiplication of a column vector and a matrix. For example, let the function map
, which can be written as
, where
and
. The matrix
is called the transition matrix between the domain
and the codomain
. In this case, we say that the mapping
is multiplication by
.
- A Linear Transformation
is said to be one-to-one if T maps distinct points in
to distinct points in
.
- If
is a matrix and
is multiplication by
, then the following statements are equivalent:
-
is an invertible matrix.
-
only has the trivial solution for any column matrix
.
- The reduced row-echelon form of
is the identity matrix.
-
can be expressed as a product of elementary Matrices.
-
is consistent for every column matrix
.
-
has exactly one solution for every column matrix
.
-
.
- The smallest subset of
that contains all possible values of
is
.
-
is one-to-one.
- A transformation
is called linear if the following properties hold for all
and scalar
and
- If
is a linear transformation and
are the standard basis vectors for
, then the standard matrix form for
is
- Given a linear transformation
, where
and
are vector spaces, the kernel of
is denoted
and is defined as
. The image of
is denoted
and is defined as
. The nullity of A is denoted
, and is defined as the dimension of the kernel of
.
- Lemma: Note, by their definitions, if
is one-to-one, then
has only one element, and if
is onto, then
.
- Lemma: Given a linear transformation
,
and
are subspaces of
and
respectively.
- Lemma: For any linear transformation
,
.
- Lemma: For any linear transformation
,
.
Other Spaces
Introduction to Eigenspace
- Let
be a square matrix. If
is non-zero a column vector such that
, then we say that
is an eigenvector of A, and
is its corresponding eigenvalue.
- Given equation
, we can rewrite this as
, where
is the identity matrix. The Characteristic Polynomial of
is denoted
and is defined as
. As
is defined to be non-zero, there are non-trivial solutions, and thus
.
- Theorem: The roots of the characteristic polynomial are the eigenvalues of
.
- A matrix
is said to be similar to a matrix
if there exists an invertible matrix
such that
.
- Lemma: If two matrices
and
are similar, then
.
- Theorem:
is similar to a diagonal matrix iff it has
linearly independent eigenvectors.
- Lemma: Eigenvectors of a matrix that correspond to different eigenvalues are linearly independent.
- Corollary: If all eigenvalues of a matrix
are pairwise distinct, then
is similar to a diagonal matrix.
- The Cayley-Hamilton Theorem: For any matrix A, \chi_A(A) = 0 .
Jordan Normal Form
- A Jordan Block is a square matrix which is composed of an eigenvalue along the main diagonal, and 1's on the superdiagonal.
The Jordan Normal Form of a matrix A is a matrix of the form
where
are jordan blocks. If all the eigenvalues of
are not distinct, the repeated eigenvalues are grouped together in one Jordan Block. For example, if the eigenvalue
has
corresponding eigenvalues, then the Jordan Block for
will be of size
.
- Proposition: Every matrix is similar to its jordan normal form. The transition matrix
such that
is the matrix whose columns are the eigenvectors of
. One method of calculating these columns is to note that if the columns are
, then
, and
.
- Lemma: Let
be a matrix, with distinct eigenvalues
. Assume that there exists a polynomial
such that
and
. A is similar to a diagonal matrix with entries on the main diagonal equal to the roots of
.
- A matrix
is said to be strictly upper triangular if it has 1's on the superdiagonal (the diagonal above the main diagonal) and 0's elsewhere.
, similarly, is strictly lower triangular if it has 1's on the subdiagonal (the diagonal below the main diagonal) and 0's elsewhere.
- Lemma: Strictly triangular matrices shift when raised to integral powers. This is illustrated by the above example of U:
|
|
|
|
|---|---|---|
|
|
|
In other words, for the matrix
, the diagonal of 1's is
diagonals from the main diagonal. Consequently, for
in this example,
.
- Computing the powers of a jordan matrix is relatively straightforward. Considering the general Jordan Normal Form,
As this matrix is diagonal, we can say that
- The above reduces the problem to evaluating the powers of a jordan block. A Jordan block
can be expressed as
where
is the diagonal matrix containing the eigenvalues of
and
is a strictly upper triangular matrix. By the binomial theorem,
.
As
is a diagonal matrix, its powers are easily computed. The powers of
, as shown above, shift the diagonal of 1's. As
, the term
is a matrix with
in the diagonal
away from the main diagonal. This can be shown in the following (god-awful) matrix:
- Given the power series of a function
, e.g.
, we can determine the value of
where
is any square matrix by using the above theorem about Jordan Normal Forms. For example, in the above equation, instead of having to work out
, we can replace
by
, as
is similar to
, which reduced the problem to
, noting that
and that
and
are independent of the sum.
- For a vector space
and an operator
, a subspace
of
is called invariant if
.
- Let
and
. The Direct Sum of
and
is denoted
and is defined as the set
.
- Lemma: Given the direct sum of two subspaces
, every
can be uniquely represented as
, for
and
.
- Given a set of unknown functions
that have the property
| |
|
|---|---|---|
| |
|
| ||
| |
|
Or more compactly,
, where
is the matrix of the coefficients
. This system of equations is known as a system of differential equations.
- Lemma: Given initial conditions, namely
, there exists a unique solution to this system, namely
.
Euclidean Space
- A Euclidean Vector space is a vector space on which an inner product (as defined below) is defined.
- Given a real vector space
, a mapping
is called an inner product on
if the following properties hold for all
and
(note that for
, the inner product
is simply denoted
or sometimes
or
)
- Symmetry:
,
- Distribution:
,
- Homogeneity:
, and
- Positivity:
and
iff
- If an equation has, at most, a degree of 2, then it is called a quadratic form. In a vector space
, this can be thought of as a mapping from a vector to a real number. If this quadratic form is always non-negative, it is called a positive semi-definite quadratic form. If the quadratic form is equal to zero only when the vector is the zero vector, then it is called a positive definite quadratic form.
- Let
be a Euclidean vector space. For any
, the length of
is denoted
and is defined as
, where
is an inner product on
. Sometimes, this value is denoted
. Also, given a vector
, the vector
is the normalized form of
, which has a length of
.
- Lemma: Given two vectors
where
is a Euclidean vector space, the angle formed between these two vectors is given by
- Let
be an n-dimensional Euclidean vector space. A sequence of vectors
is an orthogonal basis of
if:
- the sequence
does not comprise entirely of the zero vector, and
-
for all
If
for all
in addition to the above properties, then
is an orthonormal basis of V
- Lemma: Any orthogonal basis in a Euclidean vector space
is a basis for
.
- Theorem: Any
-dimensional Euclidean vector space contains orthonormal bases.
- The process of finding an orthogonal basis is called the Gram-Schmidt Orthogonalisation process. The final step is to normalise the orthogonal basis to form an orthonormal basis. Given a basis
of an
-dimensional Euclidean vector space
, we form an orthogonal basis
by the following method.
- Let
,
- Let
for all
This gives an orthogonal basis for
. To find an orthonormal basis, we normalise the orthogonal basis. Thus
is an orthonormal basis.
Linear and Bilinear Forms
- A function
, where
is a vector space, is a linear form if:
-
for all
, and
-
for all
- A function
, where
is a vector space, is a bilinear form if:
-
for all
,
-
for all
, and
-
for all
, and
In other words, the function
must be a linear function in both
and
.
- Let
and
be two vectors in a vector space
. Given a basis
of
, and a bilinear mapping
, we note the following:
|
|
|---|
|
|
|
|
Thus, if we consider the matrix
, where
, then we say that the matrix
is the matrix of the bilinear form
relative to the basis
. Thus, the bilinear form of is given by
.
- Lemma: For any symmetric bilinear form, there exists a basis of this form where the matrix is diagonal.
- Given a bilinear form
on a vector space
, we denote the quadratic form of
by
and define
as
.
- Theorem: Any symmetric bilinear form can be uniquely determined from its quadratic form, namely that
.
- Theorem: For any quadratic form
, there exists a change of coordinates
where
| |
|
|---|---|---|
| |
|
| ||
| |
|
Such that
. This form is known as the signed sum of squares form.
- Bessel's Inequality: Let
be an orthonormal set of vectors in a Euclidian vector space
. For all
.
- Lemma: The set of continuous functions
is pairwise orthogonal with respect to the inner product
.
- Given a quadratic form in the signed sum of squares form, the signature of this form is given by
where
is the number of positive coefficients, and
the number of negative coefficients in the form. It is also referred to as inertia.
- Sylvester's Law of Inertia: The signature of a quadratic form does not depend on the method employed to bring this form to the signed sum of squares form. Namely, the signature of a quadratic orm does not depend on the basis chosen.
- Lemma: For any square symmetric matrix, these exists an orthonormal basis of eigenvectors.
- Given a Euclidian vector space
, the orthogonal complement of a vector
is given by
.
- Theorem: Let
be a vector space and let
be two symmetric bilinear forms on
. If
is positive definite, then there exists a basis of
such that both forms have diagonal matrices.
Complex Spaces
- A basic knowledge of complex numbers is required. For example, a complex number
, the conjugate of
is given by
, where
and
.
- Let
be a complex vector space and let
.
is a sesquilinear form if, for all
, and
.
-
,
-
,
-
, and
-
,
- A sesquilinear form
is Hermitan if,
. In parallel with the definition for a bilinear form, we can express a Hermitan form given a basis
. In particular, if
and
are vectors in a complex Vector space, then
where
is the matrix where the
entry is
.
- A complex vector space
is called a Hilbert space if it is equipped with a positive definite Hermitan form.
- The transpose of the conjugate matrix of a matrix
is called the adjoint matrix of A. Not to be confused with the matrix of cofactors. The adjoint of a matrix is written
. If
, then A is a self adjoint matrix.
- Lemma: For all matrices
and
with complex entries,
- If a matrix
has the property
, then we say that
is unitary.
- Given a matrix
, if
, then A is called a normal matrix.
- Lemma: Any two commutative operators
and
on a complex vector space
have a common eigenvector.
- Theorem: Any normal operator on a complex Hilbert space has an orthonormal basis of eigenvectors.
- Theorem: Given a vector space
, on which a Hermitan operator
is defined, there exists an orthonormal basis of
consisting of eigenvectors, where the corrsponding eigenvalues are real.
- Lemma: If
and
, then
.
- Theorem: If a matrix
with real coefficients satisfies the property that
, then
is diagonalisable over
.

=
and
or
