Existence, Unicity

  • Set of solutions to a linear equations

  • Existence: the range and rank of a matrix

  • Unicity: the nullspace and nullity of a matrix

  • Fundamental facts about range and nullspace

Consider the linear equation in x in mathbf{R}^n:

 Ax = y,

where A in mathbf{R}^{m times n} and y in mathbf{R}^m are given, and x in mathbf{R}^n is the variable.

The set of solutions to the above equation, if it is not empty, is an affine subspace. That is, it is of the form x_0 +mathbf{L} where mathbf{L} is a subspace.

We'd like to be able to

  • determine if a solution exists;

  • if so, determine if it is unique;

  • compute a solution x_0 if one exists;

  • find an orthonormal basis of the subspace mathbf{L}.

Existence: range and rank of a matrix

Range

The range (or, image) of a m times n matrix A is defined as the following subset of mathbf{R}^m:

 mathbf{R}(A): = left{ Ax ~:~ x in mathbf{R}^n right} .

The range describes the vectors y=Ax that can be attained in the output space by an arbitrary choice of a vector x in the input space. The range is simply the span of the columns of A.

If y notin mathbf{R}(A), we say that the linear equation Ax = y is infeasible. The set of solutions to the linear equation is empty.

The matlab function orth accepts a matrix A as input, and returns a matrix, the columns of which span the range of the matrix A, and are mutually orthogonal. Hence, U^TU = I_r, where r is the dimension of the range. One algorithm to obtain the matrix U is the Gram-Schmidt procedure.

Matlab syntax
>> U = orth(A); % columns of U span the range of A, and U'*U = identity

Example:

Rank

The dimension of the range is called the rank of the matrix. As we will see later, the rank cannot exceed any one of the dimensions of the matrix A: r le min(m,n). A matrix is said to be full rank if r = min(m,n).

Matlab syntax
r = rank(A); % r is the rank of A

Note that the rank is a very ‘‘brittle’’ notion, in that small changes in the entries of the matrix can dramatically change its rank. Random matrices, such as ones generated using the Matlab command rand, are full rank. We will develop here a better, more numerically reliable notion.

Examples:

Full row rank matrices

The matrix A is said to be full row rank (or, onto) if the range is the whole output space, mathbf{R}^m. The name ‘‘full row rank’’ comes from the fact that the rank equals the row dimension of A. Since the rank is always less than the smallest of the number of columns and rows, a m times n matrix of full row rank has necessarily less rows than columns (that is, m le n).

An equivalent condition for A to be full row rank is that the square, m times m matrix AA^T is invertible, meaning that it has full rank, m. Proof.

Unicity: nullspace of a matrix

Nullspace

The nullspace (or, kernel) of a m times n matrix A is the following subspace of mathbf{R}^n:

 mathbf{N}(A) := left{ x in mathbf{R}^n ~:~ Ax = 0 right} .

The nullspace describes the ambiguity in x given y=Ax: any z in mathbf{N}(A) will be such that A(x+z) = y, so x cannot be determined by the sole knowledge of y if the nullspace is not reduced to the singleton {0}.

The matlab function null accepts a matrix A as input, and returns a matrix, the columns of which span the nullspace of the matrix A, and are mutually orthogonal. Hence, U^TU = I_p, where p is the dimension of the nullspace.

Matlab syntax
U = null(A); % columns of U span the nullspace of A, and U'*U = I

Example:

  • Nullspace of a simple matrix.

Nullity

The nullity of a matrix is the dimension of the nullspace. The rank-nullity theorem states that the nullity of a m times n matrix A is n-r, where r is the rank of A.

Full column rank matrices

The matrix A is said to be full column rank (or, one-to-one) if its nullspace is the singleton {0}. In this case, if we denote by a_i the n columns of A, the equation

 (Ax = ) sum_{i=1}^n a_i x_i = 0

has x=0 as the unique solution. Hence, A is one-to-one if and only if its columns are independent. Since the rank is always less than the smallest of the number of columns and rows, a m times n matrix of full column rank has necessarily less columns than rows (that is, m ge n).

The term ‘‘one-to-one’’ comes from the fact that for such matrices, the condition y = Ax uniquely determines x, since Ax_1 = y and Ax_2 = y implies A(x_1-x_2) = 0, so that the solution is unique: x_1 = x_2. The name ‘‘full column rank’’ comes from the fact that the rank equals the column dimension of A.

An equivalent condition for A to be full column rank is that the square, n times n matrix A^TA is invertible, meaning that it has full rank, n. (Proof)

Example: Nullspace of a transpose incidence matrix.

Fundamental facts

Two important results about the nullspace and range of a matrix.

Rank-nullity theorem

The nullity (dimension of the nullspace) and the rank (dimension of the range) of a m times n matrix A add up to the column dimension of A, n.

Proof.

Another important result is involves the definition of the orthogonal complement of a subspace.

Fundamental theorem of linear algebra

The range of a matrix is the orthogonal complement of the nullspace of its transpose. That is, for a m times n matrix A:

 mbox{bf R}(A)^perp  = mbox{bf N}(A^T) .

Proof.

alt text 

The figure provides a sketch of the proof: consider a 3 times 2 matrix, and denote by a_i in mathbf{R}^3 (i=1,2) its rows, so that

 A= left( begin{array}{cc} a_1 & a_2 end{array} right) , ;; A^T = left( begin{array}{c} a_1^T a_2^T end{array} right).

Then A^Tx = 0 if and only if a_i^Tx = 0, i=1,2. In words: x is in the nullspace of A^T if and only if it is orthogonal to the vectors a_i, i=1,2. But those two vectors span the range of A, hence x is orthogonal to any element in the range.