# Projection (linear algebra)

Template:Icosahedron visualizations

In linear algebra and functional analysis, a **projection** is a linear transformation *P* from a vector space to itself such that *P*^{2} = *P*. That is, whenever *P* is applied twice to any value, it gives the same result as if it were applied once (idempotent). It leaves its image unchanged.^{[1]} Though abstract, this definition of "projection" formalizes and generalizes the idea of graphical projection. One can also consider the effect of a projection on a geometrical object by examining the effect of the projection on points in the object.

## Contents

## Simple example

### Orthogonal projection

For example, the function which maps the point (*x*, *y*, *z*) in three-dimensional space **R**^{3} to the point (*x*, *y*, 0) is an orthogonal projection onto the *x*–*y* plane. This function is represented by the matrix

The action of this matrix on an arbitrary vector is

To see that *P* is indeed a projection, i.e., *P* = *P*^{2}, we compute

### Oblique projection

A simple example of a non-orthogonal (oblique) projection (for definition see below) is

Via matrix multiplication, one sees that

proving that *P* is indeed a projection.

The projection P is orthogonal if and only if *α* = 0.

## Properties and classification

Let *W* be a finite dimensional vector space and *P* be a projection on *W*. Suppose the subspaces *U* and *V* are the range and kernel of *P* respectively.
Then *P* has the following basic properties:

- By definition,
*P*is idempotent (i.e.*P*^{2}=*P*). *P*is the identity operator*I*on*U*- We have a direct sum
*W*=*U*⊕*V*. Every vector*x*in*W*may be decomposed uniquely as*x*=*u*+*v*with*u*=*Px*and*v*=*x*−*Px*= (*I*−*P*)*x*, and where*u*is in*U*and*v*is in*V*.

The range and kernel of a projection are *complementary*, as are *P* and *Q* = *I* − *P*. The operator *Q* is also a projection and the range and kernel of P become the kernel and range of Q and vice-versa. We say *P* is a projection along *V* onto *U* (kernel/range) and *Q* is a projection along *U* onto *V*.

In infinite dimensional vector spaces spectrum of a projection is contained in {0, 1}, as

Only 0 and 1 can be an eigenvalue of a projection. The corresponding eigenspaces are (respectively) the kernel and range of the projection. Decomposition of a vector space into direct sums is not unique in general. Therefore, given a subspace *V*, there may be many projections whose range (or kernel) is *V*.

If a projection is nontrivial it has minimal polynomial *X*^{2} − *X* = *X*(*X* − *I*), which factors into distinct roots, and thus *P* is diagonalizable.

The product of projections is not, in general, a projection, even if they are orthogonal. If projections commute, then their product is a projection.

### Orthogonal projections

When the vector space *W* has an inner product and is complete (is a Hilbert space) the concept of orthogonality can be used. An **orthogonal projection** is a projection for which the range *U* and the null space *V* are orthogonal subspaces. Thus, for every *x* and *y* in *W*, . Equivalently:

A projection is orthogonal if and only if it is self-adjoint. Using the self-adjoint and idempotent properties of *P*, for any *x* and *y* in *W* we have *Px* ∈ *U*, *y* − *Py* ∈ *V*, and

where is the inner product associated with *W*. Therefore, *Px* and *y* − *Py* are orthogonal.^{[2]}
The other direction, namely that if *P* is orthogonal then it is self-adjoint, follows from

for every *x* and *y* in *W*; thus *P* = *P*^{*}.

Proof of existence Let

*H*be a complete metric space with an inner product, and let*U*be a closed linear subspace of*H*(and hence complete as well).For every

*x*the following set of non-negative norms has an infimum, and due to the completeness of*U*it is a minimum. We define*Px*as the point in*U*were this minimum is obtained.Obviously

*Px*is in*U*. It remains to show that*Px*satisfies <*x*−*Px*,*Px*> = 0 and that it is linear.Let us define . For every non-zero

*v*in*U*, the following holds:By defining we see that unless vanishes. Since

*Px*was chosen as the minimum of the abovementioned set, it follows that indeed vanishes. In particular, (for*v*=*Px*): .Linearity follows from the vanishing of for every

*v*in*U*:By taking the difference between the equations we have

But since we may choose

*v*=*Px*+*Py*−*P*(*x*+*y*) (as it is itself in*U*) it follows that*Px*+*Py*=*P*(*x*+*y*). Similarly we have*λPx*=*P*(*λx*) for every scalar*λ*.

#### Properties and special cases

An orthogonal projection is a bounded operator. This is because for every *v* in the vector space we have, by Cauchy–Schwarz inequality:

For finite dimensional complex or real vector spaces, the standard inner product can be substituted for .

##### Formulas

A simple case occurs when the orthogonal projection is onto a line. If *u* is a unit vector on the line, then the projection is given by

This operator leaves *u* invariant, and it annihilates all vectors orthogonal to *u*, proving that it is indeed the orthogonal projection onto the line containing *u*.^{[3]} A simple way to see this is to consider an arbitrary vector as the sum of a component on the line (i.e. the projected vector we seek) and another perpendicular to it, . Applying projection, we get

by the properties of the dot product of parallel and perpendicular vectors.

This formula can be generalized to orthogonal projections on a subspace of arbitrary dimension. Let *u*_{1}, ..., *u*_{k} be an orthonormal basis of the subspace *U*, and let *A* denote the *n*-by-*k* matrix whose columns are *u*_{1}, ..., *u*_{k}. Then the projection is given by

which can be rewritten as

The matrix *A*^{T} is the partial isometry that vanishes on the orthogonal complement of *U* and *A* is the isometry that embeds *U* into the underlying vector space. The range of *P _{A}* is therefore the

*final space*of

*A*. It is also clear that

*A*·

*A*

^{T}is the identity operator on

*U*.

The orthonormality condition can also be dropped. If *u*_{1}, ..., *u*_{k} is a (not necessarily orthonormal) basis, and *A* is the matrix with these vectors as columns, then the projection is

The matrix *A* still embeds *U* into the underlying vector space but is no longer an isometry in general. The matrix (*A*^{T}*A*)^{−1} is a "normalizing factor" that recovers the norm. For example, the rank-1 operator *uu*^{T} is not a projection if ||*u*|| ≠ 1. After dividing by *u*^{T}*u* = ||*u*||^{2}, we obtain the projection *u*(*u*^{T}*u*)^{−1}*u*^{T} onto the subspace spanned by *u*.

When the range space of the projection is generated by a frame (i.e. the number of generators is greater than its dimension), the formula for the projection takes the form
. Here *A*^{+} stands for the Moore–Penrose pseudoinverse. This is just one of many ways to construct the projection operator.

If a matrix is non-singular and *A*^{T} *B* = 0 (i.e., *B* is the null space matrix of *A*),^{[6]} the following holds:

If the orthogonal condition is enhanced to *A*^{T} *W* *B* = *A*^{T} *W*^{T} *B* = 0 with *W* being non-singular, the following holds:

All these formulas also hold for complex inner product spaces, provided that the conjugate transpose is used instead of the transpose.

### Oblique projections

The term *oblique projections* is sometimes used to refer to non-orthogonal projections. These projections are also used to represent spatial figures in two-dimensional drawings (see oblique projection), though not as frequently as orthogonal projections.

Oblique projections are defined by their range and null space. A formula for the matrix representing the projection with a given range and null space can be found as follows. Let the vectors *u*_{1}, ..., *u*_{k} form a basis for the range of the projection, and assemble these vectors in the *n*-by-*k* matrix *A*. The range and the null space are complementary spaces, so the null space has dimension *n* − *k*. It follows that the orthogonal complement of the null space has dimension *k*. Let *v*_{1}, ..., *v*_{k} form a basis for the orthogonal complement of the null space of the projection, and assemble these vectors in the matrix *B*. Then the projection is defined by

This expression generalizes the formula for orthogonal projections given above.^{[7]}

## Canonical forms

Any projection *P* = *P*^{2} on a vector space of dimension *d* over a field is a diagonalizable matrix, since its minimal polynomial is *x*^{2} − *x*, which splits into distinct linear factors. Thus there exists a basis in which *P* has the form

where *r* is the rank of *P*. Here *I*_{r} is the identity matrix of size *r*, and 0_{d−r} is the zero matrix of size *d* − *r*. If the vector space is complex and equipped with an inner product, then there is an *orthonormal* basis in which the matrix of *P* is^{[8]}

where *σ*_{1} ≥ *σ*_{2} ≥ ... ≥ *σ*_{k} > 0. The integers *k*, *s*, *m* and the real numbers are uniquely determined. Note that 2*k* + *s* + *m* = *d*. The factor *I*_{m} ⊕ 0_{s} corresponds to the maximal invariant subspace on which *P* acts as an *orthogonal* projection (so that *P* itself is orthogonal if and only if *k* = 0) and the *σ*_{i}-blocks correspond to the *oblique* components.

## Projections on normed vector spaces

When the underlying vector space *X* is a (not necessarily finite-dimensional) normed vector space, analytic questions, irrelevant in the finite-dimensional case, need to be considered. Assume now *X* is a Banach space.

Many of the algebraic notions discussed above survive the passage to this context. A given direct sum decomposition of *X* into complementary subspaces still specifies a projection, and vice versa. If *X* is the direct sum *X* = *U* ⊕ *V*, then the operator defined by *P*(*u* + *v*) = *u* is still a projection with range *U* and kernel *V*. It is also clear that *P*^{2} = *P*. Conversely, if *P* is projection on *X*, i.e. *P*^{2} = *P*, then it is easily verified that (*I* − *P*)^{2} = (*I* − *P*). In other words, (*I* − *P*) is also a projection. The relation *I* = *P* + (*I* − *P*) implies *X* is the direct sum Ran(*P*) ⊕ Ran(*I* − *P*).

However, in contrast to the finite-dimensional case, projections need not be continuous in general. If a subspace *U* of *X* is not closed in the norm topology, then projection onto *U* is not continuous. In other words, the range of a continuous projection *P* must be a closed subspace. Furthermore, the kernel of a continuous projection (in fact, a continuous linear operator in general) is closed. Thus a *continuous* projection *P* gives a decomposition of *X* into two complementary *closed* subspaces: *X* = ran(*P*) ⊕ ker(*P*) = ker(*I* − *P*) ⊕ ker(*P*).

The converse holds also, with an additional assumption. Suppose *U* is a closed subspace of *X*. *If* there exists a closed subspace *V* such that *X* = *U* ⊕ *V*, then the projection *P* with range *U* and kernel *V* is continuous. This follows from the closed graph theorem. Suppose *x _{n}* →

*x*and

*Px*→

_{n}*y*. One needs to show that

*Px*=

*y*. Since

*U*is closed and {

*Px*} ⊂

_{n}*U*,

*y*lies in

*U*, i.e.

*Py*=

*y*. Also,

*x*−

_{n}*Px*= (

_{n}*I*−

*P*)

*x*→

_{n}*x*−

*y*. Because

*V*is closed and {(

*I*−

*P*)

*x*} ⊂

_{n}*V*, we have

*x*−

*y*∈

*V*, i.e.

*P*(

*x*−

*y*) =

*Px*−

*Py*=

*Px*−

*y*= 0, which proves the claim.

The above argument makes use of the assumption that both *U* and *V* are closed. In general, given a closed subspace *U*, there need not exist a complementary closed subspace *V*, although for Hilbert spaces this can always be done by taking the orthogonal complement. For Banach spaces, a one-dimensional subspace always has a closed complementary subspace. This is an immediate consequence of Hahn–Banach theorem. Let *U* be the linear span of *u*. By Hahn–Banach, there exists a bounded linear functional *φ* such that *φ*(*u*) = 1. The operator *P*(*x*) = *φ*(*x*)*u* satisfies *P*^{2} = *P*, i.e. it is a projection. Boundedness of *φ* implies continuity of *P* and therefore ker(*P*) = ran(*I* − *P*) is a closed complementary subspace of *U*.

## Applications and further considerations

Projections (orthogonal and otherwise) play a major role in algorithms for certain linear algebra problems:

- QR decomposition (see Householder transformation and Gram–Schmidt decomposition);
- Singular value decomposition
- Reduction to Hessenberg form (the first step in many eigenvalue algorithms).
- Linear regression

As stated above, projections are a special case of idempotents. Analytically, orthogonal projections are non-commutative generalizations of characteristic functions. Idempotents are used in classifying, for instance, semisimple algebras, while measure theory begins with considering characteristic functions of measurable sets. Therefore, as one can imagine, projections are very often encountered in the context operator algebras. In particular, a von Neumann algebra is generated by its complete lattice of projections.

## Generalizations

More generally, given a map between normed vector spaces one can analogously ask for this map to be an isometry on the orthogonal complement of the kernel: that be an isometry; in particular it must be onto. The case of an orthogonal projection is when *W* is a subspace of *V.* In Riemannian geometry, this is used in the definition of a Riemannian submersion.

## See also

- Centering matrix, which is an example of a projection matrix.
- Orthogonalization
- Invariant subspace
- Properties of trace
- Dykstra's projection algorithm to compute the projection onto an intersection of sets

## Notes

## References

- {{#invoke:citation/CS1|citation

|CitationClass=book }}

- {{#invoke:citation/CS1|citation

|CitationClass=book }}

## External links

- MIT Linear Algebra Lecture on Projection Matrices at Google Video, from MIT OpenCourseWare
- Planar Geometric Projections Tutorial – a simple-to-follow tutorial explaining the different types of planar geometric projections.

{{#invoke: Navbox | navbox }}