Characteristic polynomial

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

In linear algebra, the characteristic polynomial of a square matrix is a polynomial which is invariant under matrix similarity and has the eigenvalues as roots. It has the determinant and the trace of the matrix among its coefficients. The characteristic polynomial of an endomorphism of a finite-dimensional vector space is the characteristic polynomial of the matrix of that endomorphism over any basis (that is, the characteristic polynomial does not depend on the choice of a basis). The characteristic equation, also known as the determinantal equation,[1][2][3] is the equation obtained by equating the characteristic polynomial to zero.

In spectral graph theory, the characteristic polynomial of a graph is the characteristic polynomial of its adjacency matrix.[4]

Motivation

[edit | edit source]

Eigenvalues and eigenvectors play a fundamental role in linear algebra, since, given a linear transformation, an eigenvector is a vector whose direction is not changed by the transformation, and the corresponding eigenvalue is the measure of the resulting change of magnitude of the vector.

More precisely, suppose the transformation is represented by a square matrix A. Then an eigenvector 𝐯 and the corresponding eigenvalue λ must satisfy the equation A𝐯=λ𝐯, or, equivalently (since λ𝐯=λI𝐯), (λIA)𝐯=𝟎 where I is the identity matrix, and 𝐯𝟎 (although the zero vector satisfies this equation for every λ, it is not considered an eigenvector).

It follows that the matrix (λIA) must be singular, and its determinant det(λIA)=0 must be zero.

In other words, the eigenvalues of A are the roots of det(xIA), which is a monic polynomial in x of degree n if A is a n×n matrix. This polynomial is the characteristic polynomial of A.

Formal definition

[edit | edit source]

Consider an n×n matrix A. The characteristic polynomial of A, denoted by pA(t), is the polynomial defined by[5] pA(t)=det(tIA) where I denotes the n×n identity matrix.

Some authors define the characteristic polynomial to be det(AtI). That polynomial differs from the one defined here by a sign (1)n, so it makes no difference for properties like having as roots the eigenvalues of A; however the definition above always gives a monic polynomial, whereas the alternative definition is monic only when n is even.

Examples

[edit | edit source]

To compute the characteristic polynomial of the matrix A=(2110). the determinant of the following is computed: tIA=(t211t0) and found to be (t2)t1(1)=t22t+1, the characteristic polynomial of A.

Another example uses hyperbolic functions of a hyperbolic angle φ. For the matrix take A=(cosh(φ)sinh(φ)sinh(φ)cosh(φ)). Its characteristic polynomial is det(tIA)=(tcosh(φ))2sinh2(φ)=t22t cosh(φ)+1=(teφ)(teφ).

Properties

[edit | edit source]

The characteristic polynomial pA(t) of a n×n matrix A is monic (its leading coefficient is 1) and its degree is n. The most important fact about the characteristic polynomial was already mentioned in the motivational paragraph: the eigenvalues of A are precisely the roots of pA(t) (this also holds for the minimal polynomial of A, but its degree may be less than n). All coefficients of the characteristic polynomial are polynomial expressions in the entries of the matrix. In particular its constant coefficient of t0 is det(A)=(1)ndet(A), the coefficient of tn is 1, and the coefficient of tn1 is tr(−A) = −tr(A), where tr(A) is the trace of A. (The signs given here correspond to the formal definition given in the previous section; for the alternative definition these would instead be det(A) and (−1)n – 1 tr(A) respectively.[6])

For a 2×2 matrix A, the characteristic polynomial is thus given by t2tr(A)t+det(A).

Using the language of exterior algebra, the characteristic polynomial of an n×n matrix A may be expressed as pA(t)=k=0ntnk(1)ktr(kA) where tr(kA) is the trace of the kth exterior power of A, which has dimension (nk). This trace may be computed as the sum of all principal minors of A of size k. The recursive Faddeev–LeVerrier algorithm computes these coefficients more efficiently [clarification needed].

When the characteristic of the field of the coefficients is 0, each such trace may alternatively be computed as a single determinant, that of the k×k matrix, tr(kA)=1k!|trAk100trA2trAk20trAk1trAk21trAktrAk1trA|.

The Cayley–Hamilton theorem states that replacing t by A in the characteristic polynomial (interpreting the resulting powers as matrix powers, and the constant term c as c times the identity matrix) yields the zero matrix. Informally speaking, every matrix satisfies its own characteristic equation. This statement is equivalent to saying that the minimal polynomial of A divides the characteristic polynomial of A.

Two similar matrices have the same characteristic polynomial. The converse however is not true in general: two matrices with the same characteristic polynomial need not be similar.

The matrix A and its transpose have the same characteristic polynomial. A is similar to a triangular matrix if and only if its characteristic polynomial can be completely factored into linear factors over K (the same is true with the minimal polynomial instead of the characteristic polynomial). In this case A is similar to a matrix in Jordan normal form.

Characteristic polynomial of a product of two matrices

[edit | edit source]

If A and B are two square n×n matrices then characteristic polynomials of AB and BA coincide: pAB(t)=pBA(t).

Proof: If λ is a non-zero generalized eigenvalue of AB of algebraic multiplicity k, and v belongs to the kernel of (BAλ)k, then Av belongs to the kernel of (ABλ)k, so the non-zero generalized eigenspaces of AB and BA have the same dimension. Therefore, since AB and BA are both n×n, the remaining generalized eigenspaces, with eigenvalue 0, have the same dimension. Therefore AB and BA have the same characteristic polynomial, because all generalized eigenvalues are the same, with the same algebraic multiplicities.

More generally, if A is a matrix of order m×n and B is a matrix of order n×m, then AB is m×m and BA is n×n matrix, and one has pBA(t)=tnmpAB(t).

To prove this, one may suppose n>m, by exchanging, if needed, A and B. Then, by bordering A on the bottom by nm rows of zeros, and B on the right, by, nm columns of zeros, one gets two n×n matrices A and B such that BA=BA and AB is equal to AB bordered by nm rows and columns of zeros. The result follows from the case of square matrices, by comparing the characteristic polynomials of AB and AB.

Characteristic polynomial of Ak

[edit | edit source]

If λ is an eigenvalue of a square matrix A with eigenvector 𝐯, then λk is an eigenvalue of Ak because Ak𝐯=Ak1A𝐯=λAk1𝐯==λk𝐯.

The multiplicities can be shown to agree as well, and this generalizes to any polynomial in place of xk:[7]

Theorem Let A be a square n×n matrix and let f(t) be a polynomial. If the characteristic polynomial of A has a factorization pA(t)=(tλ1)(tλ2)(tλn) then the characteristic polynomial of the matrix f(A) is given by pf(A)(t)=(tf(λ1))(tf(λ2))(tf(λn)).

That is, the algebraic multiplicity of λ in f(A) equals the sum of algebraic multiplicities of λ in A over λ such that f(λ)=λ. In particular, tr(f(A))=i=1nf(λi) and det(f(A))=i=1nf(λi). Here a polynomial f(t)=t3+1, for example, is evaluated on a matrix A simply as f(A)=A3+I.

The theorem applies to matrices and polynomials over any field or commutative ring.[8] However, the assumption that pA(t) has a factorization into linear factors is not always true, unless the matrix is over an algebraically closed field such as the complex numbers.

Proof

This proof only applies to matrices and polynomials over complex numbers (or any algebraically closed field). In that case, the characteristic polynomial of any square matrix can be always factorized as pA(t)=(tλ1)(tλ2)(tλn) where λ1,λ2,,λn are the eigenvalues of A, possibly repeated. Moreover, the Jordan decomposition theorem guarantees that any square matrix A can be decomposed as A=S1US, where S is an invertible matrix and U is upper triangular with λ1,,λn on the diagonal (with each eigenvalue repeated according to its algebraic multiplicity). (The Jordan normal form has stronger properties, but these are sufficient; alternatively the Schur decomposition can be used, which is less popular but somewhat easier to prove).

Let f(t)=iαiti. Then f(A)=αi(S1US)i=αiS1USS1USS1US=αiS1UiS=S1(αiUi)S=S1f(U)S. For an upper triangular matrix U with diagonal λ1,,λn, the matrix Ui is upper triangular with diagonal λ1i,,λni in Ui, and hence f(U) is upper triangular with diagonal f(λ1),,f(λn). Therefore, the eigenvalues of f(U) are f(λ1),,f(λn). Since f(A)=S1f(U)S is similar to f(U), it has the same eigenvalues, with the same algebraic multiplicities.

Secular function and secular equation

[edit | edit source]

Secular function

[edit | edit source]

The term secular function has been used for what is now called characteristic polynomial (in some literature the term secular function is still used). The term comes from the fact that the characteristic polynomial was used to calculate secular perturbations (on a time scale of a century, that is, slow compared to annual motion) of planetary orbits, according to Lagrange's theory of oscillations.

Secular equation

[edit | edit source]

Secular equation may have several meanings.

  • In linear algebra it is sometimes used in place of characteristic equation.
  • In astronomy it is the algebraic or numerical expression of the magnitude of the inequalities in a planet's motion that remain after the inequalities of a short period have been allowed for.[9]
  • In molecular orbital calculations relating to the energy of the electron and its wave function it is also used instead of the characteristic equation.

For general associative algebras

[edit | edit source]

The above definition of the characteristic polynomial of a matrix AMn(F) with entries in a field F generalizes without any changes to the case when F is just a commutative ring. Garibaldi (2004) defines the characteristic polynomial for elements of an arbitrary finite-dimensional (associative, but not necessarily commutative) algebra over a field F and proves the standard properties of the characteristic polynomial in this generality.

See also

[edit | edit source]

References

[edit | edit source]
  1. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  2. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  3. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  4. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  5. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  6. ^ Theorem 4 in these lecture notes
  7. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  8. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  9. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  • T.S. Blyth & E.F. Robertson (1998) Basic Linear Algebra, p 149, Springer Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). .
  • John B. Fraleigh & Raymond A. Beauregard (1990) Linear Algebra 2nd edition, p 246, Addison-Wesley Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). .
  • Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  • Werner Greub (1974) Linear Algebra 4th edition, pp 120–5, Springer, Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). .
  • Paul C. Shields (1980) Elementary Linear Algebra 3rd edition, p 274, Worth Publishers Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). .
  • Gilbert Strang (1988) Linear Algebra and Its Applications 3rd edition, p 246, Brooks/Cole Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). .