Matrix difference equation

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

A matrix difference equation is a difference equation in which the value of a vector (or sometimes, a matrix) of variables at one point in time is related to its own value at one or more previous points in time, using matrices.[1][2] The order of the equation is the maximum time gap between any two indicated values of the variable vector. For example,

𝐱t=𝐀𝐱tβˆ’1+𝐁𝐱tβˆ’2

is an example of a second-order matrix difference equation, in which x is an n Γ— 1 vector of variables and A and B are n Γ— n matrices. This equation is homogeneous because there is no vector constant term added to the end of the equation. The same equation might also be written as

𝐱t+2=𝐀𝐱t+1+𝐁𝐱t

or as

𝐱n=𝐀𝐱nβˆ’1+𝐁𝐱nβˆ’2

The most commonly encountered matrix difference equations are first-order.

Nonhomogeneous first-order case and the steady state

[edit | edit source]

An example of a nonhomogeneous first-order matrix difference equation is

𝐱t=𝐀𝐱tβˆ’1+𝐛

with additive constant vector b. The steady state of this system is a value x* of the vector x which, if reached, would not be deviated from subsequently. x* is found by setting xt = xtβˆ’1 = x* in the difference equation and solving for x* to obtain

𝐱*=[πˆβˆ’π€]βˆ’1𝐛

where I is the n Γ— n identity matrix, and where it is assumed that [I βˆ’ A] is invertible. Then the nonhomogeneous equation can be rewritten in homogeneous form in terms of deviations from the steady state:

[𝐱tβˆ’π±*]=𝐀[𝐱tβˆ’1βˆ’π±*]

Stability of the first-order case

[edit | edit source]

The first-order matrix difference equation [xt βˆ’ x*] = A[xtβˆ’1 βˆ’ x*] is stable—that is, xt converges asymptotically to the steady state x*—if and only if all eigenvalues of the transition matrix A (whether real or complex) have an absolute value which is less than 1.

Solution of the first-order case

[edit | edit source]

Assume that the equation has been put in the homogeneous form yt = Aytβˆ’1. Then we can iterate and substitute repeatedly from the initial condition y0, which is the initial value of the vector y and which must be known in order to find the solution:

𝐲1=𝐀𝐲0𝐲2=𝐀𝐲1=𝐀2𝐲0𝐲3=𝐀𝐲2=𝐀3𝐲0

and so forth, so that by mathematical induction the solution in terms of t is

𝐲t=𝐀t𝐲0

Further, if A is diagonalizable, we can rewrite A in terms of its eigenvalues and eigenvectors, giving the solution as

𝐲t=𝐏𝐃tπβˆ’1𝐲0,

where P is an n Γ— n matrix whose columns are the eigenvectors of A (assuming the eigenvalues are all distinct) and D is an n Γ— n diagonal matrix whose diagonal elements are the eigenvalues of A. This solution motivates the above stability result: At shrinks to the zero matrix over time if and only if the eigenvalues of A are all less than unity in absolute value.

Extracting the dynamics of a single scalar variable from a first-order matrix system

[edit | edit source]

Starting from the n-dimensional system yt = Aytβˆ’1, we can extract the dynamics of one of the state variables, say y1. The above solution equation for yt shows that the solution for y1,t is in terms of the n eigenvalues of A. Therefore the equation describing the evolution of y1 by itself must have a solution involving those same eigenvalues. This description intuitively motivates the equation of evolution of y1, which is

y1,t=a1y1,tβˆ’1+a2y1,tβˆ’2++any1,tβˆ’n

where the parameters ai are from the characteristic equation of the matrix A:

Ξ»nβˆ’a1Ξ»nβˆ’1βˆ’a2Ξ»nβˆ’2βˆ’βˆ’anΞ»0=0.

Thus each individual scalar variable of an n-dimensional first-order linear system evolves according to a univariate nth-degree difference equation, which has the same stability property (stable or unstable) as does the matrix difference equation.

Solution and stability of higher-order cases

[edit | edit source]

Matrix difference equations of higher order—that is, with a time lag longer than one period—can be solved, and their stability analyzed, by converting them into first-order form using a block matrix (matrix of matrices). For example, suppose we have the second-order equation

𝐱t=𝐀𝐱tβˆ’1+𝐁𝐱tβˆ’2

with the variable vector x being n Γ— 1 and A and B being n Γ— n. This can be stacked in the form

[𝐱t𝐱tβˆ’1]=[π€ππˆπŸŽ][𝐱tβˆ’1𝐱tβˆ’2]

where I is the n Γ— n identity matrix and 0 is the n Γ— n zero matrix. Then denoting the 2n Γ— 1 stacked vector of current and once-lagged variables as zt and the 2n Γ— 2n block matrix as L, we have as before the solution

𝐳t=𝐋t𝐳0

Also as before, this stacked equation, and thus the original second-order equation, are stable if and only if all eigenvalues of the matrix L are smaller than unity in absolute value.

Nonlinear matrix difference equations: Riccati equations

[edit | edit source]

In linear-quadratic-Gaussian control, there arises a nonlinear matrix equation for the reverse evolution of a current-and-future-cost matrix, denoted below as H. This equation is called a discrete dynamic Riccati equation, and it arises when a variable vector evolving according to a linear matrix difference equation is controlled by manipulating an exogenous vector in order to optimize a quadratic cost function. This Riccati equation assumes the following, or a similar, form:

𝐇tβˆ’1=𝐊+𝐀𝐇tπ€βˆ’π€π‡t𝐂[𝐂𝐇t𝐂+𝐑]βˆ’1𝐂𝐇t𝐀

where H, K, and A are n Γ— n, C is n Γ— k, R is k Γ— k, n is the number of elements in the vector to be controlled, and k is the number of elements in the control vector. The parameter matrices A and C are from the linear equation, and the parameter matrices K and R are from the quadratic cost function. See here for details.

In general this equation cannot be solved analytically for Ht in terms of t; rather, the sequence of values for Ht is found by iterating the Riccati equation. However, it has been shown[3] that this Riccati equation can be solved analytically if R = 0 and n = k + 1, by reducing it to a scalar rational difference equation; moreover, for any k and n if the transition matrix A is nonsingular then the Riccati equation can be solved analytically in terms of the eigenvalues of a matrix, although these may need to be found numerically.[4]

In most contexts the evolution of H backwards through time is stable, meaning that H converges to a particular fixed matrix H* which may be irrational even if all the other matrices are rational. See also Stochastic control Β§ Discrete time.

A related Riccati equation[5] is

𝐗t+1=βˆ’[𝐄+𝐁𝐗t][𝐂+𝐀𝐗t]βˆ’1

in which the matrices X, A, B, C, E are all n Γ— n. This equation can be solved explicitly. Suppose 𝐗t=𝐍t𝐃tβˆ’1, which certainly holds for t = 0 with N0 = X0 and with D0 = I. Then using this in the difference equation yields

𝐗t+1=βˆ’[𝐄+𝐁𝐍t𝐃tβˆ’1]𝐃t𝐃tβˆ’1[𝐂+𝐀𝐍t𝐃tβˆ’1]βˆ’1=βˆ’[𝐄𝐃t+𝐁𝐍t][[𝐂+𝐀𝐍t𝐃tβˆ’1]𝐃t]βˆ’1=βˆ’[𝐄𝐃t+𝐁𝐍t][𝐂𝐃t+𝐀𝐍t]βˆ’1=𝐍t+1𝐃t+1βˆ’1

so by induction the form 𝐗t=𝐍t𝐃tβˆ’1 holds for all t. Then the evolution of N and D can be written as

[𝐍t+1𝐃t+1]=[βˆ’πβˆ’π„π€π‚][𝐍t𝐃t]≑𝐉[𝐍t𝐃t]

Thus by induction

[𝐍t𝐃t]=𝐉t[𝐍0𝐃0]

See also

[edit | edit source]

References

[edit | edit source]
  1. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  2. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  3. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  4. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  5. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).