Multivariate random variable

From Wikipedia, the free encyclopedia
(Redirected from Random vector)
Jump to navigation Jump to search

In probability and statistics, a multivariate random variable or random vector is a list or vector of mathematical variables each of whose value is unknown, either because the value has not yet occurred or because there is imperfect knowledge of its value. The individual variables in a random vector are grouped together because they are all part of a single mathematical system โ€” often they represent different properties of an individual statistical unit. For example, while a given person has a specific age, height and weight, the representation of these features of an unspecified person from within a group would be a random vector. Normally each element of a random vector is a real number.

Random vectors are often used as the underlying implementation of various types of aggregate random variables, e.g. a random matrix, random tree, random sequence, stochastic process, etc.

Formally, a multivariate random variable is a column vector ๐—=(X1,,Xn)๐–ณ (or its transpose, which is a row vector) whose components are random variables on the probability space (ฮฉ,โ„ฑ,P), where ฮฉ is the sample space, โ„ฑ is the sigma-algebra (the collection of all events), and P is the probability measure (a function returning each event's probability).

Probability distribution

[edit | edit source]

Every random vector gives rise to a probability measure on โ„n with the Borel algebra as the underlying sigma-algebra. This measure is also known as the joint probability distribution, the joint distribution, or the multivariate distribution of the random vector.

The distributions of each of the component random variables Xi are called marginal distributions. The conditional probability distribution of Xi given Xj is the probability distribution of Xi when Xj is known to be a particular value.

The cumulative distribution function F๐—:โ„nโ†ฆ[0,1] of a random vector ๐—=(X1,,Xn)๐–ณ is defined as[1]: p.15 

where ๐ฑ=(x1,,xn)๐–ณ.

Operations on random vectors

[edit | edit source]

Random vectors can be subjected to the same kinds of algebraic operations as can non-random vectors: addition, subtraction, multiplication by a scalar, and the taking of inner products.

Affine transformations

[edit | edit source]

Similarly, a new random vector ๐˜ can be defined by applying an affine transformation g:โ„nโ†’โ„n to a random vector ๐—:

๐˜=๐€๐—+b, where ๐€ is an nร—n matrix and b is an nร—1 column vector.

If ๐€ is an invertible matrix and ๐— has a probability density function f๐—, then the probability density of ๐˜ is

f๐˜(y)=f๐—(๐€โˆ’1(yโˆ’b))|det๐€|.

Invertible mappings

[edit | edit source]

More generally we can study invertible mappings of random vectors.[2]: p.284โ€“285 

Let g be a one-to-one mapping from an open subset ๐’Ÿ of โ„n onto a subset โ„› of โ„n, let g have continuous partial derivatives in ๐’Ÿ and let the Jacobian determinant det(โˆ‚๐ฒโˆ‚๐ฑ) of g be zero at no point of ๐’Ÿ. Assume that the real random vector ๐— has a probability density function f๐—(๐ฑ) and satisfies P(๐—โˆˆ๐’Ÿ)=1. Then the random vector ๐˜=g(๐—) is of probability density

f๐˜(๐ฒ)=f๐—(๐ฑ)|det(โˆ‚๐ฒโˆ‚๐ฑ)||๐ฑ=gโˆ’1(๐ฒ)๐Ÿ(๐ฒโˆˆR๐˜)

where ๐Ÿ denotes the indicator function and set R๐˜={๐ฒ=g(๐ฑ):f๐—(๐ฑ)>0}โІโ„› denotes support of ๐˜.

Expected value

[edit | edit source]

The expected value or mean of a random vector ๐— is a fixed vector E[๐—] whose elements are the expected values of the respective random variables.[3]: p.333 

Covariance and cross-covariance

[edit | edit source]

Definitions

[edit | edit source]

The covariance matrix (also called second central moment or variance-covariance matrix) of an nร—1 random vector is an nร—n matrix whose (i,j)th element is the covariance between the i th and the j th random variables. The covariance matrix is the expected value, element by element, of the nร—n matrix computed as [๐—โˆ’E[๐—]][๐—โˆ’E[๐—]]T, where the superscript T refers to the transpose of the indicated vector:[2]: p. 464 [3]: p.335 

By extension, the cross-covariance matrix between two random vectors ๐— and ๐˜ (๐— having n elements and ๐˜ having p elements) is the nร—p matrix[3]: p.336 

where again the matrix expectation is taken element-by-element in the matrix. Here the (i,j)th element is the covariance between the i th element of ๐— and the j th element of ๐˜.

Properties

[edit | edit source]

The covariance matrix is a symmetric matrix, i.e.[2]: p. 466 

K๐—๐—T=K๐—๐—.

The covariance matrix is a positive semidefinite matrix, i.e.[2]: p. 465 

๐šTK๐—๐—๐šโ‰ฅ0for all ๐šโˆˆโ„n.

The cross-covariance matrix Cov[๐˜,๐—] is simply the transpose of the matrix Cov[๐—,๐˜], i.e.

K๐˜๐—=K๐—๐˜T.

Uncorrelatedness

[edit | edit source]

Two random vectors ๐—=(X1,...,Xm)T and ๐˜=(Y1,...,Yn)T are called uncorrelated if

E[๐—๐˜T]=E[๐—]E[๐˜]T.

They are uncorrelated if and only if their cross-covariance matrix K๐—๐˜ is zero.[3]: p.337 

Correlation and cross-correlation

[edit | edit source]

Definitions

[edit | edit source]

The correlation matrix (also called second moment) of an nร—1 random vector is an nร—n matrix whose (i,j)th element is the correlation between the i th and the j th random variables. The correlation matrix is the expected value, element by element, of the nร—n matrix computed as ๐—๐—T, where the superscript T refers to the transpose of the indicated vector:[4]: p.190 [3]: p.334 

By extension, the cross-correlation matrix between two random vectors ๐— and ๐˜ (๐— having n elements and ๐˜ having p elements) is the nร—p matrix

Properties

[edit | edit source]

The correlation matrix is related to the covariance matrix by

R๐—๐—=K๐—๐—+E[๐—]E[๐—]T.

Similarly for the cross-correlation matrix and the cross-covariance matrix:

R๐—๐˜=K๐—๐˜+E[๐—]E[๐˜]T

Orthogonality

[edit | edit source]

Two random vectors of the same size ๐—=(X1,...,Xn)T and ๐˜=(Y1,...,Yn)T are called orthogonal if

E[๐—T๐˜]=0.

Independence

[edit | edit source]

Two random vectors ๐— and ๐˜ are called independent if for all ๐ฑ and ๐ฒ

F๐—,๐˜(๐ฑ,๐ฒ)=F๐—(๐ฑ)โ‹…F๐˜(๐ฒ)

where F๐—(๐ฑ) and F๐˜(๐ฒ) denote the cumulative distribution functions of ๐— and ๐˜ andF๐—,๐˜(๐ฑ,๐ฒ) denotes their joint cumulative distribution function. Independence of ๐— and ๐˜ is often denoted by ๐—โŠฅโŠฅ๐˜. Written component-wise, ๐— and ๐˜ are called independent if for all x1,โ€ฆ,xm,y1,โ€ฆ,yn

FX1,โ€ฆ,Xm,Y1,โ€ฆ,Yn(x1,โ€ฆ,xm,y1,โ€ฆ,yn)=FX1,โ€ฆ,Xm(x1,โ€ฆ,xm)โ‹…FY1,โ€ฆ,Yn(y1,โ€ฆ,yn).

Characteristic function

[edit | edit source]

The characteristic function of a random vector ๐— with n components is a function โ„nโ†’โ„‚ that maps every vector ๐Ž=(ฯ‰1,โ€ฆ,ฯ‰n)T to a complex number. It is defined by[2]: p. 468 

ฯ†๐—(๐Ž)=E[ei(๐ŽT๐—)]=E[ei(ฯ‰1X1+โ€ฆ+ฯ‰nXn)].

Further properties

[edit | edit source]

Expectation of a quadratic form

[edit | edit source]

One can take the expectation of a quadratic form in the random vector ๐— as follows:[5]: p.170โ€“171 

E[๐—TA๐—]=E[๐—]TAE[๐—]+tr(AK๐—๐—),

where K๐—๐— is the covariance matrix of ๐— and tr refers to the trace of a matrix โ€” that is, to the sum of the elements on its main diagonal (from upper left to lower right). Since the quadratic form is a scalar, so is its expectation.

Proof: Let ๐ณ be an mร—1 random vector with E[๐ณ]=ฮผ and Cov[๐ณ]=V and let A be an mร—m non-stochastic matrix.

Then based on the formula for the covariance, if we denote ๐ณT=๐— and ๐ณTAT=๐˜, we see that:

Cov[๐—,๐˜]=E[๐—๐˜T]โˆ’E[๐—]E[๐˜]T

Hence

E[XYT]=Cov[X,Y]+E[X]E[Y]TE[zTAz]=Cov[zT,zTAT]+E[zT]E[zTAT]T=Cov[zT,zTAT]+ฮผT(ฮผTAT)T=Cov[zT,zTAT]+ฮผTAฮผ,

which leaves us to show that

Cov[zT,zTAT]=tr(AV).

This is true based on the fact that one can cyclically permute matrices when taking a trace without changing the result (e.g.: tr(AB)=tr(BA)).

We see that

Cov[zT,zTAT]=E[(zTโˆ’E(zT))(zTATโˆ’E(zTAT))T]=E[(zTโˆ’ฮผT)(zTATโˆ’ฮผTAT)T]=E[(zโˆ’ฮผ)T(Azโˆ’Aฮผ)].

And since

(zโˆ’ฮผ)T(Azโˆ’Aฮผ)

is a scalar, then

(zโˆ’ฮผ)T(Azโˆ’Aฮผ)=tr((zโˆ’ฮผ)T(Azโˆ’Aฮผ))=tr((zโˆ’ฮผ)TA(zโˆ’ฮผ))

trivially. Using the permutation we get:

tr((zโˆ’ฮผ)TA(zโˆ’ฮผ))=tr(A(zโˆ’ฮผ)(zโˆ’ฮผ)T),

and by plugging this into the original formula we get:

Cov[zT,zTAT]=E[(zโˆ’ฮผ)T(Azโˆ’Aฮผ)]=E[tr(A(zโˆ’ฮผ)(zโˆ’ฮผ)T)]=tr(Aโ‹…E((zโˆ’ฮผ)(zโˆ’ฮผ)T))=tr(AV).

Expectation of the product of two different quadratic forms

[edit | edit source]

One can take the expectation of the product of two different quadratic forms in a zero-mean Gaussian random vector ๐— as follows:[5]: pp. 162โ€“176 

E[(๐—TA๐—)(๐—TB๐—)]=2tr(AK๐—๐—BK๐—๐—)+tr(AK๐—๐—)tr(BK๐—๐—)

where again K๐—๐— is the covariance matrix of ๐—. Again, since both quadratic forms are scalars and hence their product is a scalar, the expectation of their product is also a scalar.

Applications

[edit | edit source]

Portfolio theory

[edit | edit source]

In portfolio theory in finance, an objective often is to choose a portfolio of risky assets such that the distribution of the random portfolio return has desirable properties. For example, one might want to choose the portfolio return having the lowest variance for a given expected value. Here the random vector is the vector ๐ซ of random returns on the individual assets, and the portfolio return p (a random scalar) is the inner product of the vector of random returns with a vector w of portfolio weights โ€” the fractions of the portfolio placed in the respective assets. Since p = wT๐ซ, the expected value of the portfolio return is wTE(๐ซ) and the variance of the portfolio return can be shown to be wTCw, where C is the covariance matrix of ๐ซ.

Regression theory

[edit | edit source]

In linear regression theory, we have data on n observations on a dependent variable y and n observations on each of k independent variables xj. The observations on the dependent variable are stacked into a column vector y; the observations on each independent variable are also stacked into column vectors, and these latter column vectors are combined into a design matrix X (not denoting a random vector in this context) of observations on the independent variables. Then the following regression equation is postulated as a description of the process that generated the data:

y=Xฮฒ+e,

where ฮฒ is a postulated fixed but unknown vector of k response coefficients, and e is an unknown random vector reflecting random influences on the dependent variable. By some chosen technique such as ordinary least squares, a vector ฮฒ^ is chosen as an estimate of ฮฒ, and the estimate of the vector e, denoted e^, is computed as

e^=yโˆ’Xฮฒ^.

Then the statistician must analyze the properties of ฮฒ^ and e^, which are viewed as random vectors since a randomly different selection of n cases to observe would have resulted in different values for them.

Vector time series

[edit | edit source]

The evolution of a kร—1 random vector ๐— through time can be modelled as a vector autoregression (VAR) as follows:

๐—t=c+A1๐—tโˆ’1+A2๐—tโˆ’2+โ‹ฏ+Ap๐—tโˆ’p+๐žt,

where the i-periods-back vector observation ๐—tโˆ’i is called the i-th lag of ๐—, c is a k ร— 1 vector of constants (intercepts), Ai is a time-invariant k ร— k matrix and ๐žt is a k ร— 1 random vector of error terms.

References

[edit | edit source]
  1. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  2. ^ a b c d e Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  3. ^ a b c d e Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  4. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  5. ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).

Further reading

[edit | edit source]
  • Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).

de:Zufallsvariable#Mehrdimensionale Zufallsvariable pl:Zmienna losowa#Uogรณlnienia