Normal-Wishart distribution

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search
Normal-Wishart
Notation (𝝁,𝜦)NW(𝝁0,λ,𝐖,ν)
Parameters 𝝁0D location (vector of real)
λ>0 (real)
𝐖D×D scale matrix (pos. def.)
ν>D1 (real)
Support 𝝁D;𝜦D×D covariance matrix (pos. def.)
PDF f(𝝁,𝜦|𝝁0,λ,𝐖,ν)=𝒩(𝝁|𝝁0,(λ𝜦)1) 𝒲(𝜦|𝐖,ν)

In probability theory and statistics, the normal-Wishart distribution (or Gaussian-Wishart distribution) is a multivariate four-parameter family of continuous probability distributions. It is the conjugate prior of a multivariate normal distribution with unknown mean and precision matrix (the inverse of the covariance matrix).[1]

Definition

[edit | edit source]

Suppose

𝝁|𝝁0,λ,𝜦𝒩(𝝁0,(λ𝜦)1)

has a multivariate normal distribution with mean 𝝁0 and covariance matrix (λ𝜦)1, where

𝜦|𝐖,ν𝒲(𝜦|𝐖,ν)

has a Wishart distribution. Then (𝝁,𝜦) has a normal-Wishart distribution, denoted as

(𝝁,𝜦)NW(𝝁0,λ,𝐖,ν).

Characterization

[edit | edit source]

Probability density function

[edit | edit source]
f(𝝁,𝜦|𝝁0,λ,𝐖,ν)=𝒩(𝝁|𝝁0,(λ𝜦)1) 𝒲(𝜦|𝐖,ν)

Properties

[edit | edit source]

Scaling

[edit | edit source]

Marginal distributions

[edit | edit source]

By construction, the marginal distribution over 𝜦 is a Wishart distribution, and the conditional distribution over 𝝁 given 𝜦 is a multivariate normal distribution. The marginal distribution over 𝝁 is a multivariate t-distribution.

Posterior distribution of the parameters

[edit | edit source]

After making n observations 𝒙1,,𝒙n, the posterior distribution of the parameters is

(𝝁,𝜦)NW(𝝁n,λn,𝐖n,νn),

where

λn=λ+n,
𝝁n=λ𝝁0+n𝒙¯λ+n,
νn=ν+n,
𝐖n1=𝐖1+i=1n(𝒙i𝒙¯)(𝒙i𝒙¯)T+nλn+λ(𝒙¯𝝁0)(𝒙¯𝝁0)T.[2]

Generating normal-Wishart random variates

[edit | edit source]

Generation of random variates is straightforward:

  1. Sample 𝜦 from a Wishart distribution with parameters 𝐖 and ν
  2. Sample 𝝁 from a multivariate normal distribution with mean 𝝁0 and variance (λ𝜦)1
[edit | edit source]

Notes

[edit | edit source]
  1. ^ Bishop, Christopher M. (2006). Pattern Recognition and Machine Learning. Springer Science+Business Media. Page 690.
  2. ^ Cross Validated, https://stats.stackexchange.com/q/324925

References

[edit | edit source]
  • Bishop, Christopher M. (2006). Pattern Recognition and Machine Learning. Springer Science+Business Media.