Normal-Wishart distribution

Normal-Wishart
Normal-Wishart
Notation	(𝝁,𝜦)∼NW(𝝁0,λ,𝐖,ν)
Parameters	𝝁0∈ℝD location (vector of real); λ>0 (real); 𝐖∈ℝD×D scale matrix (pos. def.); ν>D−1 (real)
Support	𝝁∈ℝD;𝜦∈ℝD×D covariance matrix (pos. def.)
PDF	f(𝝁,𝜦\|𝝁0,λ,𝐖,ν)=𝒩(𝝁\|𝝁0,(λ𝜦)−1) 𝒲(𝜦\|𝐖,ν)

In probability theory and statistics, the normal-Wishart distribution (or Gaussian-Wishart distribution) is a multivariate four-parameter family of continuous probability distributions. It is the conjugate prior of a multivariate normal distribution with unknown mean and precision matrix (the inverse of the covariance matrix).^[1]

Definition

Suppose

𝝁 | 𝝁_{0}, λ, 𝜦 \sim 𝒩 (𝝁_{0}, (λ 𝜦)^{- 1})

has a multivariate normal distribution with mean $𝝁_{0}$ and covariance matrix $(λ 𝜦)^{- 1}$ , where

𝜦 | 𝐖, ν \sim 𝒲 (𝜦 | 𝐖, ν)

has a Wishart distribution. Then $(𝝁, 𝜦)$ has a normal-Wishart distribution, denoted as

(𝝁, 𝜦) \sim N W (𝝁_{0}, λ, 𝐖, ν) .

Characterization

Probability density function

f (𝝁, 𝜦 | 𝝁_{0}, λ, 𝐖, ν) = 𝒩 (𝝁 | 𝝁_{0}, (λ 𝜦)^{- 1}) 𝒲 (𝜦 | 𝐖, ν)

Properties

Scaling

Marginal distributions

By construction, the marginal distribution over $𝜦$ is a Wishart distribution, and the conditional distribution over $𝝁$ given $𝜦$ is a multivariate normal distribution. The marginal distribution over $𝝁$ is a multivariate t-distribution.

Posterior distribution of the parameters

After making $n$ observations $𝒙_{1}, \dots, 𝒙_{n}$ , the posterior distribution of the parameters is

(𝝁, 𝜦) \sim N W (𝝁_{n}, λ_{n}, 𝐖_{n}, ν_{n}),

where

λ_{n} = λ + n,

𝝁_{n} = \frac{λ 𝝁_{0} + n \bar{𝒙}}{λ + n},

ν_{n} = ν + n,

𝐖_{n}^{- 1} = 𝐖^{- 1} + \sum_{i = 1}^{n} (𝒙_{i} - \bar{𝒙}) (𝒙_{i} - \bar{𝒙})^{T} + \frac{n λ}{n + λ} (\bar{𝒙} - 𝝁_{0}) (\bar{𝒙} - 𝝁_{0})^{T} .

^[2]

Generating normal-Wishart random variates

Generation of random variates is straightforward:

Sample $𝜦$ from a Wishart distribution with parameters $𝐖$ and $ν$
Sample $𝝁$ from a multivariate normal distribution with mean $𝝁_{0}$ and variance $(λ 𝜦)^{- 1}$

Related distributions

The normal-inverse Wishart distribution is essentially the same distribution parameterized by variance rather than precision.
The normal-gamma distribution is the one-dimensional equivalent.
The multivariate normal distribution and Wishart distribution are the component distributions out of which this distribution is made.

Notes

^ Bishop, Christopher M. (2006). Pattern Recognition and Machine Learning. Springer Science+Business Media. Page 690.
^ Cross Validated, https://stats.stackexchange.com/q/324925

References

Bishop, Christopher M. (2006). Pattern Recognition and Machine Learning. Springer Science+Business Media.

[bishop-1] Bishop, Christopher M. (2006). Pattern Recognition and Machine Learning. Springer Science+Business Media. Page 690.

[2] Cross Validated, https://stats.stackexchange.com/q/324925

[1]

[2]

Normal-Wishart distribution

Contents

Definition

Characterization

Probability density function

Properties

Scaling

Marginal distributions

Posterior distribution of the parameters

Generating normal-Wishart random variates

Related distributions

Notes

References

Navigation menu

Normal-Wishart
Notation	$(𝝁, 𝜦) \sim N W (𝝁_{0}, λ, 𝐖, ν)$
Parameters	$𝝁_{0} \in ℝ^{D}$ location (vector of real) $λ > 0$ (real) $𝐖 \in ℝ^{D \times D}$ scale matrix (pos. def.) $ν > D - 1$ (real)
Support	$𝝁 \in ℝ^{D}; 𝜦 \in ℝ^{D \times D}$ covariance matrix (pos. def.)
PDF	$f (𝝁, 𝜦 \| 𝝁_{0}, λ, 𝐖, ν) = 𝒩 (𝝁 \| 𝝁_{0}, (λ 𝜦)^{- 1}) 𝒲 (𝜦 \| 𝐖, ν)$

Normal-Wishart distribution

Definition

Characterization

Probability density function

Properties

Scaling

Marginal distributions

Posterior distribution of the parameters

Generating normal-Wishart random variates

Related distributions

Notes

References

Navigation menu

Search