Logit-normal distribution

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search
Logit-normal
Probability density function
Plot of the Logitnormal PDF
Cumulative distribution function
Plot of the Logitnormal PDF
Notation P(๐’ฉ(ฮผ,ฯƒ2))
Parameters ฯƒ2 > 0 โ€” squared scale (real),
ฮผ โˆˆ R โ€” location
Support x โˆˆ (0, 1)
PDF 1ฯƒ2ฯ€eโˆ’(logit(x)โˆ’ฮผ)22ฯƒ21x(1โˆ’x)
CDF 12[1+erf(logit(x)โˆ’ฮผ2ฯƒ2)]
Mean no analytical solution
Median P(ฮผ)
Mode no analytical solution
Variance no analytical solution
MGF no analytical solution

In probability theory, a logit-normal distribution is a probability distribution of a random variable whose logit has a normal distribution. If Y is a random variable with a normal distribution, and t is the standard logistic function, then X = t(Y) has a logit-normal distribution; likewise, if X is logit-normally distributed, then Y = logit(X)= log (X/(1-X)) is normally distributed. It is also known as the logistic normal distribution,[1] which often refers to a multinomial logit version (e.g.[2][3][4]).

A variable might be modeled as logit-normal if it is a proportion, which is bounded by zero and one, and where values of zero and one never occur.

Characterization

[edit | edit source]

Probability density function

[edit | edit source]

The probability density function (PDF) of a logit-normal distribution, for 0 < x < 1, is:

fX(x;ฮผ,ฯƒ)=1ฯƒ2ฯ€1x(1โˆ’x)eโˆ’(logit(x)โˆ’ฮผ)22ฯƒ2

where ฮผ and ฯƒ are the mean and standard deviation of the variableโ€™s logit (by definition, the variableโ€™s logit is normally distributed).

The density obtained by changing the sign of ฮผ is symmetrical, in that it is equal to f(1-x;-ฮผ,ฯƒ), shifting the mode to the other side of 0.5 (the midpoint of the (0,1) interval).

File:LogitnormDensityGrid.svg
Plot of the Logitnormal PDF for various combinations of ฮผ (facets) and ฯƒ (colors)

Moments

[edit | edit source]

The moments of the logit-normal distribution have no analytic solution. The moments can be estimated by numerical integration, however numerical integration can be prohibitive when the values of ฮผ,ฯƒ2are such that the density function diverges to infinity at the end points zero and one. An alternative is to use the observation that the logit-normal is a transformation of a normal random variable. This allows us to approximate the n-th moment via the following quasi Monte Carlo estimate E[Xn]โ‰ˆ1Kโˆ’1โˆ‘i=1Kโˆ’1(P(ฮฆฮผ,ฯƒ2โˆ’1(i/K)))n,

where P is the standard logistic function, and ฮฆฮผ,ฯƒ2โˆ’1 is the inverse cumulative distribution function of a normal distribution with mean and variance ฮผ,ฯƒ2. When n=1, this corresponds to the mean.

Mode or modes

[edit | edit source]

When the derivative of the density equals 0 then the location of the mode x satisfies the following equation:

logit(x)=ฯƒ2(2xโˆ’1)+ฮผ.

For some values of the parameters there are two solutions, i.e. the distribution is bimodal.

Multivariate generalization

[edit | edit source]

The logistic normal distribution is a generalization of the logitโ€“normal distribution to D-dimensional probability vectors by taking a logistic transformation of a multivariate normal distribution.[1][5][6]

Probability density function

[edit | edit source]

The probability density function is:

fX(๐ฑ;๐,๐œฎ)=1(2ฯ€)Dโˆ’1|๐œฎ|121โˆi=1Dxieโˆ’12{log(๐ฑโˆ’DxD)โˆ’๐}โŠค๐œฎโˆ’1{log(๐ฑโˆ’DxD)โˆ’๐},๐ฑโˆˆ๐’ฎD,

where ๐ฑโˆ’D denotes a vector of the first (D-1) components of ๐ฑ and ๐’ฎD denotes the simplex of D-dimensional probability vectors. This follows from applying the additive logistic transformation to map a multivariate normal random variable ๐ฒโˆผ๐’ฉ(๐,๐œฎ),๐ฒโˆˆโ„Dโˆ’1 to the simplex:

๐ฑ=[ey11+โˆ‘i=1Dโˆ’1eyi,,eyDโˆ’11+โˆ‘i=1Dโˆ’1eyi,11+โˆ‘i=1Dโˆ’1eyi]โŠค
File:Gaussian and Logistic Normal pdfs.pdf
Gaussian density functions and corresponding logistic normal density functions after logistic transformation.

The unique inverse mapping is given by:

๐ฒ=[log(x1xD),,log(xDโˆ’1xD)]โŠค.

This is the case of a vector x which components sum up to one. In the case of x with sigmoidal elements, that is, when

๐ฒ=[log(x11โˆ’x1),,log(xD1โˆ’xD)]โŠค

we have

fX(๐ฑ;๐,๐œฎ)=1(2ฯ€)Dโˆ’1|๐œฎ|121โˆi=1D(xi(1โˆ’xi))eโˆ’12{log(๐ฑ1โˆ’๐ฑ)โˆ’๐}โŠค๐œฎโˆ’1{log(๐ฑ1โˆ’๐ฑ)โˆ’๐}

where the log and the division in the argument are taken element-wise. This is because the Jacobian matrix of the transformation is diagonal with elements 1xi(1โˆ’xi).

Use in statistical analysis

[edit | edit source]

The logistic normal distribution is a more flexible alternative to the Dirichlet distribution in that it can capture correlations between components of probability vectors. It also has the potential to simplify statistical analyses of compositional data by allowing one to answer questions about log-ratios of the components of the data vectors. One is often interested in ratios rather than absolute component values.

The probability simplex is a bounded space, making standard techniques that are typically applied to vectors in โ„n less meaningful. Statistician John Aitchison described the problem of spurious negative correlations when applying such methods directly to simplicial vectors.[5] However, mapping compositional data in ๐’ฎD through the inverse of the additive logistic transformation yields real-valued data in โ„Dโˆ’1. Standard techniques can be applied to this representation of the data. This approach justifies use of the logistic normal distribution, which can thus be regarded as the "Gaussian of the simplex".

Relationship with the Dirichlet distribution

[edit | edit source]
File:Logistic Normal approximation to Dirichlet distribution.pdf
Logistic normal approximation to Dirichlet distribution

The Dirichlet and logistic normal distributions are never exactly equal for any choice of parameters. However, Aitchison described a method for approximating a Dirichlet with a logistic normal such that their Kullbackโ€“Leibler divergence (KL) is minimized:

K(p,q)=โˆซ๐’ฎDp(๐ฑโˆฃ๐œถ)log(p(๐ฑโˆฃ๐œถ)q(๐ฑโˆฃ๐,๐œฎ))d๐ฑ

This is minimized by:

๐*=๐„p[log(๐ฑโˆ’DxD)],๐œฎ*=๐•๐š๐ซp[log(๐ฑโˆ’DxD)]

Using moment properties of the Dirichlet distribution, the solution can be written in terms of the digamma ฯˆ and trigamma ฯˆ functions:

ฮผi*=ฯˆ(ฮฑi)โˆ’ฯˆ(ฮฑD),i=1,โ€ฆ,Dโˆ’1
ฮฃii*=ฯˆ(ฮฑi)+ฯˆ(ฮฑD),i=1,โ€ฆ,Dโˆ’1
ฮฃij*=ฯˆ(ฮฑD),iโ‰ j

This approximation is particularly accurate for large ๐œถ. In fact, one can show that for ฮฑiโ†’โˆž,i=1,โ€ฆ,D, we have that p(๐ฑโˆฃ๐œถ)โ†’q(๐ฑโˆฃ๐*,๐œฎ*).

See also

[edit | edit source]

References

[edit | edit source]
  1. ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  2. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  3. ^ Peter Hoff, 2003. Link
  4. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  5. ^ a b J. Atchison. "The Statistical Analysis of Compositional Data." Monographs on Statistics and Applied Probability, Chapman and Hall, 1986. Book
  6. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).

Further reading

[edit | edit source]
  • Frederic, P. & Lad, F. (2008) Two Moments of the Logitnormal Distribution. Communications in Statistics-Simulation and Computation. 37: 1263-1269
  • Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
[edit | edit source]