Disintegration theorem

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

In mathematics, the disintegration theorem is a result in measure theory and probability theory. It rigorously defines the idea of a non-trivial "restriction" of a measure to a measure zero subset of the measure space in question. It is related to the existence of conditional probability measures. In a sense, "disintegration" is the opposite process to the construction of a product measure.

Motivation

[edit | edit source]

Consider the unit square S=[0,1]×[0,1] in the Euclidean plane 2. Consider the probability measure μ defined on S by the restriction of two-dimensional Lebesgue measure λ2 to S. That is, the probability of an event ES is simply the area of E. We assume E is a measurable subset of S.

Consider a one-dimensional subset of S such as the line segment Lx={x}×[0,1]. Lx has μ-measure zero; every subset of Lx is a μ-null set; since the Lebesgue measure space is a complete measure space, ELxμ(E)=0.

While true, this is somewhat unsatisfying. It would be nice to say that μ "restricted to" Lx is the one-dimensional Lebesgue measure λ1, rather than the zero measure. The probability of a "two-dimensional" event E could then be obtained as an integral of the one-dimensional probabilities of the vertical "slices" ELx: more formally, if μx denotes one-dimensional Lebesgue measure on Lx, then μ(E)=[0,1]μx(ELx)dx for any "nice" ES. The disintegration theorem makes this argument rigorous in the context of measures on metric spaces.

Statement of the theorem

[edit | edit source]

(Hereafter, 𝒫(X) will denote the collection of Borel probability measures on a topological space (X,T).) The assumptions of the theorem are as follows:

  • Let Y and X be two Radon spaces (i.e. a topological space such that every Borel probability measure on it is inner regular, e.g. separably metrizable spaces; in particular, every probability measure on it is outright a Radon measure).
  • Let μ𝒫(Y).
  • Let π:YX be a Borel-measurable function. Here one should think of π as a function to "disintegrate" Y, in the sense of partitioning Y into {π1(x) | xX}. For example, for the motivating example above, one can define π((a,b))=a, (a,b)[0,1]×[0,1], which gives that π1(a)=a×[0,1], a slice we want to capture.
  • Let ν𝒫(X) be the pushforward measure ν=π*(μ)=μπ1. This measure provides the distribution of x (which corresponds to the events π1(x)).

The conclusion of the theorem: There exists a ν-almost everywhere uniquely determined family of probability measures {μx}xX𝒫(Y), which provides a "disintegration" of μ into {μx}xX, such that:

  • the function xμx is Borel measurable, in the sense that xμx(B) is a Borel-measurable function for each Borel-measurable set BY;
  • μx "lives on" the fiber π1(x): for ν-almost all xX, μx(Yπ1(x))=0, and so μx(E)=μx(Eπ1(x));
  • for every Borel-measurable function f:Y[0,], Yf(y)dμ(y)=Xπ1(x)f(y)dμx(y)dν(x). In particular, for any event EY, taking f to be the indicator function of E,[1] μ(E)=Xμx(E)dν(x).

Applications

[edit | edit source]

Product spaces

[edit | edit source]

The original example was a special case of the problem of product spaces, to which the disintegration theorem applies.

When Y is written as a Cartesian product Y=X1×X2 and πi:YXi is the natural projection, then each fibre π11(x1) can be canonically identified with X2 and there exists a Borel family of probability measures {μx1}x1X1 in 𝒫(X2) (which is (π1)*(μ)-almost everywhere uniquely determined) such that μ=X1μx1μ(π11(dx1))=X1μx1d(π1)*(μ)(x1), which is in particular[clarification needed] X1×X2f(x1,x2)μ(dx1,dx2)=X1(X2f(x1,x2)μ(dx2x1))μ(π11(dx1)) and μ(A×B)=Aμ(Bx1)μ(π11(dx1)).

The relation to conditional expectation is given by the identities E(fπ1)(x1)=X2f(x1,x2)μ(dx2x1), μ(A×Bπ1)(x1)=1A(x1)μ(Bx1).

Vector calculus

[edit | edit source]

The disintegration theorem can also be seen as justifying the use of a "restricted" measure in vector calculus. For instance, in Stokes' theorem as applied to a vector field flowing through a compact surface Σ3, it is implicit that the "correct" measure on Σ is the disintegration of three-dimensional Lebesgue measure λ3 on Σ, and that the disintegration of this measure on ∂Σ is the same as the disintegration of λ3 on Σ.[2]

Conditional distributions

[edit | edit source]

The disintegration theorem can be applied to give a rigorous treatment of conditional probability distributions in statistics, while avoiding purely abstract formulations of conditional probability.[3] The theorem is related to the Borel–Kolmogorov paradox, for example.

See also

[edit | edit source]

References

[edit | edit source]
  1. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  2. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  3. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).