Hoeffding's lemma

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

In probability theory, Hoeffding's lemma is an inequality that bounds the moment-generating function of any bounded random variable,[1] implying that such variables are subgaussian. It is named after the FinnishAmerican mathematical statistician Wassily Hoeffding.

The proof of Hoeffding's lemma uses Taylor's theorem and Jensen's inequality. Hoeffding's lemma is itself used in the proof of Hoeffding's inequality as well as the generalization McDiarmid's inequality.

Statement

[edit | edit source]

Let X be any real-valued random variable such that aXb almost surely, i.e. with probability one. Then, for all λ,

𝔼[eλX]exp(λ𝔼[X]+λ2(ba)28),

or equivalently,

𝔼[eλ(X𝔼[X])]exp(λ2(ba)28).

Proof

[edit | edit source]

The following proof is direct but somewhat ad-hoc.

Proof

Let μ=𝔼[X]. Since the conclusion involves ba, without loss of generality, one may replace X by Xμ, a by aμ, and b by bμ, which leaves the difference ba unchanged, and assume 𝔼[X]=0, so that a0b.

Since eλx is a convex function of x, we have that for all x[a,b],

eλxbxbaeλa+xabaeλb

So,

𝔼[eλX]b𝔼[X]baeλa+𝔼[X]abaeλb=bbaeλa+abaeλb=eL(λ(ba)),

where L(h)=haba+ln(1+aehaba). By computing derivatives, we find

L(0)=L(0)=0 and L(h)=abeh(baeh)2.

From the AMGM inequality we thus see that L(h)14 for all h, and thus, from Taylor's theorem, there is some 0θ1 such that

L(h)=L(0)+hL(0)+12h2L(hθ)18h2.

Thus, 𝔼[eλX]e18λ2(ba)2.

Statement

[edit | edit source]

This statement and proof uses the language of subgaussian variables and exponential tilting, and is less ad-hoc.[2]: Lemma 2.2 

Let X be any real-valued random variable such that aXb almost surely, i.e. with probability one. Then it is subgaussian with variance proxy norm Xvpba2.

Proof

By the definition of variance proxy, it suffices to show that its cumulant generating function K(t):=logE[et(XE[X])] satisfies K(t)(ba)2/4. Explicit calculation shows K(t)=E[(XE[X])2et(XE[X])]E[et(XE[X])](E[(XE[X])et(XE[X])]E[et(XE[X])])2Notice that the quantity E[(XE[X])et(XE[X])]E[et(XE[X])]=E[(XE[X])et(XE[X])E[et(XE[X])]] is precisely the expectation of a random variable obtained by exponentially tilting XE[X]. Let this variable be Yt. It remains to bound Var[Yt](ba)2/4.

Notice that Yt still has range [aE[X],bE[X]]. So translate it to Zt:=Yta+bE[X]2 so that its range has midpoint zero. It remains to bound Var[Zt](ba)2/4. However, now the bound is trivial, since |Zt|ba2.

Given this general case, the formula

𝔼[eλ(X𝔼[X])]eλ2(ba)28

is a mere corollary of a general property of variance proxy.

See also

[edit | edit source]

Notes

[edit | edit source]
  1. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  2. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).