II. SIMPLE EXPLANATIONS OF KEY IDEAS IN ORDINAL OPTIMIZATION




II.3. THE CHERNOFF BOUND EXPLAINED

We are interested in the probability that a random variable x exceeds a certain value V, i.e., Prob(x >= V). Let us define another random variable Y_V such that

Y_V = 1 if x >= V
Y_V = 0 if x < V

Notice that E[Y_V] = Prob(x >= V).
Then an elementary argument (see figure below) shows that, for all s >= 0,

e^{sx} >= e^{sV} Y_V

Thus,

E[e^{sx}] >= e^{sV} E[Y_V] = e^{sV} Prob(x >= V)
=> Prob(x >= V) <= e^{-sV} E[e^{sx}], for all s >= 0
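The derivation above can be checked numerically. As an illustration (my choice of distribution, not one from the text), take x to be standard normal, whose moment generating function E[e^{sx}] = e^{s^2/2} is known in closed form; a minimal sketch then verifies that e^{-sV} E[e^{sx}] dominates a Monte Carlo estimate of the tail probability for several values of s:

```python
import math
import random

random.seed(0)

# Assumption for illustration: x ~ N(0, 1), so E[e^{sx}] = e^{s^2/2} exactly.
V = 2.0
samples = [random.gauss(0.0, 1.0) for _ in range(100_000)]
tail_estimate = sum(1 for x in samples if x >= V) / len(samples)

for s in (0.5, 1.0, 2.0, 3.0):
    mgf = math.exp(s * s / 2.0)        # E[e^{sx}] for a standard normal
    bound = math.exp(-s * V) * mgf     # Chernoff bound e^{-sV} E[e^{sx}] at this s
    print(f"s={s}: bound={bound:.4f}, empirical tail={tail_estimate:.4f}")
    assert bound >= tail_estimate      # the bound must hold for every s >= 0
```

Every choice of s >= 0 gives a valid upper bound; some choices (here s near V) give much tighter ones, which motivates minimizing over s next.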

which is known as the Chernoff Bound. Furthermore,

Prob(x >= V) <= min_{s>=0} e^{-sV} E[e^{sx}]
             = min_{s>=0} exp{ -(sV - log E[e^{sx}]) }

and we can define the rate function (a function of V, since the supremum is taken over s)

R(V) = sup_{s>=0} { sV - log E[e^{sx}] }

finally yielding

Prob(x >= V) <= exp[-R(V)]

i.e., the probability that a random variable falls outside some set decreases exponentially fast as a function of the size of the set (the threshold V in this case), at a rate governed by R(V).
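For the standard normal example (again my illustrative assumption), log E[e^{sx}] = s^2/2, so the supremum sV - s^2/2 is attained at s = V and the rate function is R(V) = V^2/2. A short sketch confirms the analytic optimum against a grid search over s:

```python
import math

# Assumption for illustration: x ~ N(0, 1), so log E[e^{sx}] = s^2/2.
V = 2.0
grid = [i * 0.001 for i in range(10_001)]            # s in [0, 10]
numeric_R = max(s * V - s * s / 2.0 for s in grid)   # grid approximation of the sup
exact_R = V * V / 2.0                                # analytic optimum, attained at s = V

print(f"grid sup = {numeric_R:.6f}, analytic R(V) = {exact_R:.6f}")
print(f"optimized Chernoff bound: exp(-R(V)) = {math.exp(-exact_R):.6f}")
assert abs(numeric_R - exact_R) < 1e-6
```

The optimized bound exp(-V^2/2) is far tighter than the bound at any fixed s chosen in advance, and it exhibits exactly the exponential decay in V described above.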


