Pickands–Balkema–De Haan theorem

The Pickands–Balkema–De Haan theorem gives the asymptotic tail distribution of a random variable, when its true distribution is unknown. It is often called the second theorem in extreme value theory. Unlike the first theorem (the Fisher–Tippett–Gnedenko theorem), which concerns the maximum of a sample, the Pickands–Balkema–De Haan theorem describes the values above a threshold.

The theorem owes its name to mathematicians James Pickands, Guus Balkema, and Laurens de Haan.

Conditional excess distribution function

For an unknown distribution function $F$ of a random variable $X$ , the Pickands–Balkema–De Haan theorem describes the conditional distribution function $F_{u}$ of the variable $X$ above a certain threshold $u$ . This is the so-called conditional excess distribution function, defined as

F_{u}(y)=P(X-u\leq y|X>u)={\frac {F(u+y)-F(u)}{1-F(u)}}

for $0\leq y\leq x_{F}-u$ , where $x_{F}$ is either the finite or infinite right endpoint of the underlying distribution $F$ . The function $F_{u}$ describes the distribution of the excess value over a threshold $u$ , given that the threshold is exceeded.

Statement

Let $F_{u}$ be the conditional excess distribution function. Pickands,^[1] Balkema and De Haan^[2] posed that for a large class of underlying distribution functions $F$ , and large $u$ , $F_{u}$ is well approximated by the generalized Pareto distribution, in the following sense. Suppose that there exist functions $a(u),b(u)$ , with $a(u)>0$ such that $F_{u}(a(u)y+b(u))$ as $u\rightarrow \infty$ converge to a non-degenerate distribution, then such limit is equal to the generalized Pareto distribution:

F_{u}(a(u)y+b(u))\rightarrow G_{k,\sigma }(y),{\text{ as }}u\rightarrow \infty

,

where

$G_{k,\sigma }(y)=1-(1+ky/\sigma )^{-1/k}$ , if $k\neq 0$
$G_{k,\sigma }(y)=1-e^{-y/\sigma }$ , if $k=0.$

Here σ > 0, and y ≥ 0 when k ≥ 0 and 0 ≤ y ≤ −σ/k when k < 0. These special cases are also known as

Exponential distribution with mean $\sigma$ , if k = 0,
Uniform distribution on $[0,\sigma ]$ , if k = -1,
Pareto distribution, if k > 0.

The class of underlying distribution functions $F$ are related to the class of the distribution functions $F$ satisfying the Fisher–Tippett–Gnedenko theorem.^[3]

Since a special case of the generalized Pareto distribution is a power-law, the Pickands–Balkema–De Haan theorem is sometimes used to justify the use of a power-law for modeling extreme events.

The theorem has been extended to include a wider range of distributions.^[4]^[5] While the extended versions cover, for example the normal and log-normal distributions, still continuous distributions exist that are not covered.^[6]

References

^ Iii, James Pickands (1975-01-01). "Statistical Inference Using Extreme Order Statistics". The Annals of Statistics. 3 (1). doi:10.1214/aos/1176343003. ISSN 0090-5364.
^ Balkema, A. A.; de Haan, L. (1974-10-01). "Residual Life Time at Great Age". The Annals of Probability. 2 (5). doi:10.1214/aop/1176996548. ISSN 0091-1798.
^ Balkema, A. A.; de Haan, L. (1974-10-01). "Residual Life Time at Great Age". The Annals of Probability. 2 (5). doi:10.1214/aop/1176996548. ISSN 0091-1798.
^ Papastathopoulos, Ioannis; Tawn, Jonathan A. (2013). "Extended Generalised Pareto Models for Tail Estimation". Journal of Statistical Planning and Inference. 143 (1): 131–143. arXiv:1111.6899. doi:10.1016/j.jspi.2012.07.001. S2CID 88512480.
^ Lee, Seyoon; Kim, Joseph H. T. (2019-04-18). "Exponentiated generalized Pareto distribution: Properties and applications towards extreme value theory". Communications in Statistics - Theory and Methods. 48 (8): 2014–2038. arXiv:1708.01686. doi:10.1080/03610926.2018.1441418. ISSN 0361-0926. S2CID 88514574.
^ Smith, Richard L.; Weissman, Ishay. Extreme Values (PDF) (draft 2/27/2020 ed.).

[1] Iii, James Pickands (1975-01-01). "Statistical Inference Using Extreme Order Statistics". The Annals of Statistics. 3 (1). doi:10.1214/aos/1176343003. ISSN 0090-5364.

[2] Balkema, A. A.; de Haan, L. (1974-10-01). "Residual Life Time at Great Age". The Annals of Probability. 2 (5). doi:10.1214/aop/1176996548. ISSN 0091-1798.

[3] Balkema, A. A.; de Haan, L. (1974-10-01). "Residual Life Time at Great Age". The Annals of Probability. 2 (5). doi:10.1214/aop/1176996548. ISSN 0091-1798.

[4] Papastathopoulos, Ioannis; Tawn, Jonathan A. (2013). "Extended Generalised Pareto Models for Tail Estimation". Journal of Statistical Planning and Inference. 143 (1): 131–143. arXiv:1111.6899. doi:10.1016/j.jspi.2012.07.001. S2CID 88512480.

[5] Lee, Seyoon; Kim, Joseph H. T. (2019-04-18). "Exponentiated generalized Pareto distribution: Properties and applications towards extreme value theory". Communications in Statistics - Theory and Methods. 48 (8): 2014–2038. arXiv:1708.01686. doi:10.1080/03610926.2018.1441418. ISSN 0361-0926. S2CID 88514574.

[6] Smith, Richard L.; Weissman, Ishay. Extreme Values (PDF) (draft 2/27/2020 ed.).

[1]

[2]

[3]

[4]

[5]

[6]

Conditional excess distribution function

Statement

See also

References