Professional Documents
Culture Documents
6 Order Statistics
Order Statistics
Colin Rundel
X(1) X(2) X(3) X(4) X(5)
For X1 , X2 , . . . , Xn iid random variables Xk is the kth smallest X , usually For a continuous random variable we can see that
called the kth order statistic.
f (x) ≈ P(x ≤ X ≤ x + ) = P(X ∈ [x, x + ])
X(1) is therefore the smallest X and lim f (x) = lim P(X ∈ [x, x + ])
→0 →0
f (x) f (x + ✏)
x x+✏
Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 2 / 24 Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 3 / 24
Section 4.6 Order Statistics Section 4.6 Order Statistics
For X1 , X2 , . . . , Xn iid continuous random variables with pdf f and cdf F For X1 , X2 , . . . , Xn iid continuous random variables with pdf f and cdf F
the density of the maximum is the density of the minimum is
P(X(n) ∈ [x, x + ]) = P(one of the X ’s ∈ [x, x + ] and all others < x) P(X(1) ∈ [x, x + ]) = P(one of the X ’s ∈ [x, x + ] and all others > x)
n
X n
X
= P(Xi ∈ [x, x + ] and all others < x) = P(Xi ∈ [x, x + ] and all others > x)
i=1 i=1
= nP(X1 ∈ [x, x + ] and all others < x) = nP(X1 ∈ [x, x + ] and all others > x)
= nP(X1 ∈ [x, x + ])P(all others < x) = nP(X1 ∈ [x, x + ])P(all others > x)
= nP(X1 ∈ [x, x + ])P(X2 < x) · · · P(Xn < x) = nP(X1 ∈ [x, x + ])P(X2 > x) · · · P(Xn > x)
= nf (x)F (x)n−1 = nf (x)(1 − F (x))n−1
Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 4 / 24 Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 5 / 24
Density of the kth Order Statistic Cumulative Distribution of the min and max
For X1 , X2 , . . . , Xn iid continuous random variables with pdf f and cdf F For X1 , X2 , . . . , Xn iid continuous random variables with pdf f and cdf F
the density of the kth order statistic is the density of the kth order statistic is
P(X(k) ∈ [x, x + ]) = P(one of the X ’s ∈ [x, x + ] and exactly k − 1 of the others < x) F(1) (x) = P(X(1) < x) = 1 − P(X(1) > x)
n
X = 1 − P(X1 > x, . . . , Xn > x) = 1 − P(X1 > x) · · · P(Xn > x)
= P(Xi ∈ [x, x + ] and exactly k − 1 of the others < x) = 1 − (1 − F (x))n
i=1
!
n−1 d dF (x)
f(k) (x) = nf (x) F (x)k−1 (1 − F (x))n−k f(1) (x) = (1 − F (x))n = n(1 − F (x))n−1 = nf (x)(1 − F (x))n−1
k −1 dx dx
d dF (x)
f(n) (x) = F (x)n = nF (x)n−1 = nf (x)F (x)n−1
dx dx
Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 6 / 24 Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 7 / 24
Section 4.6 Order Statistics Section 4.6 Order Statistics
Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 8 / 24 Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 9 / 24
The connection between the Beta distribution and the kth order statistic Let X ∼ Beta(r , s) then
of n standard Uniform random variables allows us to simplify the Beta
function. 1
Z
1
E (X ) = x x r −1 (1 − x)s−1 dx
0 B(r , s)
Z
Z 1 1
B(r , s) = x r −1 (1 − x)s−1 dx = , 1x (r +1)−1 (1 − x)s−1 dx
B(r , s) 0
0
1 B(r + 1, s)
B(k, n − k + 1) = =
n−1 B(r , s)
n k−1
(k − 1)!(n − 1 − k + 1)! r !(s − 1)! (r + s − 1)!
= =
n(n − 1)! (r + s)! (r − 1)!(s − 1)!
(r − 1)!(n − k)! r! (r + s − 1)!
= =
n! (r − 1)! (r + s)!
r
(r − 1)!(s − 1)! Γ(r )Γ(s) =
= = r +s
(r + s − 1)! Γ(r + s)
Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 10 / 24 Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 11 / 24
Section 4.6 Order Statistics Section 4.6 Order Statistics
Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 12 / 24 Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 13 / 24
= nλe −nλx
Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 16 / 24 Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 17 / 24
Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 18 / 24 Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 19 / 24
Section 4.6 Order Statistics Section 4.6 Order Statistics
1.0
Median(X(n) ) = m(n)
0.3
0.8
0.6
0.2
X(n) − an
F(x)
f(x)
P < mG = 1/2
0.4
bn
0.1
0.2
0.0
0.0
-5 0 5 10 -5 0 5 10
x x m(n) = an + bn mG
−x √ 1 1
F (x) = exp(−e ) E (X ) = π/ 6 = log n − log log 2
−x λ λ
f (x) = e exp(−e −x ) Median(X ) = − log(log(2))
Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 20 / 24 Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 21 / 24
Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 22 / 24 Statistics 104 (Colin Rundel) Lecture 15 March 14, 2012 23 / 24
Section 4.6 Order Statistics
Maximum of Paretos
This is a distribution we have not seen yet, but is useful for describing many physical
processes. It’s key feature is that it has long tails meaning it goes to 0 slower than a
distribution like the normal.
iid
Let X1 , X2 , . . . , Xn ∼ Pareto(α, k) and an = 0, bn = kn1/α then the cdf of X is
( α
1 − kx if x ≥ k,
FX (x) =
0 otherwise
The cdf of X(n) is then given by
α n
n k
F(n) (x) = F (x) = 1 −
x
α n n
x −α
k
F(n) (an + bn x) = F (an + bn x)n = 1 − = 1 −
kxn1/α n
α
lim F(n) (an + bn x) = e −x
n→∞