
Given the random variable

$$Y = \max(X_1, X_2, \ldots, X_n)$$

where $X_i$ are IID uniform variables, how do I calculate the PDF of $Y$?

Mascarpone

  • If this is homework, please read the FAQ and update your question accordingly. – cardinal Nov 15 '11 at 20:01
  • Can one use Vandermonde's identity to show the joint function of two order statistics, say $F_y(r) \cdot G_y(r)$? – larry mintz Jun 09 '18 at 21:10
  • Out of interest, what course covers this kind of problem? It is not something that I encountered in my engineering probability course. – Alex Aug 26 '19 at 22:27
  • @Alex What about a statistics course that covers resampling? – SOFe Oct 01 '19 at 13:38

4 Answers


It is possible that this question is homework but I felt this classical elementary probability question was still lacking a complete answer after several months, so I'll give one here.

From the problem statement, we want the distribution of

$$Y = \max \{ X_1, ..., X_n \}$$

where $X_1, ..., X_n$ are iid ${\rm Uniform}(a,b)$. We know that $Y \leq x$ if and only if every element of the sample is at most $x$. This fact, combined with the independence of the $X_i$'s (as indicated in @varty's hint), allows us to deduce

$$ P(Y \leq x) = P(X_1 \leq x, ..., X_n \leq x) = \prod_{i=1}^{n} P(X_i \leq x) = F_{X}(x)^n$$

where $F_{X}(x)$ is the CDF of the ${\rm Uniform}(a,b)$ distribution, namely $F_X(x) = \frac{x-a}{b-a}$ for $x \in (a,b)$. Therefore the CDF of $Y$ is $$F_{Y}(y) = P(Y \leq y) = \begin{cases} 0 & y \leq a \\ \left( \frac{y-a}{b-a} \right)^n & y\in(a,b) \\ 1 & y \geq b \\ \end{cases}$$

Since $Y$ has an absolutely continuous distribution, we can derive its density by differentiating the CDF. Therefore the density of $Y$ is

$$ p_{Y}(y) = \frac{n(y-a)^{n-1}}{(b-a)^{n}}, \qquad y \in (a,b), $$

and $p_Y(y) = 0$ otherwise.

In the special case where $a=0,b=1$, we have $p_{Y}(y)=ny^{n-1}$, which is the density of a Beta distribution with $\alpha=n$ and $\beta=1$: the ${\rm Beta}(n,1)$ density is $\frac{y^{n-1}(1-y)^{0}}{B(n,1)}$, and its normalizing constant is $\frac{1}{B(n,1)} = \frac{\Gamma(n+1)}{\Gamma(n)\Gamma(1)}=\frac{n!}{(n-1)!} = n$, so it reduces to $ny^{n-1}$.
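
As a quick sanity check of the formulas above, here is a minimal simulation sketch (the values of $a$, $b$, and $n$ are illustrative choices, not part of the question) comparing the empirical CDF of the maximum with the derived $F_Y(y) = \left(\frac{y-a}{b-a}\right)^n$:

```python
# Minimal sketch: Monte Carlo check of F_Y(y) = ((y - a) / (b - a))^n
# for Y = max(X_1, ..., X_n), X_i ~ Uniform(a, b) iid.
import numpy as np

rng = np.random.default_rng(0)
a, b, n = 2.0, 5.0, 10                      # illustrative choices
samples = rng.uniform(a, b, size=(100_000, n)).max(axis=1)

ys = np.linspace(a, b, 7)
exact_cdf = ((ys - a) / (b - a)) ** n       # derived CDF
empirical_cdf = (samples[:, None] <= ys).mean(axis=0)

for y, e, m in zip(ys, exact_cdf, empirical_cdf):
    print(f"y = {y:.2f}   exact F_Y(y) = {e:.4f}   empirical = {m:.4f}")
```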

As a note, the values you get if you sort your sample in increasing order, $X_{(1)}, ..., X_{(n)}$, are called the order statistics. A generalization of this answer is that every order statistic of a ${\rm Uniform}(0,1)$ distributed sample has a Beta distribution (the $k$th order statistic is ${\rm Beta}(k, n-k+1)$), as noted in @bnaul's answer.

Macro
  • This actually was a homework question for me. Thanks for the explanation. – Paul P M Mar 26 '16 at 22:20
  • I feel like I should be able to take your insights here and answer [this question](http://math.stackexchange.com/q/1722361/219075), but I'm not seeing how to do that. Can you help me out? Can you recommend a textbook or chapter that speaks to this general issue? –  Mar 31 '16 at 20:38
  • @PaulPM Out of interest, what course covers this kind of problem? It is not something that I encountered in my engineering probability course. – Alex Aug 26 '19 at 22:27
  • @Alex this comes up in MIT's 6.041/6.431, Introduction to Probability (archived on MITx and OCW) – julianstanley Apr 07 '21 at 01:58
  • @julianstanley Hi! Could you specify which lecture of MIT's 6.041/6.431 deals with this concept? I did a quick scan of the class PPTs but could not find them. I might have missed it or something. But I would really appreciate it if you could state the lecture number as I am rushing against a deadline to solve a similar problem. Thanks in advance!! – nashynash Jul 01 '21 at 06:12
  • @nashynash -- Ahh, I might not be much help--I don't think there's a particular lecture. This came up in my section of 6.431 last semester, during PSET5 (I was here while working on that pset, which is when I left the above comment) so I assumed it would also be in the archived material. Generally the psets and textbook cover much more than the lectures, which I felt like just introduced the topics of each week. I think the Bertsekas Intro to Prob text covers it, but not sure where. E.g. problem 5d in Chapter 5 might be helpful? But the explanation above covers it beautifully. – julianstanley Jul 02 '21 at 00:57

The maximum of a sample is one of the order statistics, in particular the $n$th order statistic of the sample $X_1,\dots,X_n$. In general, computing the distribution of order statistics is difficult, as described by the Wikipedia article; for some special distributions, the order statistics are well-known (e.g. for the uniform distribution, which has Beta-distributed order statistics).

EDIT: The Wikipedia article on sample maximum and minimum is also helpful and more specific to your problem.
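
To illustrate the uniform case mentioned above, here is a minimal simulation sketch (it assumes NumPy and SciPy are available; the sample size $n$ and index $k$ are illustrative, not from the answer) comparing the $k$th order statistic of a ${\rm Uniform}(0,1)$ sample with the ${\rm Beta}(k, n-k+1)$ CDF:

```python
# Minimal sketch: the k-th order statistic of n iid Uniform(0,1) draws
# should follow a Beta(k, n - k + 1) distribution.
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
n, k = 8, 3                                  # illustrative choices
x_sorted = np.sort(rng.uniform(size=(50_000, n)), axis=1)
kth = x_sorted[:, k - 1]                     # k-th smallest value in each sample

grid = np.linspace(0.05, 0.95, 5)
empirical = (kth[:, None] <= grid).mean(axis=0)
exact = stats.beta.cdf(grid, k, n - k + 1)

for g, emp, ex in zip(grid, empirical, exact):
    print(f"x = {g:.2f}   empirical = {emp:.4f}   Beta({k},{n - k + 1}) CDF = {ex:.4f}")
```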

bnaul
  • For distributions with densities, computing the marginal distribution of a particular order statistic is quite straightforward. It is even easier for "special" order statistics like the minimum and maximum. – cardinal Nov 15 '11 at 20:03
  • I guess it depends on what is meant by "calculate" in the original question. Certainly doing so numerically is straightforward; I interpreted the question as asking how to find a closed form solution, which is in general not easy. – bnaul Nov 15 '11 at 20:59
  • I meant in a closed form. For the uniform distribution it's not that difficult. I will reply later with the algebraic explanation. I knew that answer, it was just hidden deep in my brain. 4 years deep :P – Mascarpone Nov 15 '11 at 21:12
  • @bnaul: Let $F(x) = \mathbb P(X \leq x)$ be an *arbitrary* distribution function and let $X_1,\ldots,X_n$ be an iid sample from $F$. Let $X_{(k)}$ be the $k$th order statistic. Then $$\mathbb P(X_{(k)} \leq x) = \sum_{m=k}^n \mathbb P( |\{i: X_i \leq x\}| = m) = \sum_{m=k}^n {n \choose m} F(x)^m (1-F(x))^{n-m}.$$ **QED** – cardinal Nov 15 '11 at 21:16 (a numerical check of this identity appears after these comments)
  • Perhaps a way to understand cardinal's answer (given that you understand order statistics for the uniform) is that because CDFs are monotonic 1-to-1 transformations of a uniform CDF, we can always express the event {X – probabilityislogic Feb 14 '12 at 01:01
  • @probabilityislogic: The intuition is good, though it seems you have continuous random variables in mind in your comment. (The result in my second comment above, e.g., works for an arbitrary distribution function.) – cardinal Feb 21 '12 at 13:04
  • @bnaul Thanks for your answer! Is there any theorem that tells us anything about the asymptotic behavior of the two order statistics - maximum and minimum of a finite random sample of size $n,$ as $n\to \infty?$ – Mathmath Aug 06 '20 at 13:51
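
Below is a minimal Monte Carlo sketch of the identity in cardinal's comment above, $\mathbb P(X_{(k)} \leq x) = \sum_{m=k}^n \binom{n}{m} F(x)^m (1-F(x))^{n-m}$. The choice of an exponential distribution for $F$ and the values of $n$, $k$, $x$ are illustrative assumptions, not from the thread; SciPy is assumed to be available.

```python
# Minimal sketch: check P(X_(k) <= x) = sum_{m=k}^{n} C(n, m) F(x)^m (1 - F(x))^(n - m)
# for an arbitrary F, here illustrated with an Exponential(1) distribution.
import numpy as np
from math import comb
from scipy import stats

rng = np.random.default_rng(2)
n, k, x = 6, 4, 1.2                          # illustrative choices

order_stats = np.sort(rng.exponential(size=(200_000, n)), axis=1)
empirical = (order_stats[:, k - 1] <= x).mean()

F = stats.expon.cdf(x)
formula = sum(comb(n, m) * F**m * (1 - F) ** (n - m) for m in range(k, n + 1))

print(f"empirical P(X_(k) <= x) = {empirical:.4f}   formula = {formula:.4f}")
```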

If $F_{Y}(y)$ is the CDF of $Y$, then $$F_Y(y)=\text{Prob}(X_1 \leq y, X_2 \leq y, \ldots, X_n \leq y)$$ You can then use the iid property and the CDF of a uniform variate to compute $F_Y(y)$.

varty

The maximum of a set of IID random variables, when appropriately normalized, will generally converge to one of the three extreme value types. This is Gnedenko's theorem, the analogue of the central limit theorem for extremes. The particular type depends on the tail behavior of the population distribution. Knowing this, you can use the limiting distribution to approximate the distribution of the maximum.

Since the uniform distribution on $[a, b]$ is the subject of this question, Macro has given the exact distribution for any $n$ and a very nice answer. The result is rather trivial. For the normal distribution a nice closed form is not possible, but, appropriately normalized, the maximum for the normal converges to the Gumbel distribution $F(x)=\exp(-e^{-x})$.

For the uniform, the appropriate normalization evaluates the CDF of the maximum at $y = b - x/n$:
$$F^{n}\!\left(b - \frac{x}{n}\right) = \left(1 - \frac{x}{n(b-a)}\right)^{n} \longrightarrow e^{-x/(b-a)} \quad \text{as } n \to \infty,$$
which holds for all $x \geq 0$. Note that, for fixed $n$, $F^{n}(y)$ converges to 1 as $y$ goes to $b$.

In this case it is easy to compare the exact value to its asymptotic limit.
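
For instance, a minimal sketch of that comparison (the values of $a$, $b$, and $x$ are arbitrary illustrative choices, not from the answer) tabulating the exact value $F^{n}(b - x/n) = (1 - x/[n(b-a)])^{n}$ against its limit $e^{-x/(b-a)}$ for increasing $n$:

```python
# Minimal sketch: exact F^n(b - x/n) for the maximum of n Uniform(a, b) draws
# versus its asymptotic limit exp(-x / (b - a)).
from math import exp

a, b, x = 0.0, 2.0, 1.5                      # illustrative choices
limit = exp(-x / (b - a))                    # asymptotic value
for n in (5, 20, 100, 1000):
    exact = (1 - x / (n * (b - a))) ** n     # F^n(b - x/n)
    print(f"n = {n:5d}   exact = {exact:.5f}   limit = {limit:.5f}")
```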

Some references:

  • Gumbel's book
  • Galambos' book
  • Leadbetter's book
  • Novak's book
  • Coles' book

Michael R. Chernick
  • For this answer to be practicable, you need to stipulate, in detail, how one "appropriately normalizes" the values and you also need to provide some way to estimate how large $n$ must be before the asymptotic formula becomes a reliable approximation. – whuber Jul 20 '12 at 15:27
  • @whuber Anyone can look at Gnedenko's theorem to see the normalization. Equally important are the tail characteristics that determine which of the three types applies. The theorem generalizes to stationary stochastic processes. Anyone who wants to know the nitty-gritty details can look at Leadbetter's book or my PhD thesis. How large $n$ must be is a difficult question to answer for any form of asymptotics. I guess the Berry–Esseen theorem helps for the central limit theorem; I don't know what is comparable for extremes. – Michael R. Chernick Jul 20 '12 at 16:57