My friend collects stickers. There are 200 unique stickers to be sticked into the album. As the album gets fuller, it gets less and less likely that any newly obtained sticker is not already in the album.
I assumed every sticker to be obtained equally likely and figured the expected number of stickers necessary to be collected in order to completely fill the album, that is so that 200 unique stickers have been obtained, would be the sum of the expectation of 200 geometric random variables:
$E(n) = \sum_{x=1}^{200} \frac{200}{x} \approx 1176$
- Is this correct?
- I computed the variance, under the (in her case, very unrealistic) assumption of independence, as the sum of the variances of these random variables. Is this correct? $Var(n) = \sum_{x=1}^{200} \frac{1-\frac{x}{200}}{(x/200)^2} \approx 64422$ .
- How would I give her a confidence interval on the number of stickers she'd most likely have to collect to make the album full?
PS: This isn't homework as I'm asking out of curiosity, but I found no appropriate tag and think it kind of looks a bit like homework.