
The following question arose today in a Norwegian Facebook forum for math teachers, with some unclear definitions and a lot of simulation.

We have a population of $K$ (regular, 6-sided, equal-probability) dice, which are thrown repeatedly (all at once, independently). Each time a die shows a 6 it is removed from the population before the process continues. What is the distribution of the waiting time until the population size is reduced to $K/2$ (the halving time)? (Or its expectation or median; the question as asked there was unclear about this.) Feel free to assume $K$ is even, or to generalize to the waiting time until the population is reduced to some level $m<K$.
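For concreteness (the original post's definitions were unclear), here is a minimal simulation sketch of the process as I read it; the function name is just for illustration, and I take "reduced to $K/2$" to mean that at most $K/2$ dice remain:

halving_time <- function(K) {
  alive <- K                 # dice still in play
  t <- 0
  while (alive > K / 2) {
    # each remaining die independently shows a 6 with probability 1/6 and is removed
    alive <- alive - rbinom(1, alive, 1/6)
    t <- t + 1
  }
  t                          # number of rolls until the population is halved
}
# empirical distribution for K = 100, say:
# table(replicate(1e4, halving_time(100)))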

I can see many ways to attack this (aside from simulation), but I am especially interested in simple, intuitive ways that could be understood without advanced probability. Markov chains seem a way to go, but the simpler the better. (A bonus question: is there some meaningful limit of this problem when $K \to\infty$?) (I am attempting an answer but will first wait to see ...)

kjetil b halvorsen

1 Answer


For the limit you might be tempted to solve $$\left(\frac{5}{6}\right)^t = \frac{1}{2}$$ giving $$t = \frac{\log(1/2)}{\log(5/6)}\approx 3.8$$

But the interesting part is the deviation from this solution due to random fluctuations (when you have not yet reached the limit) and due to the discreteness.

Regarding the discreteness: in the limit the answer should be 4 instead of 3.8, because you will almost never halve the number of dice within 3 rolls and will almost always need 4 rolls.
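A quick check of both numbers (plain arithmetic, no simulation needed):

# continuous-time solution of (5/6)^t = 1/2
log(0.5) / log(5/6)    # 3.801784

# limiting fraction of dice removed after 3 and 4 rolls
1 - (5/6)^3            # 0.4213 < 1/2, so three rolls almost never suffice for large K
1 - (5/6)^4            # 0.5177 > 1/2, so four rolls almost always suffice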


Related problem 1 (approximate):

You could regard the problem as a Poisson process where you observe events in space and time on a line from $0$ to $K$, divided into $K$ regular intervals. An interval being 'hit' at least once during a one-second roll is the equivalent of a die rolling a 6. The image below shows an example for $K=60$ bins and 20 events that happened in the first second (filling 17 bins/intervals).

[figure: analogous Poisson process]

In order to give this process the equivalent probability that each bin/interval gets filled with at least one event, we need to equate the probability of zero events for a Bernoulli variable and a Poisson variable, $$e^{-\lambda} = 1-\frac{1}{6}$$ or $$\lambda = - \log\left(1-\frac{1}{6}\right) \approx 0.182.$$

The waiting time for a hit (in an interval not previously hit) is exponentially distributed, and the waiting time for multiple hits is a sum of exponentially distributed variables. The tricky part is that the rate decreases after each hit (once a bin is hit, the rate at which a new bin gets hit is lower). Luckily, we can approximate the distribution of a sum of many such variables with a normal distribution.

So the waiting time until $K/2$ 'hits' is distributed as

$$T = \sum_{i=1}^{K/2} X_i$$

where $$X_i \sim \text{Exp}(\lambda (K+1-i))$$

Then, approximating the sums $\sum_{i=1}^{K/2} \frac{1}{\lambda(K+1-i)}$ and $\sum_{i=1}^{K/2} \frac{1}{\lambda^2(K+1-i)^2}$ by integrals, the mean will be approximately $$\mu_T \approx \frac{1}{\lambda} \int_{0.5}^1 \frac{1}{x}\, dx = \frac{\log(2)}{\lambda} \approx 3.8$$ and the variance will be approximately $$\sigma_T^2 \approx \frac{1}{K\lambda^2} \int_{0.5}^1 \frac{1}{x^2}\, dx = \frac{1}{K\lambda^2}.$$
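As a rough check of these integral approximations (a sketch; the variable names are mine and $K$ is assumed even), one can compare them with the exact mean and variance of the exponential sum:

lambda <- -log(1 - 1/6)                  # rate per die, as calibrated above

K <- 100
j <- (K/2 + 1):K                         # the i-th hit occurs at rate lambda*(K+1-i)
mean_exact <- sum(1 / (lambda * j))      # sum of the exponential means
var_exact  <- sum(1 / (lambda * j)^2)    # sum of the exponential variances

c(mean_exact, log(2) / lambda)           # 3.77 vs. 3.80
c(var_exact, 1 / (K * lambda^2))         # 0.296 vs. 0.301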


Related problem 2 (exact solution):

The approach of the accepted answer to the following question

What are the chances rolling 6, 6-sided dice that there will be a 6?

will also work here.

We roll all $K$ dice $n$ times and consider, for each die, the event that it has rolled a '6' at least once, which has probability $p_{\text{individual}} = 1-(5/6)^n$. (Whether a die has already been removed or keeps being rolled makes no difference for this event, so we may imagine all $K$ dice being rolled every time.) Then the number of dice $X_n$ that have rolled a '6' at least once after $n$ rolls is binomially distributed:

$$X_n \sim \text{Binom}\left(K,1-(5/6)^n\right)$$

From this you can compute the probability that more than half of the dice have rolled a '6' within $n$ rolls; its complement, $P(X_n \le K/2)$, is the probability of needing more than $n$ rolls to halve the population. Summing these survival probabilities gives the expected waiting time via $E[T] = \sum_{n \ge 0} P(T > n)$, using the correspondence between the probability for a certain waiting time and the probability for a certain number of events within some time.

Computation:

# Method 2 (exact): E[T] = sum_{n>=0} P(T > n), where
# P(T > n) = P(X_n <= K/2) with X_n ~ Binom(K, 1 - (1-p)^n)
expected <- function(K, p = 1/6) {
  n <- 1:30                               # 30 terms suffice; P(T > 30) is negligible
  pK <- pbinom(K/2, K, 1 - (1 - p)^n)     # survival probabilities P(T > n)
  1 + sum(pK)                             # the leading 1 is the n = 0 term
}
expected <- Vectorize(expected)

K <- seq(10, 100000, 1) * 2
plot(K, expected(K), type = "l", log = "x", ylab = "expected rolls",
     xaxt = "n", main = "method 1 (red) and method 2 (black)",
     xlim = c(10, 10^5), ylim = c(3, 5))

axis(1, at = 10^c(1:5), labels = c("10", "100", "1000", "10000", "100000"))
axis(1, at = 10 * c(c(2:9), c(2:9) * 10, c(2:9) * 100, c(2:9) * 1000),
     labels = rep("", 8 * 4), tck = -0.01)

# Method 1 (normal approximation to the sum of exponentials)
expected2 <- function(K, d = 6) {
  d_eff <- -1 / log(1 - 1/d)              # 1/lambda
  mu    <- -d_eff * log(0.5)              # log(2)/lambda
  v     <- d_eff^2 / K                    # 1/(K lambda^2)
  n     <- 1:30
  pK    <- pnorm(n, mu, sqrt(v))          # approximate P(T <= n)
  1 + sum(1 - pK)
}
expected2 <- Vectorize(expected2)

lines(K, expected2(K), col = 2)
Sextus Empiricus