119

The question (slightly modified) goes as follows and if you have never encountered it before you can check it in example 6a, chapter 2, of Sheldon Ross' A First Course in Probability:

Suppose that we possess an infinitely large urn and an infinite collection of balls labeled ball number 1, number 2, number 3, and so on. Consider an experiment performed as follows: At 1 minute to 12 P.M., balls numbered 1 through 10 are placed in the urn and one ball removed at random. (Assume that the withdrawal takes no time.) At 1/2 minute to 12 P.M., balls numbered 11 through 20 are placed in the urn and another ball removed at random. At 1/4 minute to 12P.M., balls numbered 21 through 30 are placed in the urn and another ball removed at random... and so on. The question of interest is, How many balls are in the urn at 12 P.M.?

This question, as it's posed, forces basically everyone to get it wrong --- usually the intuition is to say there will be infinitely many balls at 12 P.M. The answer provided by Ross, however, is that with probability one the urn will be empty at 12 P.M.

When teaching probability theory this problem is one of those for which is very hard to give a good intuitive explanation.

On the one hand, you could try to explain it like this: "think of the probability of any ball i being on the urn at 12 P.M. During the infinite random draws, it will eventually be removed. Since this holds for all balls, none of them can be there at the end".

However, students will correctly argue with you: "but I'm putting 10 balls and removing 1 ball at each time. It's impossible there will be zero balls at the end".

What's the best explanation we can give to them to solve these conflicting intuitions?

I'm also open to the argument the question is ill-posed and that if we formulate it better the "paradox" disappears or to the argument that the paradox is "purely mathematical" (but please try to be precise about it).

Carlos Cinelli
  • 10,500
  • 5
  • 42
  • 77
  • 6
    +1. I like the version where the urn begins with $2$ balls (and one is removed), then another $4$ are added (and one is removed), then another $8$ are added, etc. :-) @Neil What is that argument, exactly? Could you sketch it? – whuber Nov 24 '17 at 18:49
  • 16
    Many of the misconceptions and much of the confusion about probability comes from problems of limits and infinities. This is an excellent example of that as @enumaris's answer explains well. It is also an excellent example of a textbook example that will only lead students to the conclusion that they cannot succeed in the subject. – Michael Lew Nov 24 '17 at 20:18
  • 16
    While it's clear that each particular ball has probability zero of being in the urn at midnight, it's not obvious to me that there is a well-defined probability distribution on the set of patterns of which balls are left at midnight, or there is a well-defined probability distribution on the variable "how many balls at midnight?". –  Nov 25 '17 at 04:04
  • 15
    Or more precisely, the sample space here is the infinite sequences of choices of which ball is removed at which time. It's not obvious there is a reasonable $\sigma$-algebra on the sample space for which "how many balls at midnight?" is a measurable function. –  Nov 25 '17 at 04:58
  • 5
    "Infinity is not a real number" is probably the key. Note that the problem uses "an infinitely large urn", "an infinite collection of balls", and uses infinite time slots (in Zeno's style). Three infinities in the same question is a sure sign of problems to come. – Rolazaro Azeveires Nov 25 '17 at 14:55
  • 5
    Fascinating reading, all this, but as a layman I would observe that a calculated probability of zero does not mean there would be zero balls. The error to me is a philosophical one where two concepts have been equated. All that has actually happened at midday is that an infinite set of numbers has been shuffled and split into two sets, with a zero probability that any one particular number is in either. – Sentinel Nov 25 '17 at 15:31
  • 4
    "As the number of balls in the urn increases without bound, the probability that any particular ball will be chosen to be removed *also* goes to zero. But that's the same for all balls, and therefore no balls will be removed." That argument is obviously nonsense, but it seems to me to be identical to the argument that all the balls will be removed. If my argument is bad then why is the Ross argument good? – Eric Lippert Nov 25 '17 at 15:47
  • 5
    There has been 10+ answers and probably 100+ comments in this thread by now, but it seems that most of the people did not bother to look in the Ross'es book (when I google the title I get a direct link to PDF among the first few results). The presentation there is very clear. In particular, Ross starts with two non-probabilistic variations, which lead to either infinite or zero balls at midnight. Before this is understood, it does not make sense to proceed to the probabilistic variant. But it seems that many disputants here are disagreeing about these two *preliminary* cases. – amoeba Nov 25 '17 at 20:27
  • Comments are not for extended discussion; this conversation has been [moved to chat](https://chat.stackexchange.com/rooms/76768/discussion-on-question-by-carlos-cinelli-at-each-step-of-a-limiting-infinite-pro). – whuber Apr 29 '18 at 21:41
  • Infinite series that are alternating and whose absolute value is infinite are at best conditionally convergent. Conditional convergence means that the reorder of addition can yield a result of zero, a number or infinity. See [Agana MJ (2015) The classical theory of rearrangements. Master of Science in Mathematics Thesis, Boise State University](https://scholarworks.boisestate.edu/td/1039/). – Carl Jun 29 '21 at 08:07

20 Answers20

143

Ross describes three versions of this "paradox" in the Example 6a in his textbook. In each version, 10 balls are added to the urn and 1 ball is removed at each step of the procedure.

  1. In the first version, $10n$-th ball is removed at the $n$-th step. There are infinitely many balls left after midnight because all balls with numbers not ending in zero are still in there.

  2. In the second version, $n$-th ball is removed at the $n$-th step. There are zero balls left after midnight because each ball is eventually going to be removed at the corresponding step.

  3. In the third version, balls are removed uniformly at random. Ross computes the probability of each ball to be removed by step $n$ and finds that it converges to $1$ as $n\to\infty$ (note that this is not evident! one actually has to perform the computation). This means, by Boole's inequality, that the probability of having zero balls in the end is also $1$.

You are saying that this last conclusion is not intuitive and hard to explain; this is wonderfully supported by many confused answers and comments in this very thread. However, the conclusion of the second version is exactly as un-intuitive! And it has absolutely nothing to do with probability or statistics. I think that after one accepts the second version, there is nothing particularly surprising about the third version anymore.

So whereas the "probabilistic" discussion must be about the third version [see very insightful answers by @paw88789, @Paul, and @ekvall], the "philosophical" discussion should rather focus on the second version which is much easier and is similar in spirit to the Hilbert's hotel.


The second version is known as the Ross-Littlewood paradox. I link to the Wikipedia page, but the discussion there is horribly confusing and I do not recommend reading it at all. Instead, take a look at this MathOverflow thread from years ago. It is closed by now but contains several very perceptive answers. A short summary of the answers that I find most crucial is as follows.

We can define a set $S_n$ of the balls present in the urn after step $n$. We have that $S_1=\{2,\ldots 10\}$, $S_2=\{3,\ldots 20\}$, etc. There is a mathematically well-defined notion of the limit of a sequence of sets and one can rigorously prove that the limit of this sequence exists and is the empty set $\varnothing$. Indeed, what balls can be in the limit set? Only the ones that are never removed. But every ball is eventually removed. So the limit is empty. We can write $S_n \to \varnothing$.

At the same time, the number $|S_n|$ of the balls in the set $S_n$, also known as the cardinality of this set, is equal to $10n-n=9n$. The sequence $9n$ is obviously diverging, meaning that the cardinality converges to the cardinality of $\mathbb N$, also known as aleph-zero $\aleph_0$. So we can write that $|S_n|\to \aleph_0$.

The "paradox" now is that these two statements seem to contradict each other: \begin{align} S_n &\to \varnothing \\ |S_n| &\to \aleph_0 \ne 0 \end{align}

But of course there is no real paradox and no contradiction. Nobody said that taking cardinality is a "continuous" operation on sets, so we cannot exchange it with the limit:$$\lim |S_n| \ne |\lim S_n|.$$ In other words, from the fact that $|S_n|=9n$ for all integer $n\in \mathbb N$ we cannot conclude that $|S_\omega|$ (the value at the first ordinal) is equal to $\infty$. Instead, $|S_\omega|$ has to be computed directly and turns out to be zero.

So I think what we get out of this really is the conclusion that taking cardinalities is a discontinous operation... [@HarryAltman]

So I think this paradox is just the human tendency to assume that "simple" operations are continuous. [@NateEldredge]


This is easier to understand with functions instead of sets. Consider a characteristic (aka indicator) function $f_n(x)$ of set $S_n$ which is defined to be equal to one on the $[n, 10n]$ interval and zero elsewhere. The first ten functions look like that (compare the ASCII art from @Hurkyl's answer):

$\quad\quad\quad$Indicator functions for the first 10 steps

Everybody will agree that for each point $a\in\mathbb R$, we have $\lim f_n(a) = 0$. This by definition means that functions $f_n(x)$ converge to the function $g(x)=0$. Again, everybody will agree to that. However, observe that the integrals of these functions $\int_0^\infty f(x)dx = 9n$ get larger and larger and the sequence of integrals diverges. In other words,

$$\lim\int f_n(x)dx \ne \int \lim f_n(x) dx.$$

This is a completely standard and familiar analysis result. But it is an exact reformulation of our paradox!

A good way to formalize the problem is to describe the state of the jug not as a set (a subset of $\mathbb N$), because those are hard to take limits of, but as its characteristic function. The first "paradox" is that pointwise limits are not the same as uniform limits. [@TheoJohnson-Freyd]

The crucial point is that "at midnight noon" the whole infinite sequence has already passed, i.e. we made a "trasfinite jump" and arrived to the transfinite state $f_\omega = \lim f_n(x)$. The value of the integral "at midnight noon" has to be the value of the integral of $\lim f_n$, not the other way around.


Please note that some of the answers in this thread are misleading despite being highly upvoted.

In particular, @cmaster computes $\lim_{n\to\infty} \operatorname{ballCount}(S_n)$ which is indeed infinite, but this is not what the paradox asks about. The paradox asks about what happens after the whole infinite sequence of steps; this is a transfinite construction and so we need to be computing $\operatorname{ballCount}(S_\omega)$ which is equal to zero as explained above.

MaxW
  • 370
  • 2
  • 9
amoeba
  • 93,463
  • 28
  • 275
  • 317
  • 8
    Your answer along with @paw88789's answer seems enough to solve conflicting intuitions. Basically one can say: (i) your intuition will fail because cardinality is not continuous; and, (ii) if the physical analogy bothers you, think about the following question: is the "removal" function $f:\mathbb{N} \rightarrow \mathbb{N}$ surjective? In the probabilistic version, what's the probability we pick a surjective map? Of course, there's still the matter of whether these objects can model any real phenomena, but that's a different problem. Overall, I appreciate Ross example even more now. – Carlos Cinelli Nov 26 '17 at 01:58
  • 3
    You can make this problem even more paradoxical if you have *two* labels on the balls. Take your version 2 of the problem, but now suppose that you also put stickers with numbers on each ball, and move the stickers in each stage, such that at stage $n$ the balls numbered $\{n+1,\dots,10n\}$ are in the urn, where ball $n+i$ has sticker $i$, i.e. the stickers read $\{1,\dots,9n\}$ in the same order as the balls. Now we must conclude that at midnight, every ball has been removed, but you have an infinite number of stickers attached to the missing balls! – Mario Carneiro Nov 26 '17 at 09:07
  • 12
    @MichaelLew There are many counter-intuitive results in mathematics, and this is one of them. A sequence of sets S1={2,...10}, S2={3,...20}, etc. converges to the empty set even though each subsequent set has more elements than the previous one. This is just how it is. Please note that the formulation of the paradox asks what happens after the *infinite* number of steps. Clearly such a setup does not have any connection to the physical world; it is a mathematical abstraction, and has to be approached as such. [cont.] – amoeba Nov 26 '17 at 20:22
  • 6
    [cont.] Intuitions can fail when dealing with infinities, so one has to rely on mathematical rigour. Perhaps this reformulation will help you: consider a sequence of functions where n-th function is zero everywhere apart from an interval [n+1, 10n]. This sequence converges to a function that is constant zero, even though each subsequent function has a longer non-zero interval. Most of us are more familiar with convergence of functions than with convergence of sets, so this reformulation might be easier to understand. – amoeba Nov 26 '17 at 20:22
  • 3
    Are you exercising Socratic method @Martijn? Please cut to the chase. Obviously this sequence of integrals diverges. – amoeba Nov 26 '17 at 21:15
  • 6
    @Martijn The functions $f_n(x)=I([n+1, 10n])$ converge to $g(x) =0$ because for each point $a\in\mathbb R$ it is true that $f_n(a)=0$ for all $n>a$, i.e. [by definition](https://en.wikipedia.org/wiki/Pointwise_convergence#Definition). At the same time, the sequence of integrals $\int f_n$ diverges because $\int f_n=9n-1\to\infty$. This is not a contradiction because $\lim\int \ne \int\lim$. One can exchange them only when so called *uniform convergence* holds which is a much stronger condition than simple (pointwise) convergence. This is alluded to in https://mathoverflow.net/a/7113. – amoeba Nov 26 '17 at 21:46
  • 4
    If you formulate the problem purely mathematically, then go right ahead. "Using these sets of axioms, what is the result of this computation?" The answer is 0, no problem. When you say "I'm putting balls in urns" you are fundamentally making a mapping of ACTIONS to MATHEMATICAL OPERATIONS are you not? In this case, the "break down" between intuitions is how we are mapping the actions to the math. To then claim that student's intuitions are "wrong" because they aren't performing the same mapping that you are is ludicrous because there's no way to prove which mapping is actually correct. – enumaris Nov 26 '17 at 22:10
  • 3
    @enumaris Philosophically I can see where you are coming from, but it would be weird if a student, after giving an answer "3h" to the question "How long will it take a car going with 60 km/h to get from A to B if the distance between them is 120 km?", would start defending their solution by saying that they mapped these words/actions to mathematical operations in their own way and that the examiner's mapping is not more correct. I don't think any examiner would be impressed. – amoeba Nov 26 '17 at 22:16
  • 4
    The entire argument relies on this statement "The crucial point is that 'at midnight' the whole infinite sequence has already passed". That statement is patently false. It is impossible for an infinite sequence to pass. Thus, this entire post can be hand-waved away just like people are hand-waving away the correct answers. The fact is that infinity is not real. At best the rules defined for it work in most situations, but since it is not real it doesn't work in all situations and will lead to false conclusions in some situations, as is the case in this question. – Dunk Nov 27 '17 at 20:59
  • 3
    @Dunk You're essentially re-arguing Zeno's paradox. Do you claim that in moving from point 0 to point 1 along the real line, it is impossible to pass the whole infinite sequence [1/2, 3/4, 7/8, 15/16, ...]? _That_ claim is patently false. Similarly, in the problem, all of the removal/addition steps will have been completed by midnight because _they're all stated to occur before midnight_ and to take no time. – Aaron Novstrup Nov 27 '17 at 22:09
  • 3
    Analogously, there are metric spaces in which the sequence $f_n$ has no limit, because these spaces implicitly require uniform convergence. If you are modeling some problem in the real world, you may prefer to work in such a metric space for physical reasons, and in that case you could quite rigorously say that $f_n$ has no limit. Whether you believe this urn question has an answer or not depends on what mathematical structure you believe to be an appropriate model for the process described in the question. – Paul Nov 28 '17 at 02:00
  • 3
    @amoeba minor point - uniform limit vs. pointwise limit is not the relevant distinction here, as Lebesgue measure can still "escape to infinity" in uniform limits. To ensure interchange of the integral and limit you need something like Lebesgue's dominated convergence theorem. – Paul Nov 28 '17 at 04:00
  • 7
    Another way to explain this, is to ask the following: Are there more even numbers or natural numbers? Even though in any finite interval there are more natural numbers, they actually have the same cardinality. After that, are there more multiples of $10$ or natural numbers? Again, most people agree they have the same cardinality. Therefore, you add a "natural numbers" amount of balls, but you remove a "multiple of 10 amount of balls" - they have same cardinality, so in the end the urn is empty. (I know the analogy doesn't hold exactly, like ross 1st version shows, but it gives some intuition) – Ant Nov 28 '17 at 10:33
  • 3
    (-1) After due consideration. Any interpretation of an indeterminate form can be made to look reasonable by hiding the fact that an indeterminate form is being treated deterministically. This answer misses that point entirely. There is no paradox, and there is no numerical answer. – Carl Dec 02 '17 at 23:45
  • Comments are not for extended discussion; this conversation has been [moved to chat](https://chat.stackexchange.com/rooms/76767/discussion-on-answer-by-amoeba-at-each-step-of-a-limiting-infinite-process-put). – whuber Apr 29 '18 at 21:37
28

Hurkyl (in an answer) and Dilip Sarwate (in a comment) give two common deterministic variants of this puzzle. In both variants, at step $k$, balls $10k-9$ through $10k$ are added to the pile ($k=1,2,...$).

In Hurkyl's variation, ball $k$ is removed. In this variant, in can be definitively argued that there are no balls left because ball $n$ is removed at step $n$.

In Dilip Sarwate's variation, ball $10k$ is removed at step $k$, and so in this variant, all balls that are not multiples of $10$ remain. In this variant, there are infinitely many balls in the urn at the end.

With these two variants as edge cases, we see that lots of different things can happen when doing this process. For instance, you could arrange to have any finite set of balls remaining at the end, by doing Hurkyl's process but skipping the removal of certain balls. In fact for any set $B$ with countably infinite complement (in the (positive) natural numbers), you can have that set of balls remaining at the end of the process.

We could look at the random variation of the problem (given in the original post) as selecting a function $f:\mathbb{N}\to\mathbb{N}$ with the conditions that (i) $f$ is one-to-one and (ii) $f(k)\le 10k$ for all $k\in \mathbb{N}$.

The argument given in the Sheldon Ross book (referenced in the post) shows that almost all (in the probabilistic sense) such functions are in fact onto functions (surjections).

I see this as being somewhat analogous to the situation of selecting a number, $x$ from a uniform distribution on $[0,1]$ and asking what is the probability that the number is in the Cantor set (I am using the Cantor set rather than say the rational numbers because the Cantor set is uncountable). The probability is $0$ even though there are many (uncountably many) numbers in the Cantor set that could have been chosen. In the ball removing problem, the set of sequences in which there are any balls left is playing the role of the Cantor set.


Edit: BenMillwood correctly points out that there are some finite sets of balls that cannot be the remaining set. For instance, $1,2,...,10$ cannot be the remaining set. You can have at most $90\%$ of the first $10n$ balls remaining for $n=1,2,3,...$.

amoeba
  • 93,463
  • 28
  • 275
  • 317
paw88789
  • 389
  • 2
  • 5
  • 4
    You can't have *any* finite set of balls remaining at the end -- e.g. you can't have the set 1..10. – Ben Millwood Nov 25 '17 at 05:05
  • 1
    "The argument given in the Sheldon Ross book (referenced in the post) shows that almost all (in the probabilistic sense) such functions are in fact onto functions (surjections)." -- (+1) this is a very interesting way to look at the problem, and it actually might be easier and less confusing to present it as such than with the "physical story" of balls in an urn. – Carlos Cinelli Nov 25 '17 at 06:31
  • 5
    +1. I think this is currently the only answer that has actually any bearing on the problem. Everybody else seems to be discussing whether or not there will be zero balls left if on the n-th step ball #n is removed. In other words, most of the discussion that I see in this thread is actually about the 2nd paragraph of your answer and does not move any further than that. Cc to @CarlosCinelli. – amoeba Nov 25 '17 at 21:13
  • 3
    This is actually the first answer to really make me understand what is the reasoning behind a result. You show how result we obtain is connected with choice function we apply -- that makes perfect sense and helps to move further than just to accept that amount could be zero because of cardinality not being contigious. – sukhmel Nov 27 '17 at 11:58
  • (+1) I like this answer because the indeterminate nature of specious arguments based on indeterminate forms is better suggested. This can be made a lot simpler by saying that $0\times\infty$ is an indeterminate form and be done with it. Also, see my answer below which argues this more directly. – Carl Dec 02 '17 at 23:26
  • @Carl You upvoted and agreed with this answer saying (in the 2nd paragraph) that in the deterministic version when the $n$-th ball is removed at the $n$-st step "in can be definitively argued that there are no balls left", but you downvoted and disagreed with my answer saying exactly the same thing. This does not make any sense to me. – amoeba Dec 06 '17 at 09:43
  • @amoeba Gee, you got me there. I look for answers I can upvote, and I found something I liked. I tend to be generous and only very rarely downvote. Make you a deal, explain why our answers are different, and I'll change my opinion. – Carl Dec 06 '17 at 13:04
24

Enumaris' answer is perfectly right on the diverging limits problem. Nevertheless, the question can actually be answered in an unambiguous way. So, my answer will show you precisely where the zero balls solution goes wrong, and why the intuitive solution is the correct one.


It is true, that for any ball $n$, the probability of it being in the urn at the end $P(n)$ is zero. To be precise, it's only the limit that's zero: $P(n) = \lim_{N\to\infty} P(n, N) = 0$.

Now, you try to compute the sum $$\lim_{N\to\infty} \operatorname{ballCount}(N) = \lim_{N\to\infty} \sum_{n=1}^{n\leq 10N} P(n,N).$$ The broken calculation jumps right in to that $P(n,N)$ part, saying that's zero in the limit, so the sum contains only terms of zero, so the sum is zero itself: $$\begin{align} \lim_{N\to\infty}\operatorname{ballCount}(N) &= \lim_{N\to\infty} \sum_{n=1}^{n\leq 10N} P(n,N) \\ \text{broken step here }\longrightarrow &= \lim_{N\to\infty} \sum_{n=1}^{n\leq 10N} \lim_{N\to\infty} P(n,N) \\ &= \lim_{N\to\infty} \sum_{n=1}^{n\leq 10N} P(n) \\ &= \lim_{N\to\infty} \sum_{n=1}^{n\leq 10N} 0 \\ &= \lim_{N\to\infty} 10 N\times 0 \\ &= 0 \end{align}$$

However, this is illegally splitting the $\lim$ into two independent parts. You cannot simply move the $\lim$ into the sum if the bounds of the sum depend on the parameter of the $\lim$. You must solve the $\lim$ as a whole.

Thus, the only valid way to solve this $\lim$ is to solve the sum first, using the fact that $\sum_{n=1}^{n\leq 10N} P(n,N) = 9N$ for any finite $N$. $$\begin{align} \lim_{N\to\infty} \operatorname{ballCount}(N) &= \lim_{N\to\infty} \sum_{n=1}^{n\leq 10N} P(n,N) \\ &= \lim_{N\to\infty} 9N \\ &= \infty \end{align}$$

The intuitive solution did precisely that, it's the "clever" solution that's fundamentally broken.

David Z
  • 236
  • 1
  • 8
  • 10
    That formulates the paradox, for sure. It amounts to this: Asserting that infinitely many balls remain raises the natural question: *which balls?* Can you name a single ball that has a nonzero chance of remaining? If not, then it seems the countable additivity axiom implies no balls remain, because there are only countably many balls. Thus, by claiming the intuitive solution is correct, you are implicitly denying a fundamental axiom of probability. – whuber Nov 24 '17 at 22:36
  • 13
    @whuber I don't need to name a ball with non-zero probability: I have infinitely many balls. And the limit of the product of two things, with one going to zero and the other to infinity, can be anything. It can be zero, it can be infinity, it can be anything in between (like 42). That depends on how the product behaves as a whole. It's the same kind of "paradox" that makes any point within a distribution in R have zero probability - it's only intervals of infinitely many points that have a non-zero probability to occur. **There is really no paradox in the mathematical sense.** – cmaster - reinstate monica Nov 24 '17 at 22:49
  • 6
    You have to do the mathematics correctly before you can claim no paradox. Let me illustrate. $\mathbb{N}=\{0,1,2,\ldots\}$ is the set of natural numbers. Consider the sequence of sets in which at step $i=0,1,2,\ldots$ all the numbers from $0$ through $i$ have been removed. At each step infinitely many numbers remain. How many numbers remain in the limit? Your "only valid way," if I interpret it correctly, would answer "infinitely many" because "$\lim_{n\to\infty}\infty=\ldots=\infty$." The fact that the limit is empty is strong evidence that your approach is mathematically suspect. – whuber Nov 24 '17 at 22:56
  • 7
    @Michael Unfortunately, that's a miscalculation. Each ball's chance of remaining in the limit is $0$. – whuber Nov 24 '17 at 22:59
  • 6
    @Michael You have no basis for asserting that correct mathematical reasoning leads to a wrong answer--apart from your assertion that it must be wrong. – whuber Nov 24 '17 at 23:07
  • 3
    @Michael When working with cardinal infinities, that statement isn't meaningful. Nine times $\aleph_0$ is still $\aleph_0$. I don't deny it, but it isn't even relevant! – whuber Nov 25 '17 at 00:21
  • 6
    @whuber I find the argument "Asserting that infinitely many balls remain raises the natural question: which balls? Can you name a single ball that has a nonzero chance of remaining?" a bit suspect. Imagine a slightly different problem. We start with one ball and at each time the existing ball is removed and a new one is added, with probability one. The replacement takes no time. I think we all agree that (with probability one) there is exactly one ball at any time. It is also true that any individual ball has zero probability of remaining. – Luca Citi Nov 25 '17 at 00:38
  • 13
    Just commenting here again to make sure people are aware this answer is incorrect. @cvote you should read Ross' argument, your answer does not address his derivation at all. – Carlos Cinelli Nov 25 '17 at 06:07
  • 4
    Ross should have added a pseudo algorithmic implementation if he wanted to avoid confusion. Crossing his t's and dotting his i's, I get: ` set.seed(123) urn – user603 Nov 25 '17 at 10:31
  • 5
    @Carlos_Cinelli "to make sure people are aware this answer is incorrect". I think we are still debating. I can't see how you can assert so firmly that this answer is wrong. – Luca Citi Nov 25 '17 at 13:52
  • 3
    @LucaCiti this answer performs a wrong derivation to get zero and claims this is the source of the problem for the answer "zero". Then it presumes without any proof and further thinking that a particular limit expression should be the correct answer to the question and simply states the limit diverges. It further states there's no mathematical paradox based on these two derivations --- the answer is incorrect in basically all levels. – Carlos Cinelli Nov 25 '17 at 19:43
  • 3
    Most of this answer seems to rest on a misunderstanding of how to derive the result that there are no balls in the urn at time 12, with probability 1. The fact that an irrelevant calculation with a "broken step" can lead to the same result doesn't mean the result itself is wrong. – ekvall Nov 25 '17 at 19:53
  • 3
    @CarlosCinelli The gist of the answer is that taking the limit as $N$ (current step) goes to infinity of the probability of survival of the ball of index $n$ _then_ taking the limit as $n$ goes to infinity is misleading. I agree with this view, see my answer below. I think one of the issues is the second step, i.e. that if the limit of $P(n,N)$ is zero as $N\to\infty$ for all $n$, then the urn must be empty. This conclusion seems to rely on sigma additivity, which may not apply to a probability over $\mathbb{N}$ just like a uniform probability over $\mathbb{N}$ is improper. – Luca Citi Nov 25 '17 at 20:19
  • 6
    @CarlosCinelli It is true that $$\int_x^x{1dt} = 0$$. This holds for all $$x\in [0,1]$$. So, is it true that $$\int_0^1{1dt} = \int_0^1{\int_x^x{1dt}dx} = \int_0^1{0dx} = 0$$? No way. The correct solution is $$\int_0^1{1dt} = 1$$. This is really a fully analogous situation: You have an infinite sum over terms that are all individually zero. Yet the sum is not zero because it sums over *infinitely* many quantities that only **approach** zero. If Ross' argument that the urn is empty at twelve is correct, then he is a genius, and all analysis that builds on top of integrals needs to be rewritten… – cmaster - reinstate monica Nov 25 '17 at 23:08
  • 4
    @cmaster the derivation for the answer zero does not involve any of this, it's as simple as the following: let $F_i$ be the event "ball i at 12 P.M.". The event "at least one ball at 12 P.M." is $\cup_{i \in \mathbb{N}} F_i$. If $P(F_i) = 0$ then $P(\cup_{i \in \mathbb{N}} F_i) \leq \sum_{i \in \mathbb{N}}P(F_i) = 0$. – Carlos Cinelli Nov 25 '17 at 23:23
  • 4
    @cmaster Have you not read the example in Ross yet? $P(F_i) = 0$ is rigorously proven. I don't understand why you keep making statements that are blatantly false. – ekvall Nov 26 '17 at 12:39
  • 5
    @cmaster Good, so then we are in agreement. Since the event that there is *any* ball in the urn at time 12 can be written $F = \cup_{i = 1}^\infty F_i$, we have by sub-additivity of measures that $0\leq P(F) \leq \sum_{i = 1}^\infty P(F_i) = 0$. Having now seen the real derivation of the result (I assume you've read Ross by now), you should edit your answer and remove the part that claims it's the result of a "broken step". – ekvall Nov 26 '17 at 21:36
  • 4
    And just to be clear: yes, the step you illustrate in your answer is incorrect. But that step is used nowhere in the rigorous proof that there are no balls in the urn at 12, almost surely – ekvall Nov 26 '17 at 21:41
  • Comments are not for extended discussion; this conversation has been [moved to chat](https://chat.stackexchange.com/rooms/76769/discussion-on-answer-by-cmaster-at-each-step-of-a-limiting-infinite-process-put). – whuber Apr 29 '18 at 21:43
14

This argument is focused on the tendency for infinite sets and sequences to behave in unitnuitive ways. This is no more surprising than the Hilbert Hotel. In such a case, you will indeed have taken out an infinite number of balls, but you will have put an infinite number in. Consider the Hilbert Hotel in reverse. You can remove an infinite number of guests from the hotel, and still have an infinite number left.

Whether this is physically realizable is another question entirely.

As such, I would consider it not necessarily ill formed, but rather put in the wrong book. This sort of counting question belongs in a set theory course, not a probability course.

Cort Ammon
  • 547
  • 2
  • 5
  • 2
    The argument given to support an answer of 0 is more sophisticated than just "infinity minus infinity is zero" so I don't think this answer really addresses it. You can also remove an infinite number of guests from the hotel and have zero left, and in some sense the challenge here is to work out which one you've done. It's by no means obvious that set theory has the answer to that question and probability theory doesn't. – Ben Millwood Nov 25 '17 at 05:31
  • 3
    @BenMillwood Which would be why I argue that this puzzle belongs in a set theory book, rather than a probability book. – Cort Ammon Nov 25 '17 at 05:35
14

I think it helps to remove the superfluous temporal component of the problem.

The more basic variant of this paradox is to always remove the lowest numbered ball. For ease of drawing, I will also only add two balls at each step.

The procedure describes how to fill out an infinite two-dimensional grid:

.*........
..**......
...***....  ....
....****..
.....*****

 :  :  :
 :  :  :

where each row is formed from the previous by adding two asterisks on the right then removing the leftmost.

The questions one then asks are:

How many columns end with repeated asterisks rather than repeated dots?

In my opinion, the idea to mistakenly equate this result with "the limit of the number of asterisks in each row" is much less compelling.

  • 1
    I like your approach but I don't get your conclusion. The question asks "How many balls are in the urn at 12 P.M.?", which in my opinion translates exactly into "the limit of the number of asterisks in each row". – Luca Citi Nov 25 '17 at 10:02
  • 2
    @LucaCiti: Which balls are in the urn? The ones corresponding to the columns that end with repeated astrisks. How many columns end in repeated astrisks? None. –  Nov 25 '17 at 14:54
  • 3
    Asking which balls is not the same as asking how many. – Sentinel Nov 25 '17 at 15:46
  • 2
    @Hurkyl I think we don't care which balls are in the urn but how many. My answer below suggests that the balls that are in the urn tend to be those that were put last. In particular at step $N $ the last $N/2$ have probability higher that 7/8 of being still in the urn. This number grows as $O (N)$. – Luca Citi Nov 25 '17 at 16:23
  • 3
    @LucaCiti: How many columns end in asterisks? None. That is the *specific* question that Ross means to ask of this diagram. (in fact, part of the whole point of phrasing the problem this way is to make it clear what *specific* question is being asked) –  Nov 25 '17 at 16:35
  • 1
    ... Actually, the last $10\cdot N/2$ balls. – Luca Citi Nov 25 '17 at 16:38
  • 1
    @Hurkyl Sorry but the text cited in the yellow box asks "How many balls are in the urn at 12 P.M.?" – Luca Citi Nov 25 '17 at 16:40
  • 5
    @Hurkyl The question that has practical applications and is IMHO more meaningful is how many balls not which ones. Consider a room with an open window. At all times molecules of oxygen enter and leave the room. The probability that a molecule that entered at finite time $t $ is still in the room at time $T $ goes to zero as $T\to\infty $. This doesn't mean that the room will be depleted of oxygen as $T\to\infty $. – Luca Citi Nov 25 '17 at 16:43
  • 4
    @LucaCiti: I suppose it wasn't clear, but the grid extends infinitely downwards and to the right. There is no "last". Yes, that is what the text in the yellow box says -- the formalization I give in my post is what was meant by that text. This is a standard problem, and Ross's actual analysis agrees with my formalization. You can ask a *different* question, but that will be a *different* problem. –  Nov 25 '17 at 16:55
  • 1
    @LucaCiti: Also, oxygen molecules move about in a smooth continuous way (albeit rather erratically). The problem about the balls in urns, if it were to happen in a hypothetical universe, would be highly discontinuous at midnight. This is another source of misconceptions about the problem. –  Nov 25 '17 at 16:57
  • 1
    +1, I really like your ASCII-art illustration. I posted my own answer now where I try to explain why the limit of the number of asterisks in each row (this number clearly grows to infinity) is not the same as the number of asterisks *after the infinite number of rows* (note that we are making a [transfinite step](https://en.wikipedia.org/wiki/Transfinite_induction) here). That's what @LucaCiti asked in the first comment above. What all of this actually shows is that taking the number of asterisks is not a continuous operation, so the limit of the number is not equal to the number of the limit. – amoeba Nov 26 '17 at 00:03
  • 1
    You mention removing the _temporal_ component of the problem, but I actually found it more useful to keep that and remove the _probabilistic_ component (as you and @amoeba both did). It's easy to show that for any given ball $n$, that ball will have been removed by the time $midnight-\epsilon$ if $\epsilon$ is sufficiently small. Thus, the number of balls that remain _at_ midnight is clearly zero. – Aaron Novstrup Nov 26 '17 at 00:17
  • 1
    @AaronNovstrup At time 12pm-$\epsilon $ you cannot have removed all balls. First of all because at each finite time you only have removed one tenth of the balls added so far. Second, because you cannot have removed the balls that you haven't added yet and that you are going to add between 12pm-$\epsilon $ and 12pm. – Luca Citi Nov 26 '17 at 09:27
  • 2
    Anyway, I find amoeba's answer convincing. My error was that I was assuming the cardinality to be a continuous operation while apparently it is not. I find his/her expansion of the paradox very useful. – Luca Citi Nov 26 '17 at 09:59
  • @LucaCiti You may have misunderstood what I was saying. For every ball $b$, there is an $\epsilon_b$ such that you will have (added and) removed ball $b$ by 12pm-$\epsilon_b$. – Aaron Novstrup Nov 27 '17 at 18:46
  • If you're going to rephrase the problem, you could also rephrase it as always removing the highest number ball. In that formulation, there's infinitely many balls left. Now, given that two (apparently equivalent) reformulations of the problem give different results from the randomly-picked version, which is the correct reformulation? (both, neither?) – R.M. Dec 01 '17 at 12:28
  • @R.M.: The point of my post is to demonstrate the difference between "the limit of the number of stars in each row" and "the number of columns ending in stars" in the simplest way possible; that's all I'm trying to achieve with this. Taking the highest numbered ball would make for a poor example, since the *answers* to the two questions would turn out to be the same. Not an ideal situation when I'm trying to highlight how the questions are different! –  Dec 01 '17 at 18:07
  • @Hurkyl From the way your answer is currently phrased, it's unclear that you intend it to be "here's a completely different problem that shows that a certain line of thinking is fallacious" rather than "here's an equivalent restating of the problem that provides a simpler way of obtaining the result". -- It might be helpful to emphasize that your answer doesn't actually say anything about how many balls are left under the original problem statement, instead it only demonstrates that it's more complicated that it might seem. – R.M. Dec 02 '17 at 13:42
14

This answer aims to do four things:

  1. Review Ross's mathematical formulation of the problem, showing how it follows directly and unambiguously from the problem description.

  2. Defend the position that Ross's paradoxical solution is both mathematically sound and relevant to our understanding of the physical world, whether or not it is 100% physically realizable.

  3. Discuss certain fallacious arguments rooted in physical intuition, and show that the oft-stated "physical" solution of infinite balls at noon is not only in contradiction to mathematics, but to physics as well.

  4. Describe a physical implementation of the problem which may make Ross's solution more intuitive. Start here for the answer to Carlos's original question.

1. How to Describe the Problem Mathematically

We will unpack the initial "infinite process modeling" step of Ross's argument (p. 46). Here is the statement we will focus on justifying:

Define $E_n$ to be the event that ball number 1 is still in the urn after the first n withdrawals have been made... The event that ball number 1 is in the urn at 12 P.M. is just the event $\bigcap_{n=1}^\infty E_n$.

Before we unpack Ross's statement, let's consider how it is even possible to understand the urn's contents at noon, after an infinite sequence of operations. How could we possibly know what is in the urn? Well, let's think about a specific ball $b$; you can imagine $b=1$ or $1000$ or whatever you want. If ball $b$ was taken out at some stage of the process before noon, certainly it won't be in the urn at noon. And conversely, if a given ball was in the urn at every single stage of the process up until noon (after it was added), then it was in the urn at noon. Let's write these statements out formally:

A ball $b$ is in the urn at noon if and only if it was in the urn at every stage $n \in \{n_b, n_b + 1, n_b + 2, ...\}$ before noon, where $n_b$ is the stage the ball was added to the urn.

Now let's unpack Ross's statement - what does $\bigcap_{n=1}^\infty E_n$ mean in plain English? Let's take a single realization $x$ of the urn process and talk it out:

  • $x \in E_1$ means that ball 1 is in the urn after stage 1 of the process.
  • $x \in E_1 \bigcap E_2$ means that ball 1 is in the urn after stages 1 and 2 of the process.
  • $x \in E_1 \bigcap E_2 \bigcap E_3$ means that ball 1 is in the urn after stages 1, 2, and 3 of the process.
  • For any $k \in \{1, 2, 3, ...\}$, $x \in \bigcap_{k=1}^n E_k$ means that the ball is in the urn after stages $1$ thru $n$.

Clearly, then, $x \in \bigcap_{k \in \{1, 2, 3...\}} E_k$ means that, in realization $x$ of this urn process, ball 1 is in the urn after stages 1, 2, 3, et cetera: all finite stages $k$ before noon. The infinite intersection $\bigcap_{n = 1}^\infty E_n$ is just another way of writing that, so $\bigcap_{n = 1}^\infty E_n$ contains precisely the realizations of the process where ball 1 was in the urn at all stages before noon. An event is just a defined set of realizations of a process, so the last sentence is precisely equivalent to saying that $\bigcap_{n = 1}^\infty E_n$ is the event that ball 1 was in the urn at all stages before noon, for this random process.

Now, the punchline: by our "if and only if" statement above, this is exactly the same as saying that ball 1 was in the urn at noon! So $\bigcap_{n = 1}^\infty E_n$ is the event that ball 1 is in the urn at noon, just as Ross originally stated. QED

In the derivation above, everything we said is equally valid for both the deterministic and probabilistic versions, because deterministic modeling is a special case of probabilistic modeling in which the sample space has one element. No measure theoretic or probability concepts were even used, beyond the words "event" and "realization" (which are just jargon for "set" and "element").

2. The Paradoxical Solution is Mathematically Sound and Relevant to Physics

After this setup point, the deterministic and probabilistic variants diverge. In the deterministic variant (version 2 from amoeba's post), we know ball 1 is taken out on the first step, so $E_1 = \emptyset$ and the infinite intersection, of course, is also empty. Similarly, any other ball $b$ is taken out at stage $b$ and is not present at noon. Thus the urn cannot contain any numbered ball $b$ at noon and must therefore be empty.

In the probabilistic variant, the same phenomenon happens, just in a softer "in-expectation" sense. The probability of any given ball's being present dwindles to zero as we approach noon, and at the limiting time of noon, the ball is almost surely not present. Since each ball is present with probability zero, and the sum of infinitely many zeros is still zero, there are almost surely no balls in the urn at noon. All of this is shown completely rigorously by Ross; details can be filled in with a knowledge of graduate-level measure theory, as @ekvall's answer shows.

If you accept the standard arguments about mathematical objects expressed as infinite sequences, for example $0.999... = 1$, the argument here should be just as acceptable, as it relies on the exact same principles. The only question remaining is whether the mathematical solution applies to the real world, or just the platonic world of mathematics. This question is complex and is discussed further in section 4.

That said, there is no reason to presuppose that the infinite urn problem is unphysical, or to reject it as irrelevant even if it is unphysical. Many physical insights have been gained from studying infinite structures and processes, for example, infinite wires and percolation lattices. Not all of these systems are necessarily physically realizable, but their theory shapes the rest of physics. Calculus itself is "unphysical" in some ways, because we don't know if it is possible to physically realize the arbitrarily small distances and times that are often its subject of study. That doesn't stop us from putting calculus to incredibly good use in the theoretical and applied sciences.

3. The Unphysicality of Solutions Based on "Physical Intuition"

For those who still believe that Ross's math is wrong or physically inaccurate in the deterministic variant, and the true physical solution is infinitely many balls: regardless of what you think happens at noon, it is impossible to deny the situation before noon: every numbered ball added to the urn eventually gets removed. So if you think there are somehow still infinitely many balls in the urn at noon, you must admit that not one of those balls can be a ball added before noon. So those balls must have come from somewhere else: you are asserting that infinitely many balls, unrelated to the original problem process, suddenly pop into existence at precisely noon to rescue the continuity of cardinality from being violated. As unphysical as the "empty set" solution might seem intuitively, this alternative is objectively and demonstrably unphysical. Infinite collections of objects do not pop into being in an instant just to satisfy poor human intuitions about infinity.

The common fallacy here seems to be that we can just look at the number of balls as time approaches noon, and assume that the divergent trend yields infinitely many balls at noon, without regard to exactly which balls are being taken in and out. There has even been an attempt to justify this with the "principle of indifference", which states that the answer shouldn't depend on whether the balls are labeled or not.

Indeed, the answer does not depend on whether the balls are labeled or not, but that is an argument for Ross's solution, not against it. From the perspective of classical physics, the balls are effectively labeled whether you think of them as labeled or not. They have distinct, permanent identities which are equivalent to labels, and a truly physical analysis must account for this, whether or not numbers are literally written on the balls. The labels themselves do not directly affect how the solution comes out, but they are needed to describe exactly how the balls are moved around. Some procedures leave balls in the urn forever, others provably remove every ball that is added, and labels are needed to even describe the difference between these procedures. Attempting to ignore the labels is not "physical", it's just neglecting to understand the physical problem precisely enough to solve it. (The same goes for complicated variants that reshuffle the labels at each stage. What matters is which balls are in the urn, not the labels someone has placed or replaced on them. This can be determined by ignoring the complicated relabeling scheme entirely and simply using a single unchanging labeling scheme, the one of Ross's original problem.)

The only way distinguishability would fail to be true is if the "balls" were quantum mechanical particles. In this case, the indifference principle fails spectacularly. Quantum physics tells us that indistinguishable particles behave completely differently than distinguishable ones. This has incredibly fundamental consequences for the structure of our universe, such as the Pauli exclusion principle, which is perhaps the single most important principle of chemistry. No one has attempted to analyze a quantum version of this paradox yet.

4. Describing the Solution Physically

We have seen how vague "physical" intuitions can lead us astray on this problem. Conversely, it turns out that a more physically precise description of the problem helps us understand why the mathematical solution is actually the one that makes the most physical sense.

Consider an infinite Newtonian Universe governed by the laws of classical mechanics. This Universe contains two objects: an infinite Shelf and an infinite Urn, which start at the Origin of the Universe and run alongside one another, one feet apart, forever and ever. The Shelf lies on the line $y = 0$ feet, while the Urn lies on the line $y = 1$ feet. Along the Shelf are laid infinitely many identical balls, evenly spaced one foot apart, the first being one foot from the Origin (so ball $n$ is on the line $x = n$ feet). The Urn - which is really just like the Shelf, but a bit more ornate, closed over, and generally Urnish - is empty.

An Aisle connects the Shelf and Urn at the bottom, and on top of the Aisle, at the Origin, sits an Endeavor robot with an infinite power supply. Beginning at 11 AM, Endeavor activates and begins zooming back and forth in the Aisle, transferring balls between Urn and Shelf according to Ross-Littlewood's programmed instructions:

  • When the program commands ball $n$ to be inserted into the Urn, the ball $n$ feet from the Origin is transferred from the Shelf to the Urn.
  • When the program commands ball $n$ to be removed from the Urn, the ball $n$ feet from the Origin is transferred from the Urn to the Shelf.

In either case, the transfer is made straight across, so the ball remains $n$ feet from the Origin. The process unfolds as specified in the Ross-Littlewood problem:

  • At 11:00 AM, Endeavor transfers balls 1-10 from Shelf to Urn, then moves one of the Urn balls back to Shelf.
  • At 11:30 AM, Endeavor transfers balls 11-20 from Shelf to Urn, then moves one of the Urn balls back to Shelf.
  • At 11:45 AM, Endeavor transfers balls 21-30 from Shelf to Urn, then moves one of the Urn balls back to Shelf.
  • et cetera...

As the process continues, each new step requires longer trips up and down the Aisle, and only half the time to make the trips. Thus, Endeavor must move up and down the Aisle exponentially faster as noon closes in. But it always keeps up with the program, because it has an infinite power supply and can move as fast as needed. Eventually, noon arrives.

What happens in this more vividly imagined version of the paradox? Watched from above, the approach towards noon is truly spectacular. Within the Urn, a Wave of balls appears to propagate outward from the Origin. The Wave's size and speed grow without bound as noon approaches. If we were to take pictures immediately after each step what would the layout of balls would look like? In the deterministic case, they would look exactly like the step functions in amoeba's answer. The ball positions $(x, y)$ would follow precisely the curves he has plotted. In the probabilistic case, it would look roughly similar, but with more straggling near the Origin.

When noon arrives, we take stock of what has happened. In the deterministic version, each ball was transferred from the Shelf to the Urn exactly once, then moved back at a later step, with both transfers happening before noon. At noon, the Universe must be back to its original 11 AM state. The Wave is no more. Each ball is back exactly where it started. Nothing has changed. The Urn is empty. In the probabilistic version the same thing happens, except now the result is only almost sure rather than sure.

In either case, "physical objections" and complaints about infinity seem to vanish into thin air. Of course the Urn is empty at noon. How could we have imagined otherwise?

The only remaining mystery is the fate of Endeavor. Its displacement from the Origin and its velocity became arbitrarily large as noon approached, so at noon, Endeavor is nowhere to be found in our infinite Newtonian Universe. The loss of Endeavor is the only violation of physics which has occurred during the process.

At this point, one could object that Endeavor is not physically possible, since its speed grows without bound and would eventually violate the relativistic limit, the speed of light. However, we can change the scenario slightly to resolve this issue. Instead of a single robot, we could have infinitely many robots, each responsible for a single ball. We could program them beforehand to ensure perfect coordination and timing according to Ross's instructions.

Is this variation 100% physical? Probably not, because the robots would have to operate with arbitrarily precise timing. As we approach noon, the precision demanded would eventually fall below the Planck time and create quantum mechanical issues. But ultimately, an infinite wire and an infinite percolation lattice might not be all that physical either. That doesn't stop us from studying infinite systems and processes and determining what would happen if the obstructing physical constraints were suspended.

4a. Why Count Monotonicity is Violated

A number of Ross skeptics have questioned how it is possible that the number of balls in the urn increases without bound as we approach noon, then is zero at noon. Ultimately we must believe in rigorous analysis over our own intuition, which is often wrong, but there is a variation of the paradox that helps illuminate this mystery.

Suppose that instead of infinitely many balls, we have $10N$ balls labeled 1, 2, 3, up to $10N$, and we issue the following addition to the rules for the ball mover:

  • If the instructions ask you to move a ball that does not exist, ignore that instruction.

Note that the original problem is unchanged if we add to it this instruction, since the instruction will never be activated with infinitely many balls. Thus, we can think of the original problem and this new family of problems to be part of the same family, with the same rules. Examining the finite $N$ family, especially for very large $N$, can help us to understand the "N = $\infty$" case.

In this variation, the balls accumulate 9 per step as before, but only up to step $N$ of the process. Then the numbers for balls to be added no longer correspond to actual balls, and we can only comply with the instruction to remove balls, and the process stops after $9N$ additional steps, for a total of $10N$ steps. If $N$ is very large, the removal-only phase occurs very close to noon, when the tasks are being done very rapidly, and the urn is emptied out very quickly.

Now suppose we do this variation of the experiment for each value of $N$ and graph the ball count over time, $f_N(t)$, where $t$ ranges from 0 to 1 hour after 11AM (i.e. 11AM to noon). Typically $f_N(t)$ rises for a while, then falls back to zero at or before $t=1$. In the limit as $N$ approaches infinity, the graph rises ever higher and the fall is ever more rapid. By noon the urn is always empty: $f_N(1) = 0$. In the limiting graph, $f(t) = \lim_{N \rightarrow \infty} f_N(t)$, the curve approaches infinity for $t < 1$ but $f(1) = 0$. This is precisely the result derived in Ross's proof: the ball count diverges to infinity before noon, but is zero at noon. In other words, Ross's solution preserves continuity with respect to N: the pointwise limit of the ball count as $N \rightarrow \infty$ matches the ball count in the infinite ball case.

I do not consider this a primary argument for Ross's solution, but it may be helpful for those who are puzzled about why the ball count goes up forever, than crashes to zero at noon. While strange, it is the limiting behavior of the finite version of the problem as $N \rightarrow \infty$, and thus does not come as a "sudden shock" in the infinite case.

A Final Reflection

Why has this problem proven to be such a tar-pit for so many? My speculation is that our physical intuition is much vaguer than we think it is, and we often draw conclusions based on imprecise and incomplete mental conceptions. For example, if I ask you to think of a square that is also a circle, you may imagine something squarish and circlish, but it won't be precisely both of those things - that would be impossible. The human mind can easily mash together vague, contradictory concepts into a single mental picture. If the concepts are less familiar, like the Infinite, we can convince ourselves that these vague mental mashups are actually conceptions of the Real Thing.

This is precisely what happens in the urn problem. We do not really conceive of the whole thing at once; we think about bits and pieces of it, like how many balls there are over time. We wave away supposedly irrelevant technicalities, like what happens to each humble little ball over time, or how exactly an "urn" can hold infinitely many balls. We neglect to set out all the details precisely, not realizing that the result is a mashup of inconsistent, incompatible mental models.

Mathematics is designed to rescue us from this condition. It disciplines and steels us in the face of the unfamiliar and the exotic. It demands that we think twice about the things that "must" be true... right? It reminds us that no matter how strange things get, one and one is still two, a ball is either in an urn or it is not, and a statement is either true or false. If we persevere, these principles eventually bring clarity to most of our problems.

Those who subordinate mathematical analysis to "physical" or "common-sense" intuitions do so at their peril. Hand-waving about intuitions is only the start of physics. Historically, all successful branches of physics have eventually founded themselves on rigorous mathematics, which culls away incorrect physical intuitions, strengthens correct ones, and enables the rigorous study of ideal systems, such as the infinite current-carrying wire, which illuminate the behavior of the more complicated, messy real world. Ross-Littlewood is a physical problem, typically interpreted as one of classical mechanics, and classical mechanics has a completely mature and rigorous mathematical foundation. We should rely upon mathematical modeling and analysis for our intuitions about the world of classical physics, not the other way around.

Paul
  • 9,773
  • 1
  • 25
  • 51
  • 3
    This is the way to go. However, the full meaning of "this has nothing to do with probability" isn't completely clear, because there are necessary assumptions on the probability: without them, the conclusions change. For instance, if you assign zero probability at each stage to the chance of withdrawing ball $1$, then ball $1$ will remain after midnight. – whuber Nov 28 '17 at 21:22
  • Comments are not for extended discussion; this conversation has been [moved to chat](https://chat.stackexchange.com/rooms/76771/discussion-on-answer-by-paul-at-each-step-of-a-limiting-infinite-process-put-10). – whuber Apr 29 '18 at 21:45
14

Several posters have been concerned the computations in Ross may not be rigorous. This answer addresses that by proving the existence of a probability space where all sets of outcomes considered by Ross are indeed measurable, and then repeats the vital parts of Ross's computations.

Finding a suitable probability space

To make Ross's conclusion that there are no balls in the urn at 12 P.M., almost surely, rigorous, we need the existence of a probability space $(\Omega, \mathcal F, P)$ where the event "no balls in the urn at 12 P.M." can be constructed formally and shown to be measurable. To that end, we shall use Theorem 33 [Ionescu - Tulcea] in these lecture notes, slightly reworded, and a construction suggested by @NateEldredge in a comment to the question.

Theorem. (Ionescu - Tulcea Extension Theorem) Consider a sequence of measurable spaces $(\Xi_n, \mathcal X_n), n = 1, 2, \dots$. Suppose that for each $n$, there exists a probability kernel $\kappa_n$ from $(\Xi_1, \mathcal X_1) \times \dots \times(\Xi_{n-1}, \mathcal X_{n-1})$ to $(\Xi_n, \mathcal X_n)$ (taking $\kappa_1$ to be a kernel insensitive to its first argument, i.e., a probability measure). Then there exists a sequence of random variables $X_n, n = 1, 2, \dots$ taking values in the corresponding $\Xi_n$, such that, for every $n$, the joint distribution of $(X_1, \dots, X_n)$ is that implied by the kernels $\kappa_1, \dots, \kappa_n$.

We let $X_n$ denote the label of the ball removed at the $n$th withdrawal. It's clear that the (infinite) process $X = (X_1, X_2, \dots)$, if it exists, tells us everything we need to know to mimic Ross's arguments. For example, knowing $X_1, \dots, X_m$ for some integer $m \geq 0$ is the same as knowing the number of balls in the urn after withdrawal $m$: they are precisely the added balls with labels $\{1, 2, \dots, 10m\}$, minus the removed balls $\{X_1, \dots, X_m\}$. More generally, events describing which, and how many, balls are in the urn after any given withdrawal can be stated in terms of the process $X$.

To conform with Ross's experiment we need that, for every $n\geq 2$, the distribution of $X_n \mid X_{n-1}, \dots, X_{1}$ is uniform on $\{1, 2, \dots, 10n\} \setminus {X_1, \dots, X_{n-1}}$. We also need the distribution of $X_1$ to be uniform on $\{1,\dots, 10\}$. To prove that an infinite process $X = (X_1, X_2, \dots)$ with these finite-dimensional distributions indeed exists, we check the conditions of the Ionescu-Tulcea Extension Theorem. For any integer $n$, let $\mathcal I_n = \{1, 2, \dots, n\}$ and define the measurable spaces $(\Xi_n, \mathcal X_n) = (\mathcal I_{10n}, 2^{\mathcal I_{10n}})$, where $2^B$ denotes the power set of the set $B$. Define the measure $\kappa_1$ on $(\Xi_1, \mathcal X_1)$ to be the one that puts mass $1/10$ on all elements of $\Xi_1$. For any $n \geq 2$, and $(x_1, \dots, x_{n-1}) \in \Xi_1 \times \dots \times \Xi_{n-1}$ define $\kappa_n(x_1, \dots, x_{n-1}, \cdot)$ to be the probability kernel that puts equal mass on all points in $\Xi_n \setminus \{x_1, \dots, x_{n-1}\}$, and mass zero on all other points, i.e. on the integers $x_i \in \Xi_n, i = 1, \dots, n - 1$. By construction, the probability kernels agree with the uniform removal probability specified by Ross. Thus, the infinite process $X$ and the probability space $(\Omega, \mathcal F, P)$, the existence of which are given by the theorem, give us a way to formally carry out Ross's argument.

Let $E_{in}$ denote the set of outcomes such that ball $i$ is in the urn after withdrawal $n$. In terms of our stochastic process $X$ this means that, for all $i$ and $n$ such that $i \leq 10n$ we define $E_{in} = \cap_{j = 1}^n\{\omega: X_j(\omega) \neq i\}$, i.e. ball $i$ was not removed in any of the draws up to and including the $n$th. For $i > 10n$ we can clearly define $E_{in} = \emptyset$ since ball $i$ has not yet been added to the turn. For every $j$ and $i$, the set $\{\omega: X_j(\omega) \neq i\}$ is measurable since $X_j$ is a random variable (measurable). Thus, $E_{in}$ is measurable as the finite interesection of measurable sets.

We are interested in the set of outcomes such that there are no balls in the urn at 12 P.M. That is, the set of outcomes such that for every integer $i = 1, 2\dots$, ball $i$ is not in the urn at 12 P.M. For every $i$, let $E_i$ be the set of outcomes ($\omega \in \Omega$) such that ball $i$ is in the urn at 12 P.M. We can construct $E_i$ formally using our $E_{in}$ as follows. That $i$ is in the urn at 12 P.M. is equivalent to it being in the urn after every withdrawal made after it was added to the urn, so $E_i = \cap_{n:i\leq 10n}E_{in}$. The set of outcomes $E_i$ is now measurable as the countable intersection of measurable sets, for every $i$.

The outcomes for which there is at least one ball in the urn at 12 P.M. are those for which at least one of the $E_i$ happen, i.e. $E = \cup_{i = 1}^\infty E_i$. The set of outcomes $E$ is measurable as the countable union of measurable sets. Now, $\Omega \setminus E$ is the event that there are no balls in the urn at 12 P.M., which is indeed measurable as the complement of a measurable set. We conclude that all desired sets of outcomes are measurable and we can move on to computing their probabilities, as Ross does.

Computing the probability $P(\Omega \setminus E)$

We first note since the family of events $E_i, i = 1, 2, \dots$ is countable, we have by countable sub-additivity of measures that

$$ P(E) \leq \sum_{i = 1}^\infty P(E_i) = \lim_{N\to \infty}\sum_{i = 1}^N P(E_i). $$ For ease of notation, let's denote the real number $P(E_i) = a_i$ for all $i$. Clearly, to show that $P(E) = 0$ it suffices to show that $\sum_{i = 1}^N a_i = 0$ for all $N$. This is equivalent to showing that $a_i = 0$ for every $i$, which we shall do now.

To that end, note that for all $n$ such that ball $i$ has been added to the urn, i.e. $10n \geq i$, $E_{in} \supseteq E_{i(n + 1)}$. This is so because if ball $i$ is in the urn at step $n + 1$, it is also in the urn at step $n$. In other words, the sets $E_{in}$, form a decreasing sequence for all $n$ such that $10n \geq i$. For ease of notation, let $a_{in} = P(E_{in})$. Ross proves that $a_{1n} \to 0$ as $n \to \infty$ and states that this can also be shown for all other $i$, which I will take as true. The proof consists of showing that $a_{in} = \prod_{k = i}^n[9k / (9k + 1)]$ and $\lim_{n \to \infty}a_{in} = 0$ for all $i$, an elementary but lenghty calculation I will not repeat here. Armed with this result, and the fact that the family of events $E_{in}$, $10n > i$ is countable for every i, continuity of measures gives

$$a_i = P(\cap_{n: 10n > i}E_{in}) = \lim_{n \to \infty} P(E_{in}) = \lim_{n \to \infty}a_{in} = 0.$$

We conclude that $P(E) = 0$, and thus $P(\Omega\setminus E) = 1$ as claimed. QED.


Some common misunderstandings:

  1. One answer is concerned with the fact that (in my notation) $\lim_{N \to \infty}\sum_{i = 1}^N \lim_{n \to \infty }a_{in} \neq \lim_{N \to \infty}\sum_{i = 1}^N a_{iN}$. This, however, has no bearing on the validity of the solution bevause the quantity on the right hand side is not the one of interest per the provided argument.
  2. There has been some concern that the limit cannot be moved inside the sum, or in other words cannot be interchanged with the sum in the sense that it may be the case that $\sum_{i = 1}^\infty \lim_{n \to \infty}a_{in} \neq \lim_{n \to \infty}\sum_{i = 1}^\infty a_{in}$. Like the previous remark, this is irrelevant to the solution because the quantity on the right hand side is not the one of interest.
ekvall
  • 4,361
  • 1
  • 15
  • 37
  • 4
    @ekvall Kudos for putting in this thankless work. What people should generally understand is that, if you define some events and do countable set operations on those events, the resulting sets are measurable in the sigma algebra generated by those events. That's precisely what sigma algebras are designed to do: give us a universe where we can do countable set operations without any concern about measurability. – Paul Nov 30 '17 at 21:46
  • Comments are not for extended discussion; this conversation has been [moved to chat](https://chat.stackexchange.com/rooms/76772/discussion-on-answer-by-ekvall-at-each-step-of-a-limiting-infinite-process-put). – whuber Apr 29 '18 at 21:46
10

On the one hand, you could try to explain it like this: "think of the probability of any ball i being on the urn at 12 P.M. During the infinite random draws, it will eventually be removed. Since this holds for all balls, none of them can be there at the end".

I don't find this argument convincing. If this argument works, then the following argument works: Every year, some people are born (say a constant fraction of the total population), and some people die (suppose a constant fraction). Then, since in the limit any particular person is almost surely dead, then the human race must go extinct! Now, the human race may go extinct for other reasons, but this argument is garbage.

It doesn't make any sense for this problem to have one solution when the balls are numbered and for it to have a totally different answer when the balls are anonymous. By symmetry, arbitrary labels should not affect the solution. Jaynes called this argument the principle of indifference, which I accept.

In other words, if someone told you that they put ten balls into an urn and remove one repeatedly, and how full is the urn in the limit, would your answer be "It depends on whether the balls are numbered"? Of course not. That urn's contents diverge just like the urn in this problem.

Therefore, I think the solution lies in how we formalize the problem. From the usual definition of set-theoretic limit, we have

$$\liminf_{n \to \infty} S_n = \bigcup_{n \ge 1} \bigcap_{j \geq n} S_j.$$ $$\limsup_{n \to \infty} S_n = \bigcap_{n \ge 1} \bigcup_{j \geq n} S_j$$

Let the limit of the cardinality of the set be

$$k\triangleq \lim_{n\to\infty}|S_n|$$

and the cardinality of the $\liminf$-limit of the set be

$$l \triangleq \left|\liminf_{n\to\infty} (S_n)\right|.$$

I propose that set-theoretic limit be redefined so that:

\begin{align} \lim_{n\to\infty} S_n &\triangleq \begin{cases} \liminf_{n\to\infty} (S_n) &\text{if } \liminf_{n\to\infty} (S_n) = \limsup_{n\to\infty} (S_n), k \text{ exists, and }k=l \\ \alpha_k &\text{if }\liminf_{n\to\infty} (S_n) = \limsup_{n\to\infty} (S_n), k\text{ exists, and }k \ne l \\ \text{undefined} &\text{otherwise.} \end{cases} \end{align}

This special “anonymous set” $\alpha_k$ describes what happens at infinity. Just as $\infty$ stands in for the limiting behavior of numbers, $\alpha$ stands in for the limiting behavior of sets. Namely, we have $i \notin \alpha_k \forall i$, and $|\alpha_k| = k$. The benefit of this formalism is that it gives us continuity of cardinality and consistency with the principle of indifference.

For the urn problem, we have $S_n = \{n+1, \dotsc, 10n\}$ is the set of balls in the urn. And $$\lim_{n\to\infty} S_n = \alpha_{\infty}.$$ Thus, the elements don't "fall off a cliff" at infinity, which doesn't make sense any more than it makes sense for humanity to go extinct merely because no man is immortal.

Similarly, suppose we modify the problem so that at each step one ball is added and the lowest-numbered ball is removed. Then, how many balls are in the urn in the limit? Anonymous sets give the intuitive answer:

$$\lim_{n\to\infty}\{n\} = \alpha_1.$$

I recognize that mathematicians can disagree about resolutions to this paradox, but to me, this is the most intuitive resolution.

Neil G
  • 13,633
  • 3
  • 41
  • 84
  • 8
    Anyone arguing that the math needs to be fixed *must* deliver a very convincing demonstration of why. Otherwise, the default position has to be that one's intuition deserves correction. If not, then we can scarcely claim to have advanced at all beyond Zeno during the last 2500 years. – whuber Nov 24 '17 at 20:47
  • 5
    If you accept the regular probability axioms and if you further accept that the probability of any particular ball being in the urn is zero, then by Boole's inequality you are bound to accept that the probability that none of the balls are in the urn is one. – Carlos Cinelli Nov 24 '17 at 20:51
  • 3
    @NeilG: Often, I find "the theory doesn't align with my intuition" means "I'd rather think about a different problem than the one posed" rather than any actual flaw in the theory or in the intuition. –  Nov 25 '17 at 03:49
  • 5
    The human race isn't doomed to extinction by your argument because we will never reach a point at which infinitely many births / deaths have occurred -- there's never a need to take the limit. The fact that at 12pm, infinitely many things have happened, is pretty much the key source of the problem. – Ben Millwood Nov 25 '17 at 05:28
  • 6
    -1. Consider the modification of this paradox when ball #n is removed at the n-th step (instead of a random ball). It's clear that zero balls will be left at midnight (because every ball will be removed at the corresponding step) but we are still adding 10 balls and removing only 1 ball at each step, so I'd say it is as un-intuitive. However, this modification has *nothing* to do with probability or statistics. So there cannot be any "failure of modern statistics" here. – amoeba Nov 25 '17 at 21:16
  • 6
    @NeilG This point was made explicitly on the MathOverflow post, and on ameoba's answer. Cardinality is not a continuous operation, so just because $S_i\to\emptyset$ doesn't mean $|S_i|\to 0$. Calculus is not broken, but rather you have invented a limit rule that doesn't exist. – Mario Carneiro Nov 26 '17 at 08:48
  • Comments are not for extended discussion; this conversation has been [moved to chat](https://chat.stackexchange.com/rooms/76773/discussion-on-answer-by-neil-g-at-each-step-of-a-limiting-infinite-process-put). – whuber Apr 29 '18 at 21:47
6

The problem is either ill-formed or not in first-order logic.

Root cause: execution of the "last" step will write an infinite number of digits on a ball, causing that step to take itself an infinite time to execute.

The ability to execute an infinite process with an infinite step implies the ability to solve all first-order logic problems (Gödel is therefore false) by execution of the following sequence H (for theorem X):

Z = asymptotic_coroutine(
  FOR N = 1...∞
    FOR P = 1...N
      Convert number P to string S by characters.
      IF S is a proof for theorem X
      THEN
        OUTPUT "yes" and HALT
) + asymptotic_coroutine(
  FOR N = 1...∞
    FOR P = 1...N
      Convert number P to string S by characters.
      IF S is a proof for theorem ¬X
      THEN
        OUTPUT "no" and HALT
)
IF Z = "" 
THEN Z = "independent"
IF Z = "yesno" ∨ Z = "noyes"
THEN Z = "paradox"
OUTPUT Z

where the infinite step is unspooling the output

The program inside the asymptotic_coroutine is merely an exhaustive search for a theorem that proves (or disproves) X. Converting P to S results in "aa", "ab", "ac", ... "a∨", ... where every symbol that can appear in a theorem is generated. This results in generating all theorems of length logcharacters N in turn. Since N grows without limit in the outer loop this will eventually generate all theorems.

The side that is false will never terminate but we don't have to care about that because we are allowed to execute infinite steps. In fact we depend on being able to do this to detect independence as both sides will never finish. Except for one thing. We allowed an infinite number of steps to execute in a finite time by asymptotic increase of execution speed. This is the surprising part. The asymptotic_coroutine that never finishes and never generates output has "finished"* after the asymptotic time and still has never generated any output.

*If we placed an OUTPUT after the FOR N = 1...∞ it would not be reached but we are not going to do that.

The strong form of Gödel's Incompleteness Theorem may be stated "For every first-order logic system F there is a statement GF that is true in F but cannot be proven to be true in F." But proof method H cannot fail to prove all must-be-true statements in F(H).

Dilemma: ¬Gödel ∨ ¬(infinite steps are allowed)
Therefore:
Dilemma: ¬Gödel ∨ ¬(315502 is well formed in first order logic)

Joshua
  • 233
  • 1
  • 2
  • 7
  • 1
    Good point (+1). Note that there is research on infinite-time Turing machines, see e.g. https://arxiv.org/abs/math/0212047v1 and https://mathoverflow.net/a/22038. It's not first order of course. – amoeba Nov 29 '17 at 08:21
  • 5
    Joshua, your answer assumes knowledge that most people here is not familiar with so they won't be able to judge it. If you could elaborate further that would be great. – Carlos Cinelli Nov 29 '17 at 19:15
  • For any finite number, the length is finite. For any infinite (aka transfinite) numbe, itr can be written in Cantor Normal Form, which is finite in length. It could be called "base infinity". So writing digits is not a limitation. – Craig Hicks Dec 20 '17 at 23:57
  • @CraigHicks: That doesn't work when you had to write down all the intermediate numbers in between too. Hint: what's the stopping constraint on the loop when it switches from base 10 integer to cantor normal form output. – Joshua Dec 21 '17 at 00:02
  • That's only a constraint on a machine which doesn't have $\infty$ in it's symbol table. To analyze in finite time the infinite +10 -1 process described by Ross, it is not necessary to simulate the entire process. A smart program would connect to Mathematica and get it done much faster. – Craig Hicks Dec 21 '17 at 00:28
  • @CraigHicks: The problem isn't what mathematica thinks of the problem. The problem is approaching infinity via adding one each time will write all of the finite numbers onto balls. However much space you say is allotted on a ball will eventually be exceeded, and each number is written in a shorter amount of time than the previous. That is a sufficient infinity for my construction to work. In fact if it requires more than 3 digits on a ball my construction has used less ability. – Joshua Dec 21 '17 at 03:09
  • I do indeed appreciate the ironic humor in finding the theorems in the numbers written on the balls. Thank you very much for that. But the winning number which translated into this answer of yours - isn't there some history and reason and intuition behind it? – Craig Hicks Dec 21 '17 at 06:53
  • @CraigHicks: It's a transform of https://en.wikipedia.org/wiki/P_versus_NP_problem applied to the halting problem. If you manage to notice that Gödel depends on halts(Turing) is unsolvable rather than halts(self) is unsolvable than it's not to hard to discover. This last piece took me years to notice. – Joshua Dec 21 '17 at 16:16
4

What's the best explanation we can give to them to solve these conflicting intuitions?

Here's the best answer, and it has very little to do with probabilities. All balls have numbers, let's call them birth numbers. The birth numbers start from B1, B2, B3... and go to infinity, because we really never stop. We get closer to 12:00AM but keep adding and removing balls, that's why there is not a final number of a ball. This is a very important consideration, btw.

We put the balls into a box in 10 ball batches, such as batch #7: B71, B72,...,B80. Let's forget about these for a minute, and focus on the balls that are removed from the box. They come at a random order. I'll explain why randomness is important later, but for now all it means is that any ball with a brith number from B1 to B10k that's still in the box at step K can be drawn out. We're going to index the balls that we remove by the order in which they were removed, let's call them death numbers: D1, D2, D3 ... DK.

By 12:00AM we put infinite number of balls into a box, and surely we never ran out of balls to remove from it. Why? Because we first put 10 balls, THEN ONLY remove one. So, there's always a ball to remove. This means that we also removed infinite number of balls by 12:00AM.

This also means that each removed ball was indexed from 1 to infinity, i.e. we could pair each removed ball to a ball that was put in the box: B1 to D1, B2 to D2, etc. This means that we removed as many balls as we put in, because each birth number was paired with each death number.

Now that was the solution. Why does it defeat our intuition? It's elementary, Dr Watson. The reason is because we surely know that for all K this holds: $$K<10K$$ That's why after K steps, we should not be able to remove all ball from the box, because we put 10K balls and removed only K of them. Right?

There is a little problem. The matter is that when $K=\infty$, this is no longer true: $$10\times\infty\nless\infty$$ That's why the intuition breaks down.

Now, if the balls were not removed at random. Two thing may happen as in @amoeba's canonical answer. First, let's say we were putting 10 balls then immediately removing the last one. It's as if we were putting only nine balls in. This will match our intuition, and at 12:00AM there will be infinite number of balls. How come? Because we were not removing balls randomly, we were following the algorithm where the birth numbers were paired to death numbers as $B10K=DK$ at the time of removal. So, we paired each removed ball to one of the balls that we put in: $B10\to D1,B20\to D2,B30\to D3,\dots$ , this means a ton of balls were never ever paired B1,B2,...,B9,B11,... etc.

The second thing that may happen with non random ball removal is also related to pairing at removal: we correlate BK=DK. We can do this by removing a ball with BK at each step K, which ensures that BK is paired to DK. This way each removed ball is paired with each ball that we put in, i.e. the same end result like in the random draw of removed balls. Obviously, this means that there are no balls left in the box after 12:00AM.

I just have shown that the problem has very little to do with probabilities per se. It has everything to do with powers of infinite countable (?) sets. The only real problem that I avoided discussing is whether the sets are truly countable. You see when you get closer to 12:00AM your rate of ball inserts is increasing rather quickly, to put it mildly. So, it's not so trivial to devise whether the number of balls that we put into the box is actually countable.

Unraveling

Now, I'm going to unravel this canonical solution of the paradox, and get back to our intuition.

How is is possible that we put 10 balls in, remove one and still run out of all the balls at 12 hour? Here's what really is happening. 12 hour is unreachable.

Let as reformulate the problem. We don't halve time intervals anymore. We put and remove balls every minute. Isn't this exactly the same as in the original problem? Yes and no.

Yes, because nowhere in my exposition above I referred explicitly to time but at the very end. I was counting the steps k. So, we can keep counting the steps and dead balls by k.

No, because now we're never going to stop. We'll keep adding and removing balls till the end of time, which never arrives. While in the original problem the end is at 12 hour.

This explains how our intuition fails. Although we put balls at 9x rate of removal, because time never ends, every ball that we put in will be removed eventually! It may take infinite number of minutes, but it's Ok, because we have infinite number of minutes remaining. That's the true solution of the problem.

In this formulation would you ask "how many balls are in the box after infinity is over?" No! Because it's a nonsensical question. That's why the original question is nonsensical too. Or you could call it ill-posed.

Now, if you go back to the original problem, then the end of time apparently happens. It's at 12. The fact that we stopped putting balls in means that time just ended, and we reached beyond the end of it. So, the true answer to the question is that 12 o'clock should never occur. It's unreachable.

Aksakal
  • 55,939
  • 5
  • 90
  • 176
  • This is a fun way to look at it, and seems to argue more like we are not able to finish the supertask. But the example of Ross is full of probabilistic reasoning. It would be interesting if you could provide a connection to that. – Sextus Empiricus Dec 01 '17 at 20:37
  • 2
    @MartijnWeterings, I didn't do the probabilities because the paradox was constructed specifically to exploit measure theoretic foundations of probabilities. Whoever made the paradox must have first realized that it's about power of infinite countable sets. That's why it's prersented in three versions in the book as in amoeba's answer. The first version shows how a set of every tenths natural number has the same power as the set of all natural numbers, for example. The second and third versions are essentially the same. Probability here is just the landscape, all the action is in sets. – Aksakal Dec 01 '17 at 22:07
  • 1
    This reasoning does not seem to be able to distinguish between versions #1 and #2 from Ross book (see my answer), even though these versions lead to opposite results: in one case the urn gets empty and in another case it doesn't. – amoeba Dec 01 '17 at 22:16
  • @amoeba, it does, because in #1 B10k=Dk by construction. I'll clarify this. I noted that the balls are removed at random though – Aksakal Dec 01 '17 at 22:37
  • @Aksakal do I get you correctly if you say 1) We *can* make Minkoswki substractions of different infinite sets $B_1 - B_2$. When $B_1 = \lbrace 10k, k \in \mathbb{N} \rbrace + \lbrace 1,2,3,4,5,6,7,8,9,10 \rbrace$ the result is empty for $B_2 = \lbrace k, k \in \mathbb{N} \rbrace$ and the result is infinity for $B_2 = \lbrace 10k, k \in \mathbb{N} \rbrace$. 2) While one can ponder over these *correct* set theoretic substractions, they do not represent the supertask. The supertask can't be finished. Putting a Zeno's paradox on it, which most agree can be finished, is falsely argueing for RL. – Sextus Empiricus Dec 02 '17 at 13:16
  • If that is the case, then can we not at least argue for the 1st case, ignoring the paradox and just study the subtractions of infinite sequences. Then we may wonder. of all infinitely many possible situations, can we measure how many are giving an empty set after subtraction? That is like, if we randomly pick an infinite sequence $B_2$, out of the set of infinite sequences that satisfy certain conditions to make it suitable to the RL-"experiment". Does it contain all numbers in $N$? – Sextus Empiricus Dec 02 '17 at 13:21
  • 1
    I think the truth is that you can’t reach 12. That’s the true solution. Consider the same problem but instead of halving time at each step you make steps of equal duration in time, say 1 minute. This will go on forever. It will never stop. But the question will be “when you stop what’s in the box?” So your answer will be that it’s a nonsensical question because time never ends. – Aksakal Dec 02 '17 at 14:31
  • Of course you can reach 12. 1 hour intervals that follow the halving format of Ross's problem occur literally every hour of every day. To claim that this particular hour cannot pass because of the events that occur within it is the fallacy of special pleading. – Paul Dec 04 '17 at 15:46
  • Your argument basically says "an infinite sequence of halving intervals of time is kind of like an infinite sequence of constant intervals of time, and since an infinite sequence of constant intervals of time never passes, neither does the halving sequence." This is obviously weak reasoning, and if we accepted it as valid, we would also have to accept Zeno's paradoxes which prove that motion is impossible. Zeno's reasoning is basically the same as yours; "you can't cross an infinite sequence of events". Physical reality disproves this every second, so the reasoning to prove it is also false. – Paul Dec 04 '17 at 15:52
  • No, you can't refer here to the physical reality, because if you look at the way this process is constructed you'd have to swallow the universe before you even get close to 12. I'm not referring here to physical reality at all, because that would throw the whole problem out the window in the first one minute of thinking about it. I'm saying that by construction you'll get stuck in the first minute halving time intervals and trying to push more and more balls into the box. You'll never end doing it, so, you can't reach 12 – Aksakal Dec 04 '17 at 16:37
  • If your argument is "the problem is impossible because a finite urn can't hold infinitely many balls", then your answer should just say that. The problem is asking about ordinary time in ordinary physical reality. It is not too hard to conceive of an infinite urn that could hold the balls; indeed, my answer's section 4 describes exactly how to set it up. Such infinite physical systems are studied by physicists and mathematicians all the time. The urn problem is no less physical than the infinite wire in electromagnetics or the infinite lattice in percolation theory. – Paul Dec 06 '17 at 05:30
  • 1
    No. This is no ordinary time. That’s point. This problem sets up the time in a very different way than ordinary physical time. The urn is infinite and it’s ok – Aksakal Dec 06 '17 at 13:44
  • No, it is ordinary time. There is no problem with infinitely many events happening in finite time. No physical law prevents it. – Paul Dec 06 '17 at 17:14
  • 1
    Are you a physicist? What physical process you know that even remotely resembles this one? – Aksakal Dec 06 '17 at 17:47
  • I'm physicist enough for this conversation. There are plenty of infinite physical structures and processes that are studied in physics but are not necessarily physically realizable. The infinite wire of electromagnetics, the infinite lattice of percolation theory, etc. Correctly reasoning about these structures has profound practical implications. And once we accept the possibility of infinite structures, as theoretical physics does all the time, the possibility of infinite events in finite time is obvious. Ball 1 moves at 11AM, then ball 2 moves at 11:30AM, then ball 3 moves at 11:45AM, etc. – Paul Dec 06 '17 at 17:54
  • The setup of the problem in section 4 of my answer is pretty tame for theoretical physics. There are many more exotic structures that are successfully described and analyzed, with practical implications for everything from cosmology to the design of the computer you're typing on right now. – Paul Dec 06 '17 at 17:56
4

I want to make a reformulation that is as easy as possible to make the answer of 0 more intuitive, starting from the simplified example that balls are not removed randomly, but ball $n$ is removed at the $n$-th step.

Consider this: I put all balls into the urn at the beginning. In step 1, I take out ball 1. In step 2, I take out ball 2, and so on. Any doubt that the urn will be empty after infinite steps?

Okay. But if I don't put all balls into the urn at first, but only some balls, how could the urn be fuller in the end?

Thern
  • 281
  • 1
  • 5
  • 1
    +1. Nice. It's like every person one by one moving out of the fully occupied [Hilbert's Hotel](https://en.wikipedia.org/wiki/Hilbert%27s_paradox_of_the_Grand_Hotel); the hotel will be left empty. – amoeba Dec 04 '17 at 13:39
  • After every finite step n, the urn is not empty. Transactions however can only happen at finite steps. Contradiction. – Wilhelm Apr 27 '18 at 16:55
  • @Wilhelm Can you elaborate on that? I don't get the point. – Thern Apr 28 '18 at 08:38
  • @Thern: A ball can be removed only at a finite step n. But after every finite step there are balls in the urn (in the original example and in yours). Therefore the limit cannot be empty. Otherwise something must have happened between all finite steps and the limit. Contradiction. – Wilhelm Apr 28 '18 at 10:37
  • The contradiction is created by your belief in the following principle: "When the members of a sequence have a property I like, that property is preserved by taking the limit of the sequence." This is not a valid principle of mathematics (or physics for that matter). – Paul May 01 '18 at 00:12
  • There is no "property" of elements but the definition of the procedure: Only at finite steps elements can be transferred. It cannot be dropped without violating the constraints. If all finite states are not empty but the limit is empty, then the limit is state that differs from the finite states. This difference must be realized. It is forbidden by induction: there is always a next step, never a limit. The unphysicality of the limit gets obvious by modelling the ordered set $(0, 1, 2, ...) = \omega$ by a merry-go-round. The revolutions are the natural numbers. What will happen in the limit? – Wilhelm May 01 '18 at 11:23
  • The fact that a merry-go-round has no limiting state does not invalidate the existence of limiting states for other processes. Your mistaken assumption is that theories are invalid if they do not provide an answer to all possible questions. If this assumption were true, then logic would be invalid because it cannot consistently assign a truth value to the sentence "This statement is false". – Paul May 01 '18 at 16:27
  • As for the rest: "This difference must be realized" is meaningless words, and "there is ... never a limit" is an unsupported belief that makes your mathematics very underpowered and unappealing. – Paul May 01 '18 at 16:43
  • It is true that changes only happen at finite steps, and that the ball count at each finite step is increasing. Yet standard mathematics proves that the limiting set is empty. There is no contradiction. It is simply a proof that cardinality does not interchange with limits. The professional mathematical world knows that when fallible human intuitions meet with a rigorous proof, it also reveals the intuitions to be erroneous. We do not cling to them. – Paul May 01 '18 at 16:45
  • @Paul: A single counterexample shows the failure of a theory, Set theory undisputedly predicts that the sequence {1}, {2}, {3}, ... has the limit { } because the sequence {1}, {1, 2}, {1, 2, 3}, ... has the limit $\omega$. If the merry-go-round contradicts this, then this is a contradiction. (Of course all other applications of this theorem, like Ross-Littlewood, are as meaningless but they are not so easy to recognize as meaningless.) – Wilhelm May 01 '18 at 17:15
  • @Wilhelm "A ball can be removed only at a finite step n. But after every finite step there are balls in the urn (in the original example and in yours). Therefore the limit cannot be empty." If that would be true, then the following statement would be true as well: "The limit of $\frac{1}{n}$ for $n\rightarrow\infty$ can't be zero, because in every finite step, the number is greater than zero. Therefore the limit can not be zero." – Thern May 01 '18 at 17:22
  • "Yet standard mathematics proves that the limiting set is empty." That is not mathematics but matheology, absolutely useless for practical purposes (see https://www.hs-augsburg.de/~mueckenh/Transfinity/Transfinity/pdf, pp. 124ff, although Cantor deliberately devised it for practical application) and contradicting mathematics, here the (improper) limit of cardinalities. It is no solution to claim that cardinalities are not continuous. It is simply violating the conditions of the given example. No "fallible intuition is required to see this. – Wilhelm May 01 '18 at 17:22
  • @Thern: There are two very different limits: The limit of $(1/n)$ is $0$. But it is not "reached" by the terms of the sequence. It is simply a real number such that for every $\epsilon > 0$ there is an $m$ such that for $n > m: |a_n - 0| < \epsilon$. For this definition there need not be a limit of the sequence of indices $(n)$ or an actual infinity. It is sufficient to prove, for instance by mathematical induction, that the inequality is never violated for $n > m$. The set-theoretical limit is quite different. In our example the sets are evolving, by adding and losing balls, into each other. – Wilhelm May 01 '18 at 17:36
  • Of course after every finite step there are balls in the urn. This fact does never change. It is simply a mistake to assume that the sequence $(n)$ could be finished. It is assumed by mathematicians who are unable to understand the real infinite. Every ball leaves the urn. But every ball is followed by 10 others - and this does never end (that's why it is called infinite or unending). – Wilhelm May 01 '18 at 17:39
  • "If the merry-go-round contradicts this, then this is a contradiction." You have shown no contradiction, merely asked a vague question about what will happen in the situation. Your Transfinity book makes no explanation whatsoever of what the set theoretic limit has to do with the carousel. – Paul May 01 '18 at 19:33
  • @Paul: Of course you must say so. It is impossible for you to understand my arguments. You said: "It is true that changes only happen at finite steps, and that the ball count at each finite step is increasing. Yet standard mathematics proves that the limiting set is empty." This shows that what you call standard mathematics contradicts the facts. But nevertheless you believe it. See section Psychological aspects of set theory in https://www.hs-augsburg.de/~mueckenh/Transfinity/Transfinity/pdf, pp. 344ff since mathematical arguments will not help. They will not be understood by you and yours. – Wilhelm May 01 '18 at 20:19
  • @Wilhelm So you would say that if I start with an infinite number of balls and draw out one by one, the resulting limit would still be infinity? – Thern May 02 '18 at 08:29
  • @Thern: The answer depends on the conditions. (1) *In fact* you cannot start with an infinite number of balls, because there is no actual, finished, or completed infinity. (2) In set theory you can start with an infinite number, but when you draw them out one by one, you get to zero after a finite number of steps. How can we accomplish that? See Goodstein sequences, for instance in https://www.hs-augsburg.de/~mueckenh/Transfinity/Transfinity/pdf, p. 53f. It is a general rule, forced by the well-foundedness of the sequence of ordinal numbers, (contd) – Wilhelm May 02 '18 at 09:23
  • ... that every strictly decreasing sequence of ordinal numbers reaches its smallest element after a finite number of steps. Since limit ordinals have no direct predecessors, we have to jump down from them to some predecessor. – Wilhelm May 02 '18 at 09:24
  • @Wilhelm I currently do not have time to delve into this deeply, but it seems to me that you are mixing up cardinality and ordinality. You may construct a Goodstein sequence with an infinite ordinal number, but that does not mean that the cardinality of the infinite urn (number of balls inside) can be handled the same way. Especially, you can't say that the number of balls in the urn is $\omega$; by using ordinal numbers $\omega$ would just be another ball in the urn, rendering your Goodstein sequence useless. – Thern May 02 '18 at 12:04
  • In ZFC the axiom of foundation holds for all sets. See, e.g., https://www.hs-augsburg.de/~mueckenh/Transfinity/Transfinity/pdf, p. 46f. – Wilhelm May 02 '18 at 13:29
3

Let x be the number of balls that have been removed and y be the number of balls remaining. After each cycle y=9x. As x>0, y>0. There will be infinitely many balls in the urn at 12PM.

The reason that solutions based on probabilities lead to difficulties is that the probabilities from infinite series are tricky. ET Jaynes wrote about a few different apparent paradoxes of probability, like this one, in his book Probability Theory: The Logic of Science. I do not have my copy at hand, but the first part of the book is available online from Larry Bretthorst here. The following quote is from the preface.

Yet when all is said and done we find, to our own surprise, that little more than a loose philosophical agreement remains; on many technical issues we disagree strongly with de Finetti. It appears to us that his way of treating infinite sets has opened up a Pandora’s box of useless and unnecessary paradoxes; nonconglomerability and finite additivity are examples discussed in Chapter 15.

Infinite set paradoxing has become a morbid infection that is today spreading in a way that threatens the very life of probability theory, and requires immediate surgical removal. In our system, after this surgery, such paradoxes are avoided automatically; they cannot arise from correct application of our basic rules, because those rules admit only finite sets and infinite sets that arise as well-defined and well-behaved limits of finite sets. The paradoxing was caused by (1) jumping directly into an infinite set without specifying any limiting process to define its properties; and then (2) asking questions whose answers depend on how the limit was approached.

For example, the question: “What is the probability that an integer is even?” can have any answer we please in (0, 1), depending on what limiting process is to define the “set of all integers” (just as a conditionally convergent series can be made to converge to any number we please, depending on the order in which we arrange the terms).

In our view, an infinite set cannot be said to possess any “existence” and mathematical prop- erties at all—at least, in probability theory—until we have specified the limiting process that is to generate it from a finite set. In other words, we sail under the banner of Gauss, Kronecker, and Poincar ́e rather than Cantor, Hilbert, and Bourbaki. We hope that readers who are shocked by this will study the indictment of Bourbakism by the mathematician Morris Kline (1980), and then bear with us long enough to see the advantages of our approach. Examples appear in almost every Chapter.

The use of limits in the answer of @enumaris (+1) provides a way around the trickiness of infinities in probability.

Michael Lew
  • 10,995
  • 2
  • 29
  • 47
  • 5
    Please show us *which laws of probability* justify your conclusion in the first paragraph. Without that, you are just making an unfounded assertion. – whuber Nov 24 '17 at 22:09
  • The question in the shaded region does not require laws of probability. The laws of probability lead one onto the rocks in situations like this. That is what Jaynes said, and this is a nice example of that difficulty. – Michael Lew Nov 24 '17 at 22:12
  • 3
    The problem arises not from the laws of probability, but when people fail to acknowledge or use the laws of probability correctly. It is no resolution of a paradox to deny the axioms and techniques that one otherwise uses for reasoning in other circumstances. – whuber Nov 24 '17 at 22:13
  • I do not deny the axioms! Where does the original question say that answers have to be based on argument from the axioms of probability? I am arguing that probability-based solutions lead to difficulties. Jaynes argues that those difficulties come from the application of probability to problems where there are infinities. – Michael Lew Nov 24 '17 at 22:18
  • 4
    The phrase "at random" in the question *demands* consideration of probabilities. Otherwise, what do you understand "at random" to mean?? – whuber Nov 24 '17 at 22:20
  • I do not understand the phrase "at random" to be an instruction to limit the toolkit to the axioms of probability. In what world does that phrase "demand" anything? – Michael Lew Nov 24 '17 at 22:22
  • 4
    Your replies miss the point. All I ask is what you could possibly mean by "at random" if not (the obviously intended) *uniformly at random* and, regardless, how you propose to reason about an explicitly stated random process if not with some theory of randomness? – whuber Nov 24 '17 at 22:24
  • @whuber Do you mean to say that my first paragraph is not something that can be called reasoning? The problem of balls in the urn does not 'demand' any particular type of reasoning. The fact that probabilistic reasoning leads reliably to a misleading result suggests that such reasoning is not useful for this problem. The question, then, is why is the probabilistic reasoning unhelpful in this context. Jaynes provided an extensive take on that question. cmaster and enumaris provide similar analyses to Jaynes, and they seem entirely correct to me. – Michael Lew Nov 25 '17 at 00:01
  • 5
    I have yet to see any valid probabilistic reasoning in your post, Michael. – whuber Nov 25 '17 at 00:13
  • @whuber There is no requirement for probabilistic reasoning. The presence of the phrase "at random" is not a demand for probabilistic reasoning, as I have written above. I do not understand your insistence. – Michael Lew Nov 25 '17 at 00:19
  • 1
    -1. As others here commented already, if at each step number $n$ ball number $n$ is removed then clearly there are no balls left at midnight (because each ball is going to be removed at the corresponding step), despite 10 balls being added and only one removed at each step. It does not make sense to talk about the probabilistic version of this "paradox", before this simple non-probabilistic version is understood. Cc to @whuber. – amoeba Nov 25 '17 at 20:30
  • I'd like to know what container the balls were in before the urn, and if it's empty at 12PM, and supposing the balls go into a third vessel post-urn, how that's doing at 12PM. Also, supposing the random ball `n` removed had to be numbered `10n-1`. The same number of removals, but how many balls are in the urn and vessel #3 at 12PM. (Presumably, given the premises the set theory folks use, there'd be infinitely many in both urn and vessel, just from reading the number on a ball.) – agc Nov 30 '17 at 05:13
  • Craig Few people would consider that to be the intended meaning of "random." Although you certainly could interpret the question that way, your answer would likely satisfy nobody that it either has resolved the paradox or shed much light on the issues. – whuber Dec 03 '17 at 15:16
  • 1
    -1. Any attempt at solving the problem using only the ball count fails, because there is no rigorous principle that enables us to determine the noon ball count from the pre-noon ball count alone. We do not know that the ball count as a function of time, $f(t)$, is continuous at $t=1$ hour past 11AM. Ross's rigorous mathematical solution, which accounts for each individual ball separately, proves that $\lim_{t \rightarrow 1} f(t)=\infty$ but $f(1)=0$. ***Your proof relies on an unproven assumption which is in fact proven false by a more rigorous analysis of the problem.*** – Paul Dec 06 '17 at 05:49
  • As for the rest, Jaynes's comment is irrelevant to this problem because the way the limit is approached is perfectly well-specified. – Paul Dec 06 '17 at 05:50
  • @MichaelLew - "...answers depend on how the limit was approached." I think that applies in this case. Ross considers only the set $\{ lim_{n\rightarrow\infty} P(E_{in})\}\|{i=1,2,...\infty}$. Notice the index ratios $\{ i/(n\rightarrow\infty \}|i=1,2,...\infty$ always equal 0, we never get to $\infty/\infty$=1. Consider instead the set $\{ \lim_{N\rightarrow\infty} P(E_{a*N,b*N})\|\forall(a,b) a>1,b>1,a<=b\}$. In there we find the non-zero pointwise probabilities of $\lim_{N\rightarrow\infty} P(E_{a*N,b*N}) = (a/b)^{(1/9)}$. See below. https://stats.stackexchange.com/a/316784/122600 – Craig Hicks Dec 21 '17 at 08:23
3

The aim of this post is to argue for the OPs last option that we need a better formulation. Or at least, Ross proof is not as clear cut as it may seem at first, and certainly, the proof is not so intuitive that is in a good position to be in an introduction course for theory of probability. It requires much explanation both in understanding the paradoxical aspects, and once that has been cleared explanation at the points where Ross' proof passes very quickly, making it difficult to see which axioms, theorems, and implicit interpretations that the proof depends on.

Related to this aspect it is very amusing to read Teun Koetsier's final words in "Didactiek met oneindig veel pingpongballen?"

Als we niet oppassen dan wordt het 'Paradoxes a window to confusion'.

Translated "If we aren't carefull then it becomes 'Paradoxes a window to confusion'"

Below is a description of the "regular" arguments that may pass in discussions on supertasks, and more specifically the deterministic Ross-Littlewood paradox. After this, when we set all this discussion aside, a view is given of the special case of the probabilistic Ross-Littlewood paradox as providing additional elements, which however get lost and confusing in the wider setting with supertasks.

Three deterministic cases and discussion on supertasks

The Ross-Littlewood paradox knows many different outcomes depending on the manner in which the balls are displaced from the urn. To investigate these, let's kick off by using the exact problem description as Littlewood describes as the 5th problem in his 1953 manuscript

Version 1 The set of balls remaining in the urn is empty

The Ross-Littlewood paradox, or Littlewood-Ross paradox, first appeared as the 5th problem in Littlewood's 1953 manuscript "a mathematician's miscellany"

An infinity paradox. Balls numbered 1, 2, ... (or for a mathematician the numbers themselves) are put into a box as follows. At 1 minute to noon the numbers 1 to 10 are put in, and the number 1 is taken out. At 1/2 minute to noon numbers 11 to 20 are put in and the number 2 is taken out and so on. How many are in the box at noon?

Littlewood is short about this problem, but gives a nice representation as the set of points:

$P_{1} + P_{2}+ ... + P_{10} - P_1 + P_{11} + ... + P_{20} - P_2 + ...$

for which it is easily noticed that it is 'null'.

Version 2 The set of balls remaining in the urn has infinite size

Ross (1976) adds two more versions to this paradox. First we look at the first addition:

Suppose that we possess an infinitely large urn and an infinite collection of balls labeled ball number 1, number 2, number 3, and so on. Consider an experiment performed as follows: At 1 minute to 12 P.M. , balls numbered 1 through 10 are placed in the urn and ball number 10 is withdrawn. (Assume that the withdrawal takes no time.) At 12 minute to 12 P.M. , balls numbered 11 through 20 are placed in the urn and ball number 20 is withdrawn. At 14 minute to 12 P.M. , balls numbered 21 through 30 are placed in the urn and ball number 30 is withdrawn. At 18 minute to 12 P.M. , and so on. The question of interest is, How many balls are in the urn at 12 P.M. ?

Obviously the answer is infinity since this procedure leaves all the balls with numbers $x \mod 10 \neq 0$ in the urn, which are infinitely many.

Before we move on to Ross' second addition, which included probabilities, we move on to another case.

Version 3 The set of balls remaining in the urn is a finite set of arbitrary size

The urn can have any number of balls at 12 p.m. depending on the procedure of displacing the balls. This variation has been described in by Tymoczko and Henle (1995) as the tennis ball problem.

Tom is in a large box, empty except for himself. Jim is standing outside the box with an infinite number of tennis balls (numbered 1, 2, 3, ....). Jim throws balls 1 and 2 into the box. Tom picks up a tennis ball and throws it out. Next Jim throws in balls 3 and 4. Tom picks up a ball and throws it out. Next Jim throws in balls 5 and 6. Tom picks up a ball and throws it out. This process goes on an infinite number of times until Jim has thrown all the balls in. Once again, we ask you to accept accomplishing an infinite number of tasks in a finite period of time. Here is the question: How many balls are in the box with Tom when the action is over?

The answer is somewhat disturbing: It depends. Not enough information has been given to answer the question. There might be an infinite number of balls left, or there might be none.

In the textbook example they argue for the two cases, either infinite or finite (Tymoczko and Henle, leave the intermediate case as an exercise), however the problem is taken further in several journal articles in which the problem is generalized such that we can get any number depending on the procedure followed.

Especially interesting are the articles on the combinatorial aspects of the problem (where the focus is, however, not on the aspects at infinity). For instance counting the number of possible sets that we can have at any time. In the case of adding 2 balls and removing 1 each step the results are simple and there the number of possible sets in the n-th step is the n+1-th catalan number. E.g. 2 possibilties {1},{2} in the first step, 5 possibilities {1,3}{1,4}{2,3}{2,4} and {3,4} in the second step, 14 in the third, 42 in the fourth, etcetera (see Merlin, Sprugnoli and Verri 2002, The tennis ball problem). This result has been generalized to different numbers of adding and substracting balls but this goes too far for this post now.

Arguments based on the concept of supertasks

Before getting to the theory of probability, many arguments can already be made against the deterministic cases and the possibility of completing the supertask. Also, one can question whether the set theoretic treatment is a valid representation of the kinematic representation of the supertask. I do not wish to argue whether these arguments are good or bad. I mention them to highlight that the probabilistic case can be contrasted with these 'supertask'-arguments and can be seen as containing additional elements that have nothing to do with supertasks. The probabilistic case has a unique and separate element (the reasoning with theory of probability) that is neither proven or refuted by arguing against or for the case of supertasks.

  • Continuity arguments: These arguments are often more conceptual. For instance the idea that the supertask can not be finished such as Aksakal and Joshua argue in their answers, and a clear demonstration of these notions is Thomson's lamp, which in the case of the Ross Littlewood paradox would be like asking, was the last removed number odd or even?

  • Physical arguments: There exist also arguments that challenge the mathematical construction as being relevant to the physical realization of the problem. We can have a rigorous mathematical treatement of a problem, but a question remains whether this really has bearing on a mechanistic execution of the task (beyond the simplistic notions such as breaking certain barriers of the physical world as speed limits or energy/space requirements).

    • One argument might be that the set-theoretic limit is a mathematical concept that not necessarily describes the physical reality

      For example consider the following different problem: The urn has a ball inside which we do not move. Each step we erase the number previously written on the ball and rewrite a new, lower, number on it. Will the urn be empty after infinitely many steps? In this case it seems a bit more absurd to use the set theoretic limit, which is the empty set. This limit is nice as a mathematical reasoning, but does it represent the physical nature of the problem? If we allow balls to disappear from urns because of abstract mathematical reasoning (which, maybe should be considered more as a different problem) then just as well we might make the entire urn disappear?

    • Also, the differentiation of the balls and assigning them an ordering seems "unphysical" (it is relevant to the mathematical treatment of sets, but do the balls in the urn behave like those sets?). If we would reshuffle the balls at each step (e.g. each step randomly switch a ball from the discarded pile with a ball from the remaining pile of infinite balls), thus forgetting the numbering based on either when they enter the urn or the number they got from the beginning, then the arguments based on set theoretic limits makes no sense anymore because the sets do not converge (there is no stable solution once a ball has been discarded from the urn, it can return again).

      From the perspective of performing the physical tasks of filling and emptying the urn it seems like it should not matter whether or not we have numbers on the balls. This makes the set theoretic reasoning more like a mathematical thought about infinite sets rather than the actual process.

Anyway, If we insist on the use of these infinite paradoxes for didactic purposes, and thus, before we get to the theory of probability, we first need to fight for getting an acceptable idea of (certain) supertasks accepted by the most skeptical/stubborn thinkers, then it may be interesting to use the correspondence between the Zeno's paradox and the Ross-Littlewood paradox described by Allis and Koetsier (1995) and shortly described below.

In their analogy Achilles is trying to catch up the turtle while both of them cross flags that are placed in such a way, with distance $$F(n)=2^{-10 \log n}$$ such that the distance of Achilles with $n$ flags is twice the distance of the turtle with $10n$ flags, namely $F(n) = 2 F(10n)$. Then until 12.pm. the difference in the flags that the turtle and Achilles will have past is growing. But, eventually at 12 p.m. nobody except the Eleatics would argue that they Achilles and the turtle reached the same point and (thus) have zero flags in between them.

Achilles and the turtle

The probabilistic case and how it adds new aspects to the problem.

The second version added by Ross (in his textbook), removes the balls based on random selection

Let us now suppose that whenever a ball is to be withdrawn, that ball is randomly selected from among those present. That is, suppose that at 1 minute to 12 P.M. balls numbered 1 through 10 are placed in the urn and a ball is randomly selected and withdrawn, and so on. In this case, how many balls are in the urn at 12 P.M. ?

Ross solution is that the probability is 1 for the urn being empty. However, while Ross' argumentation seems sound and rigorous, one might wonder what kind of axioms are necessary for this and which of the used theorems might be placed under stress by implicit assumptions that might be not founded in those axioms (for instance the presupposition that the events at noon can be assigned probabilities).

Ross' calculation is in short a combination of two elements that divides the event of a non-empty urn into countably many subsets/events and proves that for each of these events the probability is zero:

  1. For, $F_i$, the event that ball number $i$ is in the urn at 12 p.m., we have $P(F_1) = 0 $

  2. For, $P(\bigcup_1^\infty F_i)$, the probability that the urn is not empty at 12 p.m. we have

    $P(\bigcup_1^\infty F_i) \leq \sum_1^\infty P(F_i) = 0$

The probabilistic case of the Ross-Littlewood paradox, without reasoning about supertasks

In the most naked form of the paradox, stripping it from any problems with the performance of supertasks, we may wonder about the "simpler" problem of subtracting infinite sets. For instance in the three versions we get:

$$\begin{array} \\ S_{added} &= \lbrace 1,2,3,4,5,6,7,8,9,10 \rbrace + \lbrace10k \text{ with } k \in \mathbb{N} \rbrace \\ S_{removed,1} &= \lbrace k \text{ with } k \in \mathbb{N} \rbrace \\ S_{removed,2} &= \lbrace 10k \text{ with } k \in \mathbb{N} \rbrace \\ S_{removed,3} &= \lbrace k \text{ with } k \in \mathbb{N} \rbrace \setminus \lbrace a_1,a_2,a_3,... \text{ with } a_i \in \mathbb{N} \rbrace \end{array}$$

and the problem reduces to a set subtraction like $S_{added}-S_{removed,1} = \emptyset$.

Any infinite sequence, $S_{RL} =\lbrace a_k \text{ without repetitions and } a_k < 10k \rbrace$ , is a (equally) possible sequence that describes the order in which the balls can be removed in a probabilistic realization of the Ross-Littlewood problem. Lets call these infinite sequences RL-sequences.

Now, the more general question, without the paradoxical reasoning about supertasks, is about the density of RL sequences that do not contain the entire set $\mathbb{N}$

A graphical view of the problem.

nested, fractal, structure

Before the edited version of this answer I had made an argument that used the existence of an injective map from 'the infinite sequences that empty the urn' to 'the infinite sequences that do not contain the number 1'.

That is not a valid argument. Compare for instance with the density of the set of squares. There are infinitely many squares (and there is the bijective relation $n \mapsto n^2$ and $n^2 \mapsto n$), yet the set of squares have density zero in $\mathbb{N}$.

The image below creates a better view how, with each extra step, the probability of ball 1 in the urn is decreasing (and we can argue the same for all other balls). Even though the cardinality of the subset of all RL-sequences (the sequences of displaced balls) is equal to the cardinality of all the RL-sequences (the image displays a sort of fractal structure and the tree contains infinitely many copies of itselve).

growth of sample space, number of paths

The image shows all the possible realizations for the first five steps, with the scheme for the tennis ball problem (the tennis ball problem, each step: add 2 remove 1, grows less fast and is easier to display). The turquoise and purple lines display all possible paths that may unfold (imagine at each step $n$ we throw a dice of size $n+1$ and based on it's result we select one of the $n+1$ paths, or in other words based on the results we remove one of the $n+1$ balls in the urn).

The number of possible urn compositions (the boxes) increase as the n+1-th Catalan number $C_{n+1}$, and the total number of paths increase as the factorial $(n+1)!$. For the case of the urn compositions with ball number 1 inside (colored dark gray) and the paths leading to these boxes (purple), the numbers unfolds exactly the same however this time it is the n-th catalan number and the factorial $n!$.

density of paths that leave ball $n$ inside

So, for the paths that lead to an urn with the ball number 1 inside, the density is $\frac{(n)!}{(n+1)!}$ and decreases as $n$ becomes larger. While there are many realization that lead to finding ball number $n$ in the box, the probability approaches zero (I would argue that this does not make it impossible, but just almost surely not happening, and the main trick in Ross' argument is that the union of countable many null events is also a null event).

Example of paths for the first five steps in tennis ball problem (each step: add 2 remove 1) example of paths for first five steps in tennis ball problem

Ross' arguments for a certainly empty urn.

Ross defines the events (subsets of the sample space), $E_{in}$, that a ball numbered $i$ is in the urn at step $n$. (in his textbook he actually leaves out the subscript $i$ and argues for ball 1).

Proof step 1)

Ross uses his proposition 6.1. for increasing or decreasing sequences of events (e.g. decreasing is equivalent to $E_1 \supset E_2 \supset E_3 \supset E_4 \supset ...$).

Proposition 6.1: If $\lbrace E_n, n\geq 1 \rbrace$ is either an increasing or a decreasing sequence of events, then $$\lim_{n \to \infty} P(E_n) = P(\lim_{n \to \infty} E_n) $$

Using this proposition Ross states that the probability for observing ball $i$ at 12 p.m. (which is the event $lim_{n \to \infty} E_{in}$) is equal to

$$lim_{n \to \infty} P (E_{in})$$

Allis and Koetsier argue that this is one of those implicit assumptions. The supertask itselve does not (logically) imply what happens at 12 p.m. and solutions to the problem have to make implicit assumptions, which is in this case that we can use the principle of continuity on the set of balls inside the urn to state what happens at infinity. If a (set-theoretic) limit to infinity is a particular value, then at infinity we will have that particular value (there can be no sudden jump).

An interesting variant of the Ross-Littlewood paradox is when we also randomly return balls that had been discarded before. In that there won't be convergence (like Thomson's lamp) and we can not as easily define the limit of the sequences $E_{in}$ (which is not decreasing anymore).

Proof step 2)

The limit is calculated. This is a simple algebraic step.

$$ lim_{n \to \infty} P (E_{in}) = \prod_{k=i}^{\infty} \frac{9k}{9k+1} = 0$$

Proof step 3)

It is argued that step 1 and 2 works for all $i$ by a simple statement

"Similarly, we can show that $P(F_i)=0$ for all $i$"

where $F_i$ is the event that ball $i$ has been taken out of the urn when we have reached 12 p.m.

While this may be true, we may wonder about the product expression whose lower index now goes to infinity:

$$lim_{i \to \infty}(lim_{n \to \infty} P (E_{in})) = lim_{i \to \infty}\prod_{k=i}^{\infty} \frac{9k}{9k+1} = ... ?$$

I have not so much to say about it except that I hope that someone can explain to me whether it works.

It would also be nice to obtain better intuitive examples about the notion that the decreasing sequences $E_{in}, E_{in+1}, E_{in+2}, ...$, which are required for proposition 6.1, can not all start with the step number index, $n$, being equal to 1. This index should be increasing to infinity (which is not just the number of steps becoming infinite, but also the random selection of the ball that is to be discarded becomes infinite and the number of balls for which we observe the limit becomes infinite). While this technicality might be tackled (and maybe already has been done in the other answers, either implicitly or explicitly), a thorough and intuitive, explanation might be very helpful.

In this step 3 it becomes rather technical, while Ross is very short about it. Ross presupposes the existence of a probability space (or at least is not explicit about it) in which we can apply these operations at infinity, just the same way as we can apply the operations in finite subspaces.

The answer by ekvall provides a construction, using the extension theorem due to Ionescu-Tulcea, resulting in an infinite product space $\sum_{k=0}^\infty \Omega_i \bigotimes_{k=0}^\infty \mathcal{A}_i$ in which we can express the events $P(E_i)$ by the infinite product of probability kernels, resulting in the $P=0$.

However it is not spelled out in an intuitive sense. How can we show intuitively that the event space $E_{i}$ works? That it's complement is the null set (and not a number 1 with infinitly many zeros, such as is the solution in the adjusted version of the Ross-Littlewood problem by Allis and Koetsier) and that it is a probability space?

Proof step 4)

Boole's inequality is used to finalize the proof.

$$P\left( \bigcup_1^\infty F_i \right) \leq \sum_1^\infty P(F_i) = 0$$

The inequality is proven for sets of events which are finite or infinite countable. This is true for the $F_i$.

This proof by Ross is not a proof in a constuctivist sense. Instead of proving that the probability is almost 1 for the urn to be empty at 12 p.m., it is proving that the probability is almost 0 for the urn to be filled with any ball with a finite number on it.

Recollection

The deterministic Ross-Littlewood paradox contains explicitly the empty set (this is how this post started). This makes it less surprising that the probabilistic version ends up with the empty set, and the result (whether it is true or not) is not so much more paradoxical as the non-probabilistic RL versions. An interesting thought experiment is the following version of the RL problem:

  • Imagine starting with an urn that is full with infinitely many balls, and start randomly discarding balls from with it. This supertask, if it ends, must logically empty the urn. Since, if it was not empty we could have continued. (This thought experiment, however, stretches the notion of a supertask and has a vaguely defined end. Is it when the urn is empty or when we reach 12 p.m.?)

There is something unsatisfying about the technique of Ross' proof, or at least some better intuition and explanation with other examples might be needed in order to be able to fully appreciate the beauty of the proof. The 4 steps together form a mechanism that can be generalized and possibly applied to generate many other paradoxes (Although I have tried I did not succeed).

We may be able to generate a theorem such that for any other suitable sample space which increases in size towards infinity (the sample space of the RL problem has $card(2^\mathbb{N})$). If we can define a countable set of events $E_{ij}$ which are a decreasing sequence with a limit 0 as the step $j$ increases, then the probability of the event that is the union of those events goes to zero as we approach infinity. If we can make the union of the events to be the entire space (in the RL example the empty vase was not included in the union whose probability goes to zero, so no severe paradox occurred) then we can make a more severe paradox which challenges the consistency of the axioms in combination with transfinite deduction.

  • One such example (or an attempt to create on) is the infinitely often splitting of a bread into smaller pieces (in order to fulfill the mathematical conditions let's say we only make the splits into pieces that have the size of a positive rational number). For this example we can define events (at step x we have a piece of size x), which are decreasing sequences and the limit of the probability for the events goes to zero (likewise as the RL paradox, the decreasing sequences only occur further and further in time, and there is pointwise but not and uniform convergence).

    We would have to conclude that when we finish this supertask that the bread has disappeared. We can go into different directions here. 1) We could say that the solution is the empty set (although this solution is much less pleasant than in the RL paradox, because the empty set is not part of the sample space) 2) We could say there are infinitely many undefined pieces (e.g. the size of infinitely small) 3) or maybe we would have to conclude (after performing Ross' proof and finding empty) that this is not a supertask that can be completed? That the notion of finishing such a supertask can be made but does not necessarily "exist" (a sort of Russell's paradox).


A quote from Besicovitch printed in Littlewood's miscellany:

"a mathematician's reputation rests on the number of bad proofs he has given".


Allis, V., Koetsier, T. (1995), On Some Paradoxes of the Infinite II, The British Journal for the Philosophy of Science, pp. 235-247

Koetsier, T. (2012), Didactiek met oneindig veel pingpongballen, Nieuw Archief voor Wiskunde, 5/13 nr4, pp. 258-261 (dutch original, translation is possible via google and other methods)

Littlewood, J.E. (1953), A mathematician's Miscellany, pp. 5 (free link via archive.org)

Merlin, D., Sprugnoli, R., and Verri M.C. (2002), The tennis ball problem, Journal of Combinatorial Theory, pp. 307-344

Ross, S.M. (1976), A first course in probability, (section 2.7)

Tymoczko, T. and Henle, J. (1995 original) (1999 2nd edition reference on google), Sweet Reason: a field guide to modern logic

Sextus Empiricus
  • 43,080
  • 1
  • 72
  • 161
  • Comments are not for extended discussion; this conversation has been [moved to chat](https://chat.stackexchange.com/rooms/76774/discussion-on-answer-by-martijn-weterings-at-each-step-of-a-limiting-infinite-pr). – whuber Apr 29 '18 at 21:49
3

It's worth reading amoeba's answer that is just excellent and clarifies the problem very much. I don't exactly disagree with his answer but want to point out that the solution of the problem is based on a certain convention. What is interesting is that this sort of problem shows that this convention, while often used, is questionable.

Just as he says there is a technical point about proving that for each ball the probability to stay in the urn forever is 0. Apart from this point, the problem is not about probabilities. A deterministic equivalent may be given. It is much easier to understand. The key idea is: since every ball is absent from the urn from some point in time, the urn at the end is empty. If you represent the presence in the urn of each ball by a sequence of zeros and ones, each sequence is 0 from a certain range, thus its limit is 0.

Now the problem can be simplified even more. I call the moments 1, 2, 3.... for simplicity:

  • moment 1: put ball 1 in the urn
  • moment 2: remove it
  • moment 3: put ball 2 in the urn
  • moment 4: remove it
  • moment 5: put ball 3 in the urn
  • ...

What balls at the end (noon) ? With the same idea, same answer: none.

But fundamentally, there is no way to know, because the problem does not say what happens at noon. Actually, it is possible that at the end of times, Pikachu comes suddenly in the urn. Or maybe balls all suddenly collapse and merge into one big ball. Not meaning that this is meant to be realistic, it's just not specified.

The problem can only be answered if a certain convention tells us how to go to the limit: a continuity assumption. The state of the urn at noon is the limit of its states before. Where should we look for a continuity assumption that would help us answer to the question?

In physical laws? Physical laws ensure a certain continuity. I think of a simplistic classical model, not calling on to real modern physics. But fundamentally, physical laws would bring exactly the same questions as the mathematical ones: the way we choose to describe continuity for physical laws relies on asking the question mathematically: what is continuous, how?

We have to look for a continuity assumption in a more abstract way. The usual idea is to define the state of the urn as a function from the set of balls into $\{0;1\}$. 0 means absent, 1 means present. And to define continuity, we use product topology, aka pointwise convergence. We say that the state at noon, is the limit of the states before noon according to this topology. With this topology, there is a limit, and it is 0: an empty urn.

But now we modify the problem a little in order to challenge this topology:

  • moment 1: put ball 1 in the urn
  • moment 2: remove it
  • moment 3: put ball 1 in the urn
  • moment 4: remove it
  • moment 5: put ball 1 in the urn
  • ...

For the same topology, the sequence of states has no limit. That's where I start to see the paradox as a true paradox. For me this modified problem is essentially the same. Imagine you are the urn. You see balls coming and going. If you can't read the number on it, whether it is the same ball or another one does not change what's happening to you. Instead of seeing balls as individual distinct elements, you see them as a quantity of matter coming in and out. The continuity could naturally be defined by looking at variations of the quantity of matter. And there is indeed no limit. In a way this problem is the same as the original problem where you decide to ignore the ball identity, thus leading to a different metric and a different notion of convergence. And even if you could see the number on the balls, the state could be seen as just a flickering presence with a growing number.

In one case, the limit of the sequence of your states is "empty", in the other case the limit is undefined.

The formalization of the problem with the product topology fundamentally relies on separating what happens to each different ball, and thus creating a metric reflecting the "distinguishablitiy". Only because of this separation, a limit can be defined. The fact that this separation is so fundamental to the answer but not fundamental for describing "what's going on" in the urn (a point that is endlessly arguable), makes me think the solution is the consequence of a convention rather than a fundamental truth.

For me, the problem, when considered as purely abstract has a solution as long as the missing information is provided: that the state at noon is the limit of the previous states and limit in what sense. However, when thinking of this problem intuitively, the limit of the sequence of states is not something you can think in a single manner. Fundamentally, I think there is no way to answer.

Benoit Sanchez
  • 7,377
  • 21
  • 43
  • Benoit, what about the formulation by Ross and his calculation? – Sextus Empiricus Dec 01 '17 at 17:10
  • He assumes balls have an identity. From this assumption, the probability space is constructed (formalization), and you conclude there is no ball at the end. Change the formalization, you change the answer to the same problem "put ten new balls in and remove one at random at each step". – Benoit Sanchez Dec 01 '17 at 17:16
  • 1
    The answer to the original problem does not depend on the formalization. Your proposed problem variations are not different formalizations of the same problem, they are different problems. – Paul Dec 01 '17 at 17:51
  • Re-adding a ball you already removed is a physically different process than only adding new balls and yields a different result. The math knows this and gives the right answer to each problem, but for some reason people want to hand-wave and imagine that they're the same problem. They're not. – Paul Dec 01 '17 at 17:54
  • See my answer for a response to the notion that you can treat the balls as indistinguishable, anonymous, unlabeled, or what-have-you. It's unphysical except in quantum mechanics, and no one has claimed to even consider, let alone solve, a quantum mechanical version of this problem. – Paul Dec 01 '17 at 17:56
  • 1
    I agree with @Paul but just commenting here to say that I do find the example of putting 1 ball on odd steps and taking it out on even steps interesting. This series of urn states clearly does not have any limit which IMHO means that this "[supertask](https://en.wikipedia.org/wiki/Supertask)" is ill-defined and cannot be completed. This is in contrast to the supertask we are discussing here. – amoeba Dec 02 '17 at 13:53
  • @BenoitSanchez - Set theory asks questions that have an answer to your in/out problem. ---- Q: What is the supremum of the infinite set of cardinalities {‖S0‖,‖S1‖,...}? A: 1 ---- Q: What is the cardinality of the supremum of the infinite set of sets {S0,S1,...}? A: 1. ---- See my answer below (https://stats.stackexchange.com/a/316784/122600). ---- I'm upvoting you because you asked a question for which set theory DOES have easy simple answers, thus clarifying the issue. – Craig Hicks Dec 02 '17 at 15:46
  • 1
    Interesting rewrite Benoit! That's certainly one thought-provoking pair of supertasks. @Paul, don't miss the edit. – amoeba Dec 02 '17 at 22:56
  • Nice but once again, physically different problems. In this new variant a single ball undergoes divergent motion with no limiting position. Not true of the original, where no ball undergoes divergent motion. To conflate the two is yet another manifestation of the indistinguishability fallacy which is already refuted in my answer. – Paul Dec 02 '17 at 23:35
  • It doesn't matter if it looks the same to an observer unless you're doing relativity or quantum mechanics, in which case you should say so and get to busting out the Dirac brakets and all the fixings. – Paul Dec 02 '17 at 23:40
  • 1
    To me the numbers on the balls make all the difference in the world in Benoit's two new urn problems. It is the difference between having a very persistent recurrent visitor and watching a stampede. It's hard to say what happens to the recurrent visitor at noon, but with the stampede it is very easy to see that it will pass away leaving nothing behind. It is only when you ignore the critical fact of the balls' distinct identities that you lose perspective and everything looks confusingly the same. The numbers are there to remind us of those identities. Ignoring them is unphysical. – Paul Dec 03 '17 at 07:53
  • @Paul it seems it's not even possible to meaningfully define the event "ball 1 in the urn at 12 P.M." in terms of basic events ("ball 1 in urn at time x"). Whereas in the original problem this event is $\cap_{n \in \mathbb{N}}E_{1n}$, here this would not make sense. – Carlos Cinelli Dec 03 '17 at 08:01
  • 1
    Yes, I agree, for the recurrent single ball version. For the sequential numbered ball stampede, it's easy to prove that no ball is in the urn at noon. – Paul Dec 03 '17 at 08:05
  • @Carlos. In my way of thinking "ball x in the urn at 12 P.M." is not defined from what happens before noon in neither problems. For the original problem, you don't work directly on the event "no ball in the urn at noon". Instead you work with the event "all balls leave the urn forever before noon". Saying the two events are the same, even if quite natural, is a sort of hidden assumption, that is axiomatically outside probabilities and usual logics, and says something about time continuity. In my single ball problem, you can't do such a natural thing. – Benoit Sanchez Dec 03 '17 at 11:36
  • @BenoitSanchez This is some very legalistic special pleading. The idea that an object removed stays gone after it is removed is the most basic and innocuous assumption imaginable for a word problem, and would be accepted without dispute in any other context. This comment is on a level with saying "putting one orange with another orange doesn't necessarily make two oranges, because that requires special assumptions about space continuity that lie outside arithmetic theory" – Paul Dec 06 '17 at 05:16
  • Paul, sure I'm doing "devil advocating". The problem by Ross has something unusual in the sense the event considered is not about what happens before noon or even infinitely close to noon (just involving infinite disjunctions/conjunctions of time varying events), which can be found in probability exercises and fairly well done by evkall, but what happens at noon which is fairly different. One specific limit assumption is in not about probabilities at all, and not about using something like $\sigma$ additivity or that kind of limit. To me, this was not visible at first look. – Benoit Sanchez Dec 06 '17 at 11:06
  • Agreed that it is an unusual and unusually difficult problem. In the end, I contend that there is only one physically reasonable solution, and that is Ross's. – Paul Dec 06 '17 at 14:37
1

OK, I'll try again.

The answer is that the paradox is purely mathematical. Enumaris's and cmaster's answer's tell what is going on in one way, but this is another way to see the problem. The problem is how we deal with probabilities with infinities, as Jaynes has written about (see my other attempted answer for details).

An infinite series is usually treated as if it has no end, but in this problem there is an end time (12PM) and so logically, even if not mathematically, there is a last cycle of addition and removal of balls: the one that happens infinitessimally prior to 12PM. The existence of a 'last' cycle allows us to look at the probabilities backwards as well as forwards through time.

Consider the ten balls last added. For each of them their probability of being removed is zero because they are each just one of infinity balls that might be removed. Thus the probability that there will be at least ten balls remaining at 12PM is unity.

QED. A probabilistic argument that does not lead to nonsense.

Michael Lew
  • 10,995
  • 2
  • 29
  • 47
  • 4
    There is no last "cycle" in the problem in question any more than there is a last term in the sequence $a_n = 1 - 1/n$, $n = 1, 2, \dots$, which also has "an end time" at 1. – ekvall Nov 25 '17 at 22:23
  • @ekvall Can you say why the infinite series of cycles can end at 12 PM and yet not have an end? Seems like this is a circumstance where the conventionally assumed rules of infinities leads to ridiculous (not counter-intuitive, but wrong) outcomes. – Michael Lew Nov 25 '17 at 22:47
  • 2
    @MichaelLew: Consider the act of clapping your hands. Consider that there will come a point in time where your hands are 1/2 together. Then 1/4 together. Then 1/8 together. Consider that every time your hands halve the remaining distance to each other, they can always halve it *again*. This is a cycle that clearly has no end (what number of steps must you take before the next step will put the hands together?) but whose series very clearly has an end (or are you unable to clap?) – Vegard Nov 28 '17 at 02:04
  • @Vegard Consider not clapping your hands, because as they approach each other increasingly slowly. At a starting distance take 2 seconds to reach 1/2 the distance and in general $2^n$ more seconds to reach from $\frac{1}{2^{n-1}}$ of the distance $\frac{1}{2^n}$ of the distance. BTW for Michael +1. – Carl Dec 01 '17 at 02:55
  • @Carl Why would my hands approach eachother increasingly slowly? That would be a sad clap. Assume that my hands are moving at some constant velocity and then explain how I haven't covered an infinite amount of distance-halvings in finite time? Infinite series can converge, and the infinite series `1/2 + 1/4 + ... 1/2^n` does converge, as I assume anyone who has had entry-level calculus knows? But this is in response to the question of how an infinite series can be traversed in finite time without there being an actual end to the series itself, not some solution to the ball-problem. – Vegard Dec 05 '17 at 08:46
  • @Vegard No. The velocity of the hands is decreasing so that the clap happens at infinite time. However, infinite time is never reached so that your hands do not clap. Moreover, one cannot remove an infinite number of balls, and even if we suspend disbelief to assume that we could do so, $\infty-\infty$, which is what this problem is, or $0\times\infty$ in the probabilistic case, the answers are not zero, they are indeterminate. – Carl Dec 05 '17 at 23:17
  • @Carl Again, why would the velocity of my hands decrease? That's not the core of the question I am referring to, as I already stated. **Assuming constant velocity**, it is trivial to show that particular infinite series can be "completed" in finite time. Additionally, as also stated previously, **this is not a comment on the ball problem**, it's a response to an earlier comment under OP. – Vegard Dec 06 '17 at 07:34
  • @Vegard Well, because I assumed the velocity of hand motion decreases to give an example of how moving hands can fail to clap after a finite time elapses. It was an example. Re: *Assuming constant velocity, it is trivial to show that particular infinite series can be "completed" in finite time.* Is this relevant? If so, how, please? – Carl Dec 06 '17 at 12:58
  • @Carl Sure. In response to this question: `Can you say why the infinite series of cycles can end at 12 PM and yet not have an end`. My point was that it's trivial to show that the infinite series ends at 12PM despite there being no end to the series - as long as the series converges, anyway. – Vegard Dec 06 '17 at 13:53
  • @Vegard I know that. I think you are missing my point, which is that taking an infinite number of balls out of an urn is not physically achievable. Where are you going to put all those balls? Line them up like Paul does to infinity? How does your robot travel to infinity and back in zero time? You can stipulate an infinity of balls in abstract mathematical terms, but it is not physically achievable. I go further and suspend my disbelief in this discontinuity to show that I do not agree with zero balls remaining. – Carl Dec 08 '17 at 02:07
  • @Carl Of course it's not physically possible. But it's also not possible to have an urn that contains infinite balls. You're not disagreeing with the answer, you're disagreeing with the principal premise of the problem. However, the problem isn't about what is physically possible, it's a mathematical problem illustrated using physical terms as a metaphor. If the physical terms bother you that much, substitute it for some non-physical term, or that doesn't violate physicality. That's easy to do, because as I said, the problem isn't really about urns or balls, it's about sets and infinities. – Vegard Dec 08 '17 at 07:13
  • @Vegard Even when I suspend disbelief, the math still seems incorrect. It includes $0\times\infty=0$, which is wishful thinking, sets ordered by one method only when there are $n!$ permutations, and other silliness. – Carl Dec 08 '17 at 20:40
  • @ ekvall: The limit of $a_n=1−1/n, n=1,2,…$ is approached but not *reached* by the sequence. The limit simply exists. The final state of the urn however is constructed like every finite state from the preceding state - if we adhere to the given procedure *and if a final state exists!* – Wilhelm May 01 '18 at 14:57
  • @Vegard: Your hands will meet and Achilles will reach and overtake the tortoise like the hour hand will reach 12 o'clock. But that has nothing to do with an infinite subdivision of the distance. For proof try to start at the other end (with an infinitely small step). There is no infinite sum of 1/2 + 1/4 + 1/8 + ..., there is only a limit, approached but never reached by any term of the sequence of partial sums. – Wilhelm May 01 '18 at 15:00
  • @Wilhelm The series converges, which means there is a [well-defined sum](https://en.wikipedia.org/wiki/1/2_%2B_1/4_%2B_1/8_%2B_1/16_%2B_⋯). – Vegard May 01 '18 at 15:21
  • @Vegard: We call it sum, but it isn't a sum. Sums are restricted to a finite number of summands. It is a limit of the strictly monotonic sequence of partial sums. Strictly monotonic sequences never reach their limit. – Wilhelm May 01 '18 at 15:30
  • In your math they are so restricted, but the rest of the world has a vastly more powerful and useful math that is not so restricted – Paul May 01 '18 at 16:31
1

Recently several comments by Wilhelm, Wolfgang Mückenheim, caused me to reconsider certain formulations in my answer. I am posting this as a new answer mainly because the different approach of this answer, not arguing about the teaching of this problem, but instead about the paradox being invalid.

Wilhelm discusses in his lengthy manuscript that

Transactions are only possible at finite steps $n$ (there is no action possible "between all $n$ and $\omega$").

This reminded me of the term

$$\sum_{k=1}^\infty \prod_{n=k}^\infty \left( \frac{9n}{9n+1} \right)$$

which is derived from Ross' work. This term is indeterminate when the path to infinity is not defined for the following limit.

$$\lim_\limits{(l,m)\to(\infty,\infty)}\sum_{k=1}^l\prod_{n=k}^m\left(\frac{9n}{9n+1}\right)$$

This seems to resemble the point that Wilhelm discusses and is also mentioned in aksakal's answer. The steps in time become infinitely small, so we will be able to reach 12 p.m. in that sense, but we will at the same time need to add and remove an (unphysical) infinity number of balls. It is a false idea to attach this supertask to a process like Zeno's arrow, just like the switch of Thompson's paradoxical lamp can not have a definite position at the end of a supertask.

In terms of the limit we can say that the physical path to infinity that we take is

$$\lim_\limits{l\to \infty}\sum_{k=1}^l\prod_{n=k}^l\left(\frac{9n}{9n+1}\right) = \lim_\limits{l\to \infty} \frac{9l}{10}$$

so not zero but infinite.

Glorfindel
  • 700
  • 1
  • 9
  • 18
Sextus Empiricus
  • 43,080
  • 1
  • 72
  • 161
  • 2
    FYI, Wolfgang Mückenheim has been trolling math forums with nonsense for decades http://mathforum.org/kb/thread.jspa?forumID=13&threadID=2723432&messageID=9826413 – Paul May 01 '18 at 00:06
  • Thank you for that information, and to be honest I did not read the entire manuscript, although I do like a nice finitist argument, and his argument (troll or not) does make sense (which is not uncommon to trolling). Although I'd personally say that, if the steps become infinitely small, then we might have a (physical) process that is allowed to be considered infinite in number of steps . Sadly, it is not as much his trolling, and much more the mobbing with the votes against opposite positions (or in favour ones own) that spoils discussion in his thread and fuels trolling (or other). – Sextus Empiricus May 01 '18 at 01:25
  • @Martijn Weterings: It is easy to prove who is the troll here around: The idea of Cantor is the limit $\omega$ after 1, 2, 3, ... . First this violates mathematical induction, because before $\omega$ there is *always* another natural number. Second, to exclude any physical relevance of set theory, model the sequence by a merry-go-round where the revolutions are counted. Can there be a limit? (The collapse of the earth's orbit after having emitted gravitational waves for $10^{15}$ years is certainly not a result of set theory.) – Wilhelm May 01 '18 at 11:35
  • 1
    "First this violates mathematical induction, because before ω there is always another natural number." Mathematical Induction says nothing whatsoever about what should or should not be "before" ω. Limit ordinals are not generated by induction and induction has nothing to say about whether they exist or not. Your mind is full of false assumptions about how math should work, and when these false assumptions contradict the real mathematics, you blame the latter. – Paul May 01 '18 at 17:01
  • Mathematical induction says that for every $n$ there is $n+1$ and this does *never* change. The limit ordinal is assumed by mathematicians who are unable to comprehend the infinite. What means quantifying over all natural numbers? Does it mean to take only those natural numbers which have the characteristic property of every natural number, i.e., to be followed by infinitely many natural numbers? Then you don't get all of them because always infinitely many are remaining. Or do you take all natural numbers with no exception? – Wilhelm May 01 '18 at 18:02
  • Or do you take all natural numbers with no exception? Then you include some which are not followed by infinitely many and hence are not natural numbers because they are lacking the characteristic property of every natural number. That means you get more than all natural numbers. Conclusion: "All natural numbers" is a contradcition in itself. – Wilhelm May 01 '18 at 18:03
  • Wilhelm, Paul, could you take this discussion to some of the chats? – Sextus Empiricus May 01 '18 at 18:05
0

I believe that this example supports "if the premise is false then the conditional is true"

In this universe, there are no infinite urns and no infinite collection of balls. It is impossible to split time into arbitrarily small pieces.

Thus Sheldon Ross is right to say that the urn is empty at 12:00. Students who say that the urn has infinite balls at 12:00 are just as right.

If you answered the urn has 50 balls then you are also correct.

I have not rigorously proved that this universe does not contain infinite urns and infinite balls and that time is not atomic - I just believe those things. If you believe those three assertions are wrong, then you believe Ross's problem is empirically falsifiable. I am waiting for your experimental results.

emory
  • 179
  • 4
  • 2
    Are you also waiting for experimental results that $\pi$ is irrational on grounds that there is no way one can fit an infinite number of infinitly tiny triangles in a circle in this universe? – user603 Nov 25 '17 at 13:58
  • 3
    @user603 no, but I claim the last digit of pi is 7. Can you prove otherwise? – emory Nov 25 '17 at 14:04
  • 1
    indeed, it's a fair distinction. – user603 Nov 25 '17 at 14:05
  • 4
    -1. The problem is well-defined mathematically and the impossibility of physical realization has nothing to do with it. – amoeba Nov 26 '17 at 00:18
  • 1
    @amoeba is it well-defined? I am not alone in thinking otherwise. https://en.wikipedia.org/wiki/Ross%E2%80%93Littlewood_paradox#Problem_is_ill-formed most accurately reflects my views. Clearly https://en.wikipedia.org/wiki/Ross%E2%80%93Littlewood_paradox#Vase_contains_infinitely_many_balls is the most intuitive explanation. If that is not true, then there must be something "magical" going on between 11:59.99... and 12:00. To me that lends credence to https://en.wikipedia.org/wiki/Ross%E2%80%93Littlewood_paradox#Problem_is_underspecified ... we don't really know what will happen at 12:00. – emory Nov 26 '17 at 01:29
  • @CraigHicks the problem is stated in terms of physical operations. Thus I think it is defined at 12:00. However 12:00 will never happen. If we redefined the problem in terms of mathematical operations, then you would be correct. – emory Nov 29 '17 at 12:36
  • @CraigHicks the problem is defined in terms of adding and removing balls from an urn. Imagine the process stopped at some arbitrary point between 11:59 and 12:00 (i.e., 11:59.00, 11:59:30, 11:59.45, and no more). Then we could talk unambiguously about the state of the urn at 12:00.00 (and 13:00, 14:00, ...). I think I agree with you that if the problem was defined strictly in terms of mathematical operations, then the value would be undefined at 12:00. – emory Nov 29 '17 at 20:12
  • @CraigHicks I could let f(11:59)=0; f(11:59.30)=f(11:59)+9; f(11:59.45)=f(11:59.30)+9, and similarly so on. Then you would be correct that f(12:00) is undefined. For that matter, f(11:59.31) would be similarly undefined. But the way the problem is presented there is an unambiguous answer with respect to 11:59.31. – emory Nov 29 '17 at 22:08
  • 1
    +1 Imposing a favored outcome following a discontinuity of the $\pm\infty$, or for that matter, any discontinuity, while ignoring the limiting conditions prior to the discontinuity smacks of double-talk, having your cake and eating it too, or if you wish, plain foolishness. – Carl Nov 30 '17 at 00:03
  • 2
    I also find this question nonsense. If the urn is empty at 12:00, then there must have been a time when the last ball was removed. But at any given moment when a ball is removed, more balls are added so that the last ball is **not** removed. How can there be no time at which the last ball is removed? On the other hand, if at 12:00 the adding of balls has ceased, then there must be a time at which the last ball was added. But, if some ball was the last to have been added, there cannot be infinitely many balls in the urn. A process cannot have a beginning, be never-ending, and yet cease. – Kevin Dec 01 '17 at 04:23
  • @CraigHicks if the problem was defined in terms of the piecewise union of finite step values I would 100% agree with you but it is defined in terms of operations on balls and some people say that it depends on exactly how you remove the balls (at random versus remove every 10th ball, etc). When reducing the problem to the piecewise union function these fine distinctions get lost, therefor I choose not to reduce. – emory Dec 01 '17 at 16:22
  • @CraigHicks I am thinking that fundamentally we are in agreement about the conclusion and that you are probably right about the piecewise union function. I just choose not to make that reduction because some people say the answer depends on the way in which you remove the balls and if they were right about that then the reduction would be wrong. To be safer in my analysis, I do not consider the reduction. – emory Dec 01 '17 at 16:30
  • @emory: "I have not rigorously proved that this universe does not contain infinite urns and infinite balls" but if our knowledge is not completely wrong, then the "accessible part" of the universe contains not more than 10^80 atoms and will never contain an infinite number. Therefore the problem is purely theoretical. Originally it was introduced by Fraenkel in order to argue in favour of infinite bijections. Alas, we have to accept discontinuity of the cardinality function, contrary to the defined conditions and moreover contary to mathematics because there the limit is defined by continuity. – Wilhelm May 02 '18 at 10:08
0

I support the opinion that the problem is ill-posed. When we consider something transfinite we often have to use a limit. It seems that here it is the only way. Since we distinguish different balls, we have an infinite-dimensional process $$(X_{t,1}, X_{t,2},...),$$ where $t=-1,-1/2,-1/4,...$ stands for the time, $X_{t,j}=1$ if there is the ball $j$ at time $t+0$ and $X_{t,j}=0$ otherwise.

Now it is on each everyone's discretion which convergence to use: uniform, componentwise, $l_p$, etc. Needless to say, the answer depends on the choice.

The misunderstanding in this problem goes from neglecting the fact that metric issues are crucial when we consider convergence of infinite-dimensional vectors. Without choosing the type of convergence, no correct answer can be given.

(There is componentwise convergence to zero vector. While $l_1$ norm counts the number of balls, so in this norm the process is exploding.)

Viktor
  • 853
  • 1
  • 8
  • 22
  • Component-wise convergence yields the "standard" answer which is that the urn is empty at the limit. Of course this convergence is not uniform. But this does not mean that there is no convergence. – amoeba Dec 02 '17 at 14:08
  • @amoeba Convergence takes place for some distances and does not take place for other ones. So the statement of the problem needs refinement to be well-posed. – Viktor Dec 02 '17 at 14:11
  • 2
    "The urn is empty" if and only if every ball that had been put in was eventually taken out. That's the definition of emptiness. And it translates to component-wise convergence. – amoeba Dec 02 '17 at 14:35
  • 2
    I agree with this answer. First, what convergence notion to choose is totally independent of probability theory. It's not because we have an habit of using pointwise convergence/product topology (where here a point is a ball with a certain identity) that this notion must be used as the only option. It's not specified in the problem nor a general convention. And this even if we decide to agree totally with standard probability theory. – Benoit Sanchez Dec 02 '17 at 17:24
  • 1
    This is cargo cult math. You throw in metric issues because they matter on other problems, not because they're relevant to this problem. – Paul Dec 02 '17 at 18:57
  • 1
    @Paul "Cargo cult math". Never thought such a term exists. Will think over it. :) – Viktor Dec 02 '17 at 19:01
  • 1
    @Paul Metric issues are crucial if we consider convergence of infinite-dimensional objects. Convergence of infinite-dimensional objects usually makes almost no sense when the metric is not specified. – Viktor Dec 02 '17 at 19:12
  • 1
    The relevant metric is pointwise convergence. This is clearly implied by the problem scenario. All that matters is whether each individual ball is present in the limit. – Paul Dec 02 '17 at 19:29
  • 1
    @Paul The "relevant metric" must be stated explicitly. Without it the problem is ill-posed. – Viktor Dec 02 '17 at 19:33
  • 1
    This is a case where too much education makes people unaware of what is right in front of their nose. Get your nose out of the theory textbook and read the word problem. – Paul Dec 02 '17 at 19:34
  • 1
    @Paul The question is: "How many balls are in the urn?" How can I deduce that it is pointwise convergence? – Viktor Dec 02 '17 at 19:37
  • 2
    (+1) I agree that this problem is posited without a metric. Moreover, the answer of zero balls is also 1 ball at the same time so the zero ball answer is not a number. Countable infinity is not a number. Ill-posited question. There are indeed questions so ridiculous that they have no answers. – Carl Dec 03 '17 at 23:45
  • @Viktor - Ross considers only the set {limn→∞P(Ein)}‖i=1,2,...∞. Notice the index ratios {i/(n→∞}|i=1,2,...∞ always equal 0, we never get to ∞/∞=1. Consider instead the set {limN→∞P(Ea∗N,b∗N)‖∀(a,b)a>1,b>1,a<=b}. In there we find the non-zero pointwise probabilities of lim_{N→∞}P(Ea∗N,b∗N)=(a/b)^(1/9). Then all the intuitions are clarified by further algebraic results. See below. stats.stackexchange.com/a/316784/122600 – Craig Hicks Dec 21 '17 at 08:36
-2

More intuition than formal education, but:

If the intervals to midnight are halving, we never reach midnight... we only approach asymptotically; so one could argue that there is no solution.

Alternatively, depending on the phrasing:

  • as there are infinite intervals of +10 balls the answer is infinite
  • as there are infinite intervals of (+10 balls - 1) the answer is 10*infinite -1*infinite = 0?
  • as there are infinite intervals of (+9 balls) +1 the answer is infinite + 1
CGretski
  • 29
  • 2
  • 11
    It looks like you would agree with Zeno that [Achilles can never catch the tortoise](https://en.wikipedia.org/wiki/Zeno%27s_paradoxes#Achilles_and_the_tortoise); and worse, [neither can even get started in their race.](https://en.wikipedia.org/wiki/Zeno%27s_paradoxes#Dichotomy_paradox) – whuber Nov 24 '17 at 22:48
  • @whuber Those problems are not at all related to this answer. – Clearer Nov 26 '17 at 12:29
  • 2
    @Clearer I would like to suggest they are closely related through their naive treatment of "infinity." – whuber Nov 26 '17 at 13:39
  • 5
    -1 because it's 00:00 on my watch right now so I have just reached midnight despite remaining time periods halving ad infinitum during the last minute. – amoeba Nov 26 '17 at 23:00
  • @amoeba The discontinuity is that you have an infinite number of removed balls at that time. Exactly where are you keeping that number of balls? Are the balls infinitely small as well so that there is possibly room enough in the universe for us non-balls? Mind you, an infinite number of infinitely small balls could still occupy an infinite volume, and when you play with metrics the rules are not so naive as the posts here. – Carl Nov 29 '17 at 22:40
  • Midnight is not reached but we get infinitely close. An asymptotic limit can be very useful in engineering problems. When you say "only approach asymptotically" the only implies some deficiency, but that's not the case. The non intuitive (and also not useful from an engineering perspective) feature of Ross' analysis is the "jump" in the limit to 0, with no convergence. By reworking the approach to limit, we can obtain convergence - $\{ \lim_{N\rightarrow\infty} P(E_{a*N,b*N})\|\forall(a,b) a>1,b>1,a<=b\}$ instead of $\{ lim_{n\rightarrow\infty} P(E_{i,n})\}\|{i=1,2,...\infty}$, ...[cont] – Craig Hicks Dec 21 '17 at 16:24
  • [cont] .... we can obtain "intuitive" results with non-zero limits starting with the pointwise probabilities $\lim_{N\rightarrow\infty} P(E_{a*N,b*N}) = (a/b)^(1/9)$. See my answer below: https://stats.stackexchange.com/a/316784/122600 – Craig Hicks Dec 21 '17 at 16:27
  • @whuber: Achilles will reach and overtake the tortoise like the hour hand will reach 12 o'clock. But that has nothing to do with an infinite subdivision of the distance. For proof start at the other end (with an infinitely small step). There is no infinite sum of 1/2 + 1/4 + 1/8 + ... There is only a *limit*, approached but never reached by a term of the sequence of partial sums. – Wilhelm Apr 30 '18 at 13:34
-5

Rewrite: Jan 16, 2018

Section 1: Outline

The fundamental results this post are as follows:

  • The halfway ball has a probability of about $0.91$ of remaining in the limit as the step goes to $\infty$ - this is both a real world observation and is derived mathematically.
    The derived function has a domain of the rationals in $(0,1]$. For example, the probability in the limit of the halfway ball remaining corresponds to the domain value $1/2$. This function can computed the probability of remaining for any fraction of the step size.
  • Ross' analysis is not wrong but is incomplete because it attempts to iterate the rationals in order of magnitude $\left(i,\infty\right), i=1..\infty$.
    The rationals cannot be iterated in order of magnitude. Hence, Ross's analysis cannot access the full domain and can only offer a limited view of the total behavior.
  • Ross's analysis does however account for one particular observable behavior: in the limit it is not possible through serial iteration from 1 to reach the first remaining ballset.
  • Ross' limit sequences have some nice convincing properties that seem intuitively unique.
    However, we show another set of limit sequences which satisfy the same nice properties and give the values for our function.

Section 2 "Notation and terminology" covers notation and terminology used in this post.

Section 3 "The Halfway Ballset" introduces a real world observation - the converge in the limit of the probability of remaining of a ball whose index is a halfway through all the inserted balls. This limit value is about 91%. The case of the halfway ballset is generalized to any rational in $(0,1]$, which all have non-zero limit values.

Section 4 "Resolution of the Paradox" presents a unified framework for including both Ross' result and the 'rational-domain' results (described herein). As already noted, Ross' analysis an only offer a limited view of the total behavior. Hence, the source of the paradox is identified and resolved.

In the appendix some other results less important results are discussed:

  • "Expectations in the limit" calculates the expected number of balls remaining up to and including any fraction of the step size.
  • A corollary of this result is determining the index of the first ball which has an expectation of remaining greater than one.

Section 2: Notation and terminology

  • We label the ball indexes inserted at step $n$ as $\{n.1, n.2, n.3, ..... n.10\}$ and call this set the $n$th "ballset". Ballset is one word, created for this post.
    This terminology regrettably deviates from Ross' terminology, but it also makes the text much clearer and shorter.
  • The notation $E(a,b)$ refers to the event that ball $a.1$ in ballset $a$ remains at step $b$, ignoring the other balls in the ballset.
  • The notation $P(a,b)$ is an abbreviation for $P(E(a,b))$ and it refers to the probability of $E(a,b)$.
    Note that all balls $a.i$ in ballset $a$ have the same probability of remaining.
    -- The value of $P(E(a,b))$ is $\prod_{k=a}^{b} \frac{9k}{(9k+1)}$.
  • The Ross limit $P(a)$ is the probability $P(a,b)$ as $b$ goes to infinity:
    -- $P_{lim1}(a) = \lim_{b\rightarrow\infty} P(a,b)$
  • The rational-limit is defined as the limit as the both ball index $a$ and step $b$ go to infinity while maintaining constant ratio: -- $P_{lim2}(a,b) = \lim_{k\rightarrow\infty} P(ka,kb)$

Section 3: The halfway ballset

At every even step $2n$, the halfway ballset is defined as the $n$th ballset. At every even step $2n$, the halfway probability of remaining is defined as $P(n,2n)$.
In the limit as $n\rightarrow\infty$, the halfway probability of remaining is therefore $\lim_{n\rightarrow\infty} P(1*n,2*n)$.
Theorem 1 below gives a numerical value for the halfway probability of remaining.

Theorem 1 - Limit of probability of elements in a ratio-preserving domain sequence

\begin{equation} \lim_{n\rightarrow\infty} P(a*n,b*n) = (\frac{a}{b})^\frac{1}{9} \end{equation} The proof is given below just before the appendix.

By Theorem 1, the halfway probability of remaining in the limit is $(\frac{1}{2})^\frac{1}{9}$ which evaluates to an approximate decimal value of $0.925875$.

Sanity Check Lets do a sanity check to see if the numerical limit for the halfway probability "looks right".

\begin{array}{|l|rcl|} \hline n & P(n/2,n) &=& \text{trunc decimal val} \\ \hline 1000 & P(500,1000) &=& 0.92572614082 \\ \hline 10000 & P(5000,10000) &=& 0.9258598528 \\ \hline 100000 & P(50000,100000) &=& 0.925873226 \\ \hline 1000000 & P(500000,1000000) &=& 0.92587456 \\ \hline \hline \infty & \lim_{n\rightarrow\infty} P(n,2n) &=& 0.925875 \\ \hline \end{array}

The first 4 rows are the halfway probabilities of remaining for the step number values of $10^3$, $10^4$, $10^5$, and $10^6$, respectively. The final row is the limit. It seems the halfway probabilities are indeed converging to the predicted limit.
This real world observation, which does not fit within Ross's framework, needs to be explained.

** Section 4 "Resolution of the Paradox" **

This section explains a unified framework for both Ross' analysis and the rational-domain analysis, By viewing them together the paradox is resolved.

The rational limit $P_{lim2}(a,b)$ is reducible to a function from the rationals $(0,1]$ to the reals $(0,1]$: \begin{array}{rcl} P_{lim2}(a,b) &=& lim_{k\rightarrow\infty} P(ka,kb) \\ &=& (\frac{a'}{b'})^\frac{1}{9} \end{array} where $gcd(a',b')=1$ and $\frac{a'}{b'} = \frac{a}{b}$. Here $gcd()$ indicates greatest common divisor. Equivalently statements are "$a'$ and $b'$ are mutually prime", and "$\frac{a'}{b'}$ is the reduced fraction of $\frac{a}{b}$.

The Ross limit can be written as the limit of a sequence of rational limits: \begin{array}{rclr} P_{lim1}(a) &=& lim_{k\rightarrow\infty} P(a,k) & \\ &=& lim_{i,k\rightarrow\infty} P(ka/i,kb) & \text{for some } b \\ &=& lim_{i\rightarrow\infty} P_{lim2}(a/i,b) & \\ &=& lim_{i\rightarrow\infty} P_{lim2}(0,b) & \end{array} The tuple $(0,b)$ is not a member of the rationals in $(0,1]$; it belongs to $[0,0]$. Therefore the Ross limit is isomorphic to the function $P_{lim2}(a,b)$ on domain $[0,0]$ and its image is always the unique real $0$.

The Ross limit and the rational-limit are the same function on two disjoint domains $[0,0]$ and $(0,1]$ respectively. The Ross limit only considers the case of ballset indexes which have been demoted to be infinitely small relative to the stepsize.

The Ross-limit analysis predicts that in the limit, accessing the values $P_{lim1}(i)$ sequentially for $i=1,2,...\infty$ will never reach a non-zero value.
This is a correct and corresponds to real world observation.

The rational-limit analysis accounts for real world observations such as the halfway ballset which the Ross-limit does not account for. The function is the same $P_{lim2}(a,b)$ but the domain is $(0,1]$ instead of $[0,0]$

The diagram below depicts the both the Ross limit sequences and the rational limit sequences.

enter image description here

It is probably fair to say that Ross' analysis includes an implicit assumption that the Ross-limit and its domain is the entire domain of interest. The intuition implicitly underlying Ross' assumption is like due to the four conditions below even if they are not explicitly recognized:

Let $S_i = P(i,n), n=1,...,\infty$ be the $i$th Roth limit sequence. Let $S = \cup_{i=(1...\infty)} S_i$ be the union of Roth limit sequences.

  • (1) The sequences $S_i$ are disjoint and each sequence converges.
  • (2) The union of elements of all sequences $S$ cover exactly the set of all (ball,step) tuples coming into play: $\{ (i,n)\ |\ i\leq n\ \land \ i,n\in Q \}$
  • (3) All of the sequences $S_i$ are infinite in $n$, the step index, so they don't terminate "early".
  • (4) The sequences $S_i$ themselves form a super-sequence $\{ S_i \}_{i_in (1...\infty) }$. Therefore that super-sequence can be "created" iteratively, i,e,, they are countable.

It's not immediately apparent that another system of limit sequences could satisfy the above points (1) - (4).

However, we will now discuss another system of limit sequences which do indeed satisfy the above points (1) - (4).

Let $S_{p,q}$, where $gcd(p,q)=1$, represent the rational-limit sequence \begin{equation} S_{p,q} = \{ (kp,kq) \}_{k\in (1...\infty)} \end{equation} Let $D^*$ be the mutually prime tuples of $D$: = $D^* =\{ (p,q)\in D \land gcd(p,q)=1 \}$ . Let $S^*$ be the union of said rational limit sequences: $S^* = \cup_{d\in D^*} S_{p,q}$

Clearly the sequences $S_{p,q}$ whose union is $S^*$ satisfy the above properties (1) - (3).
The indexes $(p,q)$ are exactly the rationals on $(0,1]$. To satisfy condition (4) we need to show that the rationals on $(0,1]$ are countable.

The (Farey sequence)2 of order $n$ is the sequence of completely reduced fractions between 0 and 1 which when in lowest terms have denominators less than or equal to $n$, arranged in order of increasing size. Here are the first eight Farey sequences:

 F1 = {0/1,                                                                                                          1/1}
 F2 = {0/1,                                                   1/2,                                                   1/1}
 F3 = {0/1,                               1/3,                1/2,                2/3,                               1/1}
 F4 = {0/1,                     1/4,      1/3,                1/2,                2/3,      3/4,                     1/1}
 F5 = {0/1,                1/5, 1/4,      1/3,      2/5,      1/2,      3/5,      2/3,      3/4, 4/5,                1/1}
 F6 = {0/1,           1/6, 1/5, 1/4,      1/3,      2/5,      1/2,      3/5,      2/3,      3/4, 4/5, 5/6,           1/1}
 F7 = {0/1,      1/7, 1/6, 1/5, 1/4, 2/7, 1/3,      2/5, 3/7, 1/2, 4/7, 3/5,      2/3, 5/7, 3/4, 4/5, 5/6, 6/7,      1/1}
 F8 = {0/1, 1/8, 1/7, 1/6, 1/5, 1/4, 2/7, 1/3, 3/8, 2/5, 3/7, 1/2, 4/7, 3/5, 5/8, 2/3, 5/7, 3/4, 4/5, 5/6, 6/7, 7/8, 1/1}

Let $F^*_n$ represent the $n$th Farey sequence without the first element $0/1$.

Let $S^*_n$ be the union of rational limit sequences which have at least one element up to and including step $n$: \begin{equation} S^*_n = \{ S_{p,q}\ |\ \exists (a,b) \} \end{equation}

The elements of $F^*_n$ index, converted from fractions to tuples, exactly index the elements of $S^*_n$. The following table compares the grouping of the limit sequences in the Ross analysis and the rational limit analysis:

\begin{array}{|c|c|c|} \hline & \text{Ross} & \text{rational} \\ \hline \text{num new seq per step } & 1 & \text{multiple (generally)} \\ \hline \text{new seq at step } n & S_n & F^*_n - F^*_{n-1} \\ \hline \text{tot num seq up to step }n & n & \| F^*_n \| \\ \hline \text{super-seq up to step }n & \{ S_m \}_{m=1}^{n} & F^*_n \\ \hline \end{array}

Finally, since methods exist [3],[4] for iteratively creating the super sequence $ F^*_n $, the condition (4) is also satisfied.

One of those methods, a variant of the Stern-Brocot tree, is as follows:

The mediant of two rationals $a/c$ and $b/d$ is defined as $\frac{a+b}{c+d}$

  • Set $F*_n = \emptyset$
  • Append $1/n$ to $F*_n$
  • Loop for $i$ in $1...(\|F*_{n-1}\|-1)$

    • Append $F*_{n-1}[i]$ to F*_n$

    • Let $x = mediant(F*_{n-1}[i], F*_{n-1}[i+1])$

    • If $denom(x) \leq n$ append x to $F*_n$
    • continue loop
  • Append $F*_{n-1}[n]$ to $F*_n$

The paradox has been resolved.

Proof of Theorem 1 First note that: \begin{eqnarray} P(E_{a,b}) &=& \prod_{k=a}^{b} \frac{9k}{(9k+1)} \\ &=& \frac{\Gamma \left(a+\frac{1}{9}\right) \Gamma (b+1)}{\Gamma (a) \Gamma\left(b+\frac{10}{9}\right)} \\ &=& (a-1)^{\frac{1}{2}-a} \left(a-\frac{8}{9}\right)^{a-\frac{7}{18}} b^{b+\frac{1}{2}} \left(b+\frac{1}{9}\right)^{-b-\frac{11}{18}} \end{eqnarray} where the last transformation is the Sterling transformation.

Then, syntactically substituting $a\rightarrow a*n$ and $b\rightarrow b*n$ into the last (Sterling form) equation we get \begin{eqnarray} \lim_{n\rightarrow\infty} P(E_{a,b}) &=& \lim_{n\rightarrow\infty} (a M-1)^{\frac{1}{2}-a M} \left(a M-\frac{8}{9}\right)^{a M-\frac{7}{18}} (b M)^{b M+\frac{1}{2}} \left(b M+\frac{1}{9}\right)^{-b M-\frac{11}{18}} \\ &=& \left(\frac{a}{b}\right)^\frac{1}{9} \end{eqnarray}

Appendix: Other results

Expectations in the limit

This section gives a closed expression for the expected number of balls remaining up to and including any fraction of the step size.
A corollary of this result is a numerical approximation of the index of the first ball which has an expectation of remaining greater than one.

( To be continued )

Craig Hicks
  • 139
  • 6
  • 1
    Please do not post two of the same answer to two different questions. – Glen_b Dec 02 '17 at 10:55
  • @Glen_b - I have completely rewritten my answer, changing to a purely mathematical and statistical approach. No philosophy, sets, counting, or computational science. I think this is in keeping with this board. Perhaps I could post it as a new answer? I would appreciate your consideration. – Craig Hicks Dec 20 '17 at 23:55
  • 1
    I'm not sure what you're asking me to do here. If you really think you have a different answer you can post it. – Glen_b Dec 21 '17 at 02:28
  • @Glen_b Craig cannot post another answer because this thread is protected and his reputation (minus association bonus) is currently negative. I am not sure there is any way to help him apart from temporarily removing the protection. Craig, a better solution would be for you to post some other answers in other threads, get a couple of upvotes, and accumulate enough rep to be able to post here. – amoeba Dec 21 '17 at 21:39
  • @amoeba - I have condensed the answer to deliver the message in fewer words. Did you read it? In the $(i,n)$ >DOMAIN< space, each Ross limit approaches an $i/n$ ratio of zero. In comparison the ratio preserving limits of $\lim_{n\rightarrow \infty} \frac{a*n}{b*n}$ preserve the ratio $a/b$. Do you see my point? – Craig Hicks Dec 23 '17 at 20:27