This is the dilemma that I have faced in applied probability in general. Say you have the choice to put your savings of $\$10$ in a deposit account with guaranteed retun of $\$100$ or buy a lottery ticket with that money where you can win $\$100,000$ with a probablity of 0.01 and lose everything with a probablity of 0.99. Some people mighht say the expected winnings are $\$100$ in the first case and $\$1000$ in the second case so we should buy the lottery. Some might say since we are highly likely to lose everything in the second case and hence we should chose the first option.
In this context, what is the right choice for RL and multi-armed bandit problems?