Wikipedia's nice article on the histogram contains the following sentence:
"Using wider bins where the density of the underlying data points is low reduces noise due to sampling randomness; using narrower bins where the density is high (so the signal drowns the noise) gives greater precision to the density estimation."
However, the article gives no source for this claim that could be cited.
Does anyone know of a specific paper in the literature that makes this point?
Searching Google for the relevant keywords has not turned up anything.