Identifying the distribution of data is essentially impossible.
The class of distribution functions is very large; it must be at least as large as the cardinality of $\mathbb{R}$ (e.g. consider only the unit step functions, corresponding to a constant value at some $x$ - there are as many of those as the cardinality of $\mathbb{R}$, so it must be at least that large).
Further, any cdf has an infinity of "near neighbors" that at a given sample size are hard to tell from any given distribution. (e.g. if we consider the KS-statistic to tell them apart, there's an infinite number of distributions close enough to the true distribution that a test at some sample size won't be able to tell the difference).
So the idea that we can say this must be the distribution, on the basis of a sample is a hopeless task.
If we restrict ourselves to some small list of candidates, then at some large sample size we might hope to rule out almost all the list (which sounds useful) ... but then we may actually end up ruling out the entire list (and indeed as sample sizes become large, this becomes essentially a certainty, because the chances our list includes the actual distribution of the data will be essentially zero).
[Further, tools which have a very large list of distribution families to choose from often "overfit", which can be counterproductive. One hopes that they might eventually catch up with some of the ideas which help avoid this problem, but even then, in general domain knowledge is going to be a better tool for good model choice than some arbitrary list of distributions.]
Indeed the entire approach seems pointless, because not only are real distributions typically going to be more complex than we can ever hope to identify (e.g. we might conceive of them consisting of arbitrary mixtures), knowing the true distribution class would be effectively useless as a model (with more parameters than observations, for example).
Probability models are in general little more than (hopefully) useful approximations. We should treat them as such.
So our interest should not be on identifying the distribution, but a distribution - one which adequately describes the situation well enough for our purposes, but which is simple enough for us to do something with at the sample sizes we can get.