I am trying to compare the errors from two statistical models in order to give evidence to one being "better" in terms of lower prediction error than the other.
To formalize this, I thought that a test of stochastic dominance between two collections of random variables (the OOS errors) would be a good idea. Ideally the null hypothesis would be :
$$\mathbb{H}_0: F(x) \ge G(x) \forall x \in \mathbb{R} $$
I have found resources pointing me to the Kruskal-Wallis test, but unfortunately cannot seem to find a paper explicitly stating and proving one of these (or similar) null hypotheses. Many sources I check simply state that the null is that the medians of the two distributions differ, but this is not what I want to check. Any help is appreciated.