Stone (1977) considers the problem of the choice of predicting density for $y$ given $x$ from a prescribed class of formal predicting densities $\{f(y|x,\alpha,S), \alpha \in \mathscr{A}\}$ whose members are indexed by the choice parameter $\alpha$. He shows that AIC and LOOCV (leave-one-out cross validation) are asymptotically equivalent provided that the following assumption holds:
The conditional distribution of $y$ given $x$ in the distribution $P$ is $f(y|x,\theta^*)$ for some unique $\theta^* \in \Theta$, that is, the conventional model $\{f(y|x,\theta),\theta \in \Theta)\}$ is true.
I am having a hard time understanding this formal requirement and using it in applications.
Could anyone illustrate when this assumption holds vs. when it fails by an example and a counterexample?
References
- Stone, M. (1977). An asymptotic equivalence of choice of model by cross‐validation and Akaike's criterion. Journal of the Royal Statistical Society: Series B (Methodological), 39(1), 44-47.