Let $Y$ be a real random variable and $X$ be a real random vector. In a nonparametric model with additive noise, we assume the relationship
$$Y = f(X) + \epsilon$$
for some unknown regression function $f$ and noise $\epsilon$. This assumption is in contrast to the general nonparametric model, where no assumption about the additivity about the noise is made. Now, I'm wondering why not every model can be written in the former form.
We can always take $f(x) := E[Y \,|\, X = x]$ and $\epsilon = Y - f(X)$. This gives the form
$$Y = f(X) + \epsilon$$
Moreover, we find $E[\epsilon] = 0$ and $E[X\epsilon] = 0$. Am I missing some assumption for the nonparametric regression model with additive noise that is not satisfied here? Otherwise, it seems to me, that the general nonparametric regression model and the nonparametric regression model with additive noise are equivalent.