Say I have the model $Y_{i} = m(x_{i}) + \epsilon_{i}$ and $Y_{i}$ and $X_{i}$ are two mutually independent i.i.d. sequences.
Then, how can I show that the Nadaraya Watson estimator is unbiased for this model, regardless of the bandwidth? And what is the intuition behind that?
I know that $E(\epsilon_{i})=0$ and that by the LIE we get $E(E(\epsilon_{i}|X_{i}))=E(\epsilon_{i})=0$ and since we have an i.i.d. sequence it also holds that $E(\epsilon|X)=0$, i.e. for all i's. But how should I proceed?
Edit: The NW estimator given as follows:
$ \hat{m}(x)= \frac{\sum K(\frac{X_{i}-x}{h})Y_{i}}{\sum K(\frac{X_{i}-x}{h})}=\sum W_{i}(x)Y_{i}=\sum x(X'X)^{-1}X_{i}Y_{i} $