I am trying to get some intuition on why m-out-of-n bootstrap works but haven't been able to find good explanation. I would really appreciate any input on this.
I think I do understand what bootstrap is about -- estimating how $\sqrt{n}(T_n(X_1,...,X_n)-T(X;F))$ behaves using $\sqrt{n}(T_n(X_1^*,...,X_n^*)-T(X;\hat{F_n}))$. ($X_1,...,X_n$ drawn from $F$, the true CDF. And $X_1^*,...,X_n^*$ drawn from $\hat{F_n}$, the ECDF). From my understanding, when $T$ is a smooth function, bootstrap works fine. Sometimes when T is non-smooth (such as extreme order statistics, or $|\mu|$), m-out-of-n bootstrap can "smooth" things out and works.
My main question is:
- Why does m-out-of-n bootstrap "smooth" things out?
I have two more things that I want to make sure I am understanding correctly.
Since only $m$ samples are drawn, how can the behavior (variability, etc.) of $T_m(X_1^*,...,X_m^*)$ resemble that of a sample statistics using $n$ observations ($T_n(X_1,...,X_n)$). Or is it only known that asymptotically they are the same?
When using m-out-of-n bootstrap method to find CI, do we need to scale the variance of $\sqrt{m}(T_m(X_1^*,...,X_m^*;\hat{F_n})-T(X;\hat{F_n}))$ by $\frac{n}{m}$ since we're drawing a smaller sample size from $\hat{F_n}$?
Hope my questions are clear.