
We know the Metropolis-Hastings (MH) algorithm in MCMC:

  • target distribution: $\pi(x)$
  • proposal distribution: $p(y|x)$
  • acceptance probability: $\alpha(x,y) = \min \Big(1, \dfrac{\pi(y)p(x|y)}{\pi(x)p(y|x)}\Big)$

Here are two examples of a proposal distribution:

  • $p(y|x)$ is the density of $U(x-a, x+a)$ at $y$;
  • $p(y|x)$ is the density of $N(x,\sigma^2)$ at $y$.

How should I understand the following statements?

  1. A large $a$ or $\sigma^2$ will lower the acceptance rate;

  2. a small $a$ or $\sigma^2$ will increase the correlation between successive states.

I cannot deduce these results mathematically from the acceptance formula.
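For a concrete empirical check, here is a minimal random-walk MH sketch in Python; the standard normal target, the values of $\sigma$, and the chain length are assumptions made purely for illustration. It reports the acceptance rate and the lag-1 autocorrelation of the chain for several $\sigma$:

```python
# Random-walk Metropolis-Hastings targeting N(0, 1) with a N(x, sigma^2) proposal.
# The target, the values of sigma, and the chain length are illustrative assumptions.
import numpy as np

def rw_metropolis(sigma, n_iter=50_000, seed=0):
    rng = np.random.default_rng(seed)
    log_pi = lambda x: -0.5 * x**2             # log N(0,1) density, up to a constant
    x, accepted = 0.0, 0
    chain = np.empty(n_iter)
    for t in range(n_iter):
        y = x + sigma * rng.standard_normal()  # proposal draw from N(x, sigma^2)
        # symmetric proposal, so the acceptance ratio reduces to pi(y) / pi(x)
        if np.log(rng.uniform()) < log_pi(y) - log_pi(x):
            x, accepted = y, accepted + 1
        chain[t] = x
    return chain, accepted / n_iter

for sigma in (0.1, 1.0, 10.0, 100.0):
    chain, acc = rw_metropolis(sigma)
    lag1 = np.corrcoef(chain[:-1], chain[1:])[0, 1]   # lag-1 autocorrelation
    print(f"sigma = {sigma:6.1f}   acceptance rate = {acc:.2f}   lag-1 correlation = {lag1:.2f}")
```

With a small $\sigma$ almost every proposal is accepted but the moves are tiny, so successive states are highly correlated; with a very large $\sigma$ the acceptance rate collapses towards zero (and the rarely-moving chain is again highly correlated).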

  • The intuition is that if your chain at iteration $i$ is at $x_{i}$ and you propose a candidate near it, say $x^{'}_{cand}=x_{i}+0.1$ ($a$ small and $\sigma^{2}$ small), it is more likely to be accepted than a candidate $x_{cand}^{'}=x_{i}+1000$ ($a$ big and $\sigma^{2}$ big). So, obviously, in the first case, if you accept your candidate, you will have higher correlation because it is almost the same as your $x_{i}$. – Fiodor1234 Jul 19 '21 at 12:21
  • @Fiodor1234 I agree with the intuition; what I actually want to see is the mathematical proof, especially for the first statement. – user6703592 Jul 19 '21 at 12:22
  • Sure, I think I can demonstrate one. – Fiodor1234 Jul 19 '21 at 12:25

1 Answer


Assuming a target distribution with unrestricted support $\mathfrak X$, consider the simple random walk proposal written as $$y=x+\sigma\epsilon\,,$$ where $\epsilon$ is a unit-variance symmetric white noise. Since this proposal is symmetric, $p(y|x)=p(x|y)$, and the acceptance ratio reduces to $$\dfrac{\pi(y)p(x|y)}{\pi(x)p(y|x)}=\dfrac{\pi(y)}{\pi(x)}=\dfrac{\pi(x+\sigma\epsilon)}{\pi(x)}\,,$$ which converges a.s. to $1$ when $\sigma$ goes to zero and to $0$ when $\sigma$ goes to infinity.
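As a numerical illustration of these two limits, here is a small sketch assuming, for concreteness only, a standard normal target and arbitrary fixed values of $x$ and $\epsilon$:

```python
# Behaviour of the acceptance ratio pi(x + sigma*eps) / pi(x) as sigma varies.
# The standard normal target and the fixed x, eps below are illustrative assumptions.
import numpy as np

pi = lambda x: np.exp(-0.5 * x**2)     # unnormalised N(0,1) density
x, eps = 0.7, 1.3                      # arbitrary current state and noise draw
for sigma in (1e-3, 1e-1, 1.0, 10.0, 100.0):
    ratio = pi(x + sigma * eps) / pi(x)
    print(f"sigma = {sigma:8.3f}   pi(x + sigma*eps)/pi(x) = {ratio:.3e}")
```

The ratio sits close to $1$ for small $\sigma$ and collapses to $0$ for large $\sigma$, because the density vanishes in the tails.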

Xi'an
  • Sorry, I am not convinced by the above deduction; you should prove that $P\Big(U\leq \min\big(1,\dfrac{\pi(x+\sigma\epsilon)}{\pi(x)}\big)\Big)$ is a decreasing function of $\sigma$. – user6703592 Jul 19 '21 at 12:41
  • "to 0 when $\sigma$ goes to infinity" why? – user6703592 Jul 19 '21 at 12:44
  • The property does not hold for all $\sigma$'s but only for small enough and large enough $\sigma$'s. – Xi'an Jul 19 '21 at 12:54
  • Even for large enough $\sigma$, why does it go to $0$? Or is everything here "almost surely"? – user6703592 Jul 19 '21 at 13:06
  • That the density goes to zero at infinity is a necessary condition for the density to be integrable. – Xi'an Jul 19 '21 at 13:50
  • When $\sigma$ is small, the generated states are highly correlated. Then why do we say that highly correlated states slow down the convergence? – user6703592 Jul 19 '21 at 17:25
  • The higher the correlation the larger the variance of a Monte Carlo estimator like $\sum_{t=1}^T h(x_t)/T$, hence the larger the number of simulations $T$ needed to achieve a given precision. – Xi'an Jul 20 '21 at 06:54
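A rough numerical check of this last point, again assuming a standard normal target with $h(x)=x$ and arbitrary choices of $\sigma$, $T$, and the number of replications: the spread of the ergodic average $\sum_{t=1}^T h(x_t)/T$ across independent runs is visibly larger for a sticky, highly correlated chain than for a better-tuned one.

```python
# Spread of the ergodic average (1/T) * sum h(x_t) over independent MH runs,
# for a highly correlated chain (sigma = 0.1) versus a better-tuned one (sigma = 2.5).
# Target N(0, 1) and h(x) = x are assumptions made for illustration.
import numpy as np

def ergodic_mean(sigma, n_iter=5_000, seed=0):
    rng = np.random.default_rng(seed)
    x, total = 0.0, 0.0
    for _ in range(n_iter):
        y = x + sigma * rng.standard_normal()
        if np.log(rng.uniform()) < 0.5 * (x**2 - y**2):  # log pi(y) - log pi(x) for N(0,1)
            x = y
        total += x
    return total / n_iter

for sigma in (0.1, 2.5):
    means = [ergodic_mean(sigma, seed=s) for s in range(200)]
    print(f"sigma = {sigma}   sd of ergodic average over 200 runs = {np.std(means):.3f}")
```

The true mean is $0$ in both cases; the larger standard deviation for $\sigma=0.1$ reflects exactly the variance inflation described in the comment above.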