If the probability density of your future distribution is positively skewed, then typically (though not always; von Hippel, 2005) the median will be lower than its mean. So a technique that aims at the median as a point forecast will be biased low. Since the MAPE usually prefers a low biased prediction, such a technique will usually perform better in terms of the MAPE in such a situation.
Note that there are a couple of caveats in this description. The chain of reasoning is not perfect, because you can find pathological counter-examples for at least two steps. Nevertheless, it should work in most practical cases.
You may be better off using a custom optimization routine that directly attempts to minimize the MAPE. The problem being, of course, that the MAPE is not differentiable at perfect forecasts. Alternatively, you could try to estimate full predictive densities and then output the (-1)-median of this density as a point forecast, which is the functional that minimizes the MAPE in expectation (Gneiting, 2011, p. 752 with $\beta=-1$).
You may be interested in What are the shortcomings of the Mean Absolute Percentage Error (MAPE)?