The Stein's estimator assumes that data points are draws from a normal distribution, i.e., $Z_i \sim N(\mu_i, \sigma^2_i)$.
By looking at different sources (Wikipedia, Efron,James-Stein Estimator with unequal variances) it seems that each $Z_i$ can be drawn from a normal distribution with arbitrary mean and variance.
In the extreme, all means and variances are different: $$\mu_i\neq\mu_j \text{ and } \sigma^2_i\neq\sigma^2_j, \forall\ i\neq j$$
"A quirky example would be estimating the speed of light, tea consumption in Taiwan, and hog weight in Montana, all together."
Assuming my assumptions so far are correct, how does pushing each $Z_i$ towards the global mean can create a better individual predictor. If not, what is wrong with my assumptions?