First, the method and theory, in brief: the goal is to approximate the target distribution $p(\theta|D)$, where $\theta$ is a parameter vector and $D$ is the observed data, given some prior distribution $p(\theta)$. At each step of the MCMC chain, the sampling algorithm proposes a new parameter vector $\theta_{proposed}$. (Exactly how varies with the flavor of algorithm and the proposal distribution.) Given a proposed $\theta$, it computes the product $p(D|\theta_{proposed})\,p(\theta_{proposed})$, which by Bayes' rule is proportional to the posterior $p(\theta_{proposed}|D)$. It then accepts the proposal with probability $\min\left(\frac{p(D|\theta_{proposed})\,p(\theta_{proposed})}{p(D|\theta_{current})\,p(\theta_{current})},\ 1\right)$ (for a symmetric proposal distribution). If a number of requirements are met, this chain will produce a representative sample from the posterior distribution. (In brief, it requires a proposal process that adequately covers the posterior, proper burn-in, and convergence.)
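To make that concrete, here is a minimal random-walk Metropolis sketch in R. It is not MCMCpack's internal sampler, just an illustration of the propose/accept loop described above; the data-generating setup and names like logPost and proposalSD are my own illustrative choices.

```r
set.seed(1)
D <- rnorm(50, mean = 2, sd = 1)                 # observed data (illustrative)

# Unnormalized log posterior: log p(D | theta) + log p(theta)
logPost <- function(theta) {
  sum(dnorm(D, mean = theta, sd = 1, log = TRUE)) +   # log likelihood
    dnorm(theta, mean = 0, sd = 10, log = TRUE)       # log prior
}

nIter      <- 10000
proposalSD <- 0.5
chain      <- numeric(nIter)
chain[1]   <- 0                                   # arbitrary starting value

for (i in 2:nIter) {
  current  <- chain[i - 1]
  proposed <- rnorm(1, mean = current, sd = proposalSD)  # propose new theta
  # Accept with probability min(posterior ratio, 1), computed on the log scale
  logAcceptProb <- min(logPost(proposed) - logPost(current), 0)
  chain[i] <- if (log(runif(1)) < logAcceptProb) proposed else current
}

posteriorSample <- chain[-(1:1000)]               # discard burn-in
```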
If those requirements are met, you can view the MCMC sample as an approximation to the posterior. Each individual sample is one sampled vector of values for $\theta$; likewise, differencing two sampled parameters across the entire sample produces an approximate distribution of the difference between those two parameters. (I'm not familiar with MCMCpack, but I gather from your code and comment that postDist[,"y1"] and postDist[,"y2"] are vectors of samples from the posterior, so this should work.) This is one benefit of MCMC methods: if the parameters covary, solving for the distribution of their sum or difference analytically requires knowing their joint distribution, whereas with posterior samples you just compute the sum or difference draw by draw.
By the by, I began learning Bayesian methods with Kruschke's Doing Bayesian Data Analysis, and I highly recommend his chapters explaining MCMC algorithms. It's a very approachable, intuitive treatment.