Training a basic Markov Random Field for classifying pixels in an image

Question

I am attempting to learn how to use Markov Random Fields to segment regions in an image. I do not understand some of the parameters in the MRF or why the expectation maximisation I perform fails to converge to a solution sometimes.

Starting from Bayes' theorem, I have $p(x|y) = p(y|x) p(x) / p(y)$, where $y$ is the pixel's gray-scale value and $x$ is a class label. I have chosen to use a Gaussian distribution for $p(y|x)$, while $p(x)$ is modelled using the MRF.

I use a potential function for the MRF that has both pairwise clique potentials and a potential value for the class label of the pixel being classified. The single pixel potential value is some constant $\alpha$ that depends on the class label $x$. The pairwise potential functions are evaluated for the 4-connected neighbours and return positive $\beta$ if the neighbour has the same class label as this pixel and $-\beta$ if the labels differ.

At the point in the expectation maximisation where I have to find the values of $\alpha(x)$ and $\beta$ that maximise the expected value of the log-likelihood I used a numerical optimisation method (tried conjugate gradient, BFGS, Powell's method) but would always find that the value of $\beta$ would become negative, the $\alpha$s would increase dramatically and an iteration or two later the whole image would be assigned to one label only (background: assigning class labels given MRF parameters was done using ICM). If I removed the alphas, i.e. only using pairwise clique potentials, then the expectation maximisation would work just fine.

Please explain what is the purpose of the alphas for each class? I thought they would be related to the amount of that class that is present in the image, but it appears not. Once I got the MRF working with only pairwise potentials, I compared it to a straight forward Gaussian Mixture Model and found that they produced almost identical results. I was expecting the pairwise potentials to smooth out the classes a bit, but that didn't happen. Please advise where I went wrong.

In my application, the grey-scale value of the pixel counts and neighbouring pixels are more likely to have the same class label, but there isn't any reason to use a different beta for each pairwise clique. I hope I understood your question properly. — chippies, Feb 06 '14 at 16:33
Constant alphas seem to serve the purpose to model the prior distribution on the labels. As you supposed, correct alphas would likely to superimpose those labels that are encountered more often in the training set. If your model works well without them, why don’t you just drop them from the model? Your description is not verbose enough to answer why alphas would grow and spoil everything, but you probably need regularization. Try to add the gaussian prior on alpha to the model, i.e. add $\lambda \|\alpha\|^2$ to the log-posterior, this would probably prevent overfitting. — Roman Shapovalov, Jul 29 '14 at 15:20
What is unclear from your question: 1) Does your likelihood p(y|x) decompose over pixels, so you use a 1D gaussian for each? 2) What is the exact objective you optimize in the EM (you mentioned log-likelihood, but you use MRF for modelling prior)? 3) Do you define potentials in the logarithmic domain? Does increasing beta means increasing P(x), or energy, which is -log P(x), or negative energy? 4) Do you manage to actually decrease the EM objective by setting such degenerate alphas, or does the optimization fail? — Roman Shapovalov, Jul 29 '14 at 15:27

score 1 · Answer 1 · edited Apr 13 '17 at 12:44

Diagnosis

This sounds like an initialization problem.

The MRF model that you are using is non-convex and, as such, has multiple local minima. As far as I know, all existing optimization techniques are sensitive to initialization, meaning that the quality of the final solution is highly affected by where you start the optimization procedure from.

Training a basic Markov Random Field for classifying pixels in an image

1 Answers1

Diagnosis

Suggested Solution