2

Say I have a hidden Markov model (HMM), and due to the emission distribution and our observations, we get probability 0 for certain states due to computer precision, which causes singularities (divide by 0) when using the forward algorithm.

What are some methods to handle this? One method is to use a distribution with fatter tails, i.e. replace a Gaussian distribution with a t-distribution. Are there others?

mlstudent
  • 452
  • 3
  • 12

1 Answers1

1

I can think of two ways to avoid numerical underflow:

1) Use the scaling method described in Rabiner's paper (Rabiner, L. R. (1989). A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2), 257-286.)

2) Conduct your computations in log space. (use the logsumexp trick). I don't have a reference right now, but I remember there was a paper on this.