I am looking for a library, script or program that can normalize the transcribed and gold texts when computing the word error rate (WER) of an automated speech recognition system.
For example, if:
- the gold transcript is
Without the dataset the article is useless
- the predicted transcript is
Without the data set the article's useless
the texts should be normalized so that the WER is 0 (and not 3 or 4 if the text isn't normalized).
I have crossposted the question at: