BLEU (Bilingual Evaluation Understudy) is an algorithm for evaluating the quality of text which has been machine-translated from one natural language to another.
Questions tagged [bleu]
2 questions
2 votes, 1 answer
Shouldn't ROUGE-1 precision be equal to BLEU with w=(1, 0, 0, 0) when brevity penalty is 1?
I am trying to evaluate an NLP model using BLEU and ROUGE. However, I am a bit confused about the difference between those scores. While I am aware that ROUGE is aimed at recall whilst BLEU measures precision, all ROUGE implementations I have come…

jdepoix
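The comparison asked about in this question can be checked by hand. The following is a minimal sketch, assuming a single reference, whitespace tokenization, and no stemming; the function names are hypothetical and do not come from any ROUGE or BLEU library. Under those assumptions, ROUGE-1 precision and BLEU's clipped unigram precision count the same thing, while ROUGE-1 recall divides by the reference length instead. Real implementations may still disagree because of tokenization, stemming, or how multiple references are handled.

```python
from collections import Counter


def clipped_overlap(candidate, reference):
    """Count candidate unigrams that also occur in the reference.

    Each token's count is clipped by how often it appears in the reference.
    """
    cand_counts = Counter(candidate)
    ref_counts = Counter(reference)
    return sum(min(count, ref_counts[tok]) for tok, count in cand_counts.items())


def bleu1_precision(candidate, reference):
    """Clipped unigram precision, i.e. BLEU with w=(1, 0, 0, 0) and brevity penalty 1."""
    return clipped_overlap(candidate, reference) / len(candidate)


def rouge1(candidate, reference):
    """Return (precision, recall) for ROUGE-1 against a single reference."""
    overlap = clipped_overlap(candidate, reference)
    return overlap / len(candidate), overlap / len(reference)


# Hypothetical example; the candidate is longer than the reference,
# so BLEU's brevity penalty would be 1.
candidate = "the cat sat on the mat today".split()
reference = "the cat is on the mat".split()

precision, recall = rouge1(candidate, reference)
print(bleu1_precision(candidate, reference), precision, recall)
```

Here the two precision values coincide (5 matching unigrams out of 7 candidate tokens), and only the recall (5 out of 6 reference tokens) differs.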
1 vote, 2 answers
NLTK: odd outputs from bleu_score
For machine translation purposes I use the BLEU score, which seems to be the validation mechanism of choice (used in the Sutskever 2014 sequence-to-sequence paper).
The purpose is to get as high a BLEU score as possible (between 0 and 1).
The following mumble gives an…

Alexander R Johansen
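To make the 0 to 1 range concrete, here is a rough sketch of sentence-level BLEU for a single reference, written in plain Python rather than with NLTK. It assumes uniform weights over 1- to 4-grams, no smoothing, and the standard brevity penalty; the function names are hypothetical and this is not NLTK's implementation. The excerpt above does not say which outputs looked odd, so the zero score shown here is only one possible illustration: without smoothing, a single n-gram order with no matches drives the whole geometric mean to 0.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return the list of n-grams (as tuples) in a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def modified_precision(candidate, reference, n):
    """Clipped n-gram precision of the candidate against one reference."""
    cand = Counter(ngrams(candidate, n))
    ref = Counter(ngrams(reference, n))
    total = sum(cand.values())
    if total == 0:
        return 0.0
    clipped = sum(min(count, ref[gram]) for gram, count in cand.items())
    return clipped / total


def sentence_bleu_sketch(candidate, reference, weights=(0.25, 0.25, 0.25, 0.25)):
    """Unsmoothed single-reference BLEU: brevity penalty times geometric mean."""
    precisions = [
        modified_precision(candidate, reference, n)
        for n in range(1, len(weights) + 1)
    ]
    # Only orders with a positive weight contribute to the geometric mean.
    active = [(p, w) for p, w in zip(precisions, weights) if w > 0]
    if any(p == 0 for p, _ in active):
        return 0.0  # one empty n-gram order zeroes the unsmoothed score
    log_mean = sum(w * math.log(p) for p, w in active)
    c, r = len(candidate), len(reference)
    brevity_penalty = 1.0 if c > r else math.exp(1 - r / c)
    return brevity_penalty * math.exp(log_mean)


# Hypothetical sentences for illustration.
reference = "the cat is on the mat".split()
candidate = "the cat sat on the mat today".split()

print(sentence_bleu_sketch(reference, reference))                # identical sentences
print(sentence_bleu_sketch(candidate, reference))                # no 4-gram matches
print(sentence_bleu_sketch(candidate, reference, (1, 0, 0, 0)))  # unigrams only
```

In this sketch the candidate shares five of its seven unigrams with the reference but no 4-gram, so the default weights give 0 while the unigram-only weights give a clearly positive score.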