Training a Discriminator to guide Beam Search for a seq2seq model?

Asked Jul 19 '21 at 21:54

Active Oct 11 '21 at 02:04

Viewed 45 times

The idea is to train a discriminator during training of the seq2seq model to differentiate between 'fake' decoder outputs and 'real' decoder targets, while not propagating discriminator loss to the seq2seq model. Then during inference the discriminator could be used either as a scoring function in beam search, or beam search could proceed as normal and the discriminator would be used to rank the output beams to select the most 'real' looking sequence.

I've seen some papers and architectures where a discriminator is used as part of the training loop as for a seq2seq GAN, but I have not seen a discriminator used as a learnable score function for inference-time beam search. Has any work been done in this area? And if not is there a reason as to why this isn't a good idea?

edited Jul 19 '21 at 22:02

asked Jul 19 '21 at 21:54

Avelina

It sounds to me like you’ve described reranking of the partial translation hypotheses. – Arya McCarthy Jul 20 '21 at 00:57
Thank you for good question, did you find any answer for that? – shaghayegh nadjari Oct 11 '21 at 01:56

Training a Discriminator to guide Beam Search for a seq2seq model?

0 Answers0