(My apologies for originally posting a similar question under Issues, which I have since closed)
I am trying to train a grammatical error correction model with transformers: the inputs are English sentences, some of which contain grammatical errors, and the output is a corrected version of each sentence if it contains errors, or simply the same sentence as the input if it has none. The problem I'm facing is that the greedy strategy in the decoder produces outputs whose word tokens and vocabulary are completely different from the inputs, since the (_n_+1)th word prediction is conditioned on the _n_th word prediction, and prediction errors compound as the output sentence is generated one word at a time.
It seems that I would want to base the _n_th prediction on the _n_th input token, not on the decoder's previous prediction. Could someone please point me to how I can disable the greedy decoder in this case? I have looked at sequence_decoders.py and recurrent_modules.py, but I am not sure what exactly to modify. Many, MANY thanks in advance!!
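To make the question concrete, here is a rough sketch of the two behaviours as I understand them, in plain PyTorch. This is **not** this library's API: the `model(src_ids, decoder_input)` call, its signature, and the helper names are hypothetical and for illustration only. The first function is free-running greedy decoding, which feeds each prediction back in; the second conditions every position on given tokens (the gold targets during training, or the copied input sentence in my case), which I believe is usually called teacher forcing.

```python
import torch
import torch.nn as nn

# Illustration only -- a toy encoder-decoder "model" is assumed to be a
# callable returning per-position vocabulary logits of shape
# (batch, decoder_len, vocab). The signature is hypothetical.

def greedy_decode(model, src_ids, bos_id, max_len):
    """Free-running decoding: each step feeds the model's own previous
    prediction back in, so early mistakes compound over the sentence."""
    generated = torch.full((src_ids.size(0), 1), bos_id, dtype=torch.long)
    for _ in range(max_len):
        logits = model(src_ids, generated)                 # (batch, cur_len, vocab)
        next_id = logits[:, -1, :].argmax(dim=-1, keepdim=True)
        generated = torch.cat([generated, next_id], dim=1)
    return generated[:, 1:]                                # drop the BOS column

def teacher_forced_step(model, src_ids, tgt_ids, bos_id, pad_id):
    """Teacher-forced step: the decoder is conditioned on the provided
    tokens at every position, so prediction n never depends on the
    model's own prediction n-1."""
    bos = torch.full((tgt_ids.size(0), 1), bos_id, dtype=torch.long)
    decoder_input = torch.cat([bos, tgt_ids[:, :-1]], dim=1)   # shift right
    logits = model(src_ids, decoder_input)                     # (batch, tgt_len, vocab)
    loss = nn.functional.cross_entropy(
        logits.reshape(-1, logits.size(-1)),
        tgt_ids.reshape(-1),
        ignore_index=pad_id,
    )
    return logits, loss
```

The second function is roughly the behaviour I would like from the decoder here, rather than the first.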