Update Evaluation used for V.1.1.1 #6

Open
jordiclive opened this issue Jul 24, 2021 · 2 comments

Comments

jordiclive commented Jul 24, 2021

The references in /evaluation/dart_reference are not for the current version. Could you replace them with the new references and share the tokenization script that is applied to the predictions?

I am getting very different BLEU scores depending on the tokenization and on how many references I use, since a few examples have up to ~30 references.

I would like to directly compare against the README leaderboard.
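For context, here is a minimal sketch of the comparison I mean, assuming sacrebleu is installed and using made-up in-line hypotheses and references rather than the actual DART files. It is not the official evaluation script; it only illustrates how the corpus BLEU shifts with the tokenizer and with how many references are kept:

```python
# Minimal sketch, not the official evaluation script: assumes `pip install sacrebleu`;
# the hypotheses/references below are invented placeholders, not real DART data.
import sacrebleu

hyps = ["the eagle rock reservation is located in essex county , new jersey ."]

# Each hypothesis has its own list of references (DART has a variable number, up to ~30).
refs_per_hyp = [[
    "Eagle Rock Reservation is located in Essex County, New Jersey.",
    "The Eagle Rock Reservation lies within Essex County, New Jersey.",
]]

for k in (1, 2):  # how many references to keep per example
    # sacrebleu expects one reference *stream* per reference position,
    # each stream aligned with the list of hypotheses.
    ref_streams = [[refs[i] for refs in refs_per_hyp] for i in range(k)]
    for tok in ("13a", "intl", "none"):
        bleu = sacrebleu.corpus_bleu(hyps, ref_streams, tokenize=tok)
        print(f"refs={k}  tokenize={tok}  BLEU={bleu.score:.2f}")
```

Which tokenizer and how many references the leaderboard script uses therefore changes the number that gets reported, which is why I would like the exact script.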

lxuechen commented Aug 1, 2021

Upvoting this, since I am having the same issue here.

LemonQC commented Aug 18, 2021

How do I run BART with this model? Could you provide more details about the running environment and the Python script?
