eng-pqe

opus-2020-06-28.zip

  • dataset: opus
  • model: transformer
  • source language(s): eng
  • target language(s): fij gil haw mah mri nau niu rap smo tah ton tvl
  • pre-processing: normalization + SentencePiece (spm32k,spm32k)
  • a sentence initial language token is required in the form of >>id<< (id = valid target language ID)
  • download: opus-2020-06-28.zip
  • test set translations: opus-2020-06-28.test.txt
  • test set scores: opus-2020-06-28.eval.txt
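The language token requirement above can be illustrated with a minimal sketch. The helper name `add_language_token` and the `VALID_TARGETS` set are hypothetical, introduced only for illustration; the set lists the target IDs of this release (later releases also list `lkt`). The sketch only builds the input string, placing the token at the start of the sentence separated by a space (the separator is an assumption), and does not load or run the model.

```python
# Hypothetical helper: prepend the >>id<< target language token to a sentence.
# VALID_TARGETS mirrors the target language list of the opus-2020-06-28 release.
VALID_TARGETS = {
    "fij", "gil", "haw", "mah", "mri", "nau",
    "niu", "rap", "smo", "tah", "ton", "tvl",
}


def add_language_token(sentence: str, target_id: str) -> str:
    """Return the sentence prefixed with the sentence-initial language token."""
    if target_id not in VALID_TARGETS:
        raise ValueError(f"unknown target language ID: {target_id!r}")
    # The token comes first, followed by the (whitespace-trimmed) source text.
    return f">>{target_id}<< {sentence.strip()}"


if __name__ == "__main__":
    print(add_language_token("How are you?", "smo"))
```

Validating the ID before building the string catches typos early, since an input without a valid token does not tell the model which target language to produce.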

Benchmarks

| testset | BLEU | chr-F |
|---------|------|-------|
| Tatoeba-test.eng-fij.eng.fij | 22.5 | 0.434 |
| Tatoeba-test.eng-gil.eng.gil | 59.3 | 0.739 |
| Tatoeba-test.eng-haw.eng.haw | 1.1 | 0.159 |
| Tatoeba-test.eng-mah.eng.mah | 7.6 | 0.363 |
| Tatoeba-test.eng-mri.eng.mri | 7.2 | 0.295 |
| Tatoeba-test.eng.multi | 11.3 | 0.311 |
| Tatoeba-test.eng-nau.eng.nau | 0.5 | 0.094 |
| Tatoeba-test.eng-niu.eng.niu | 28.1 | 0.509 |
| Tatoeba-test.eng-rap.eng.rap | 3.5 | 0.163 |
| Tatoeba-test.eng-smo.eng.smo | 24.6 | 0.461 |
| Tatoeba-test.eng-tah.eng.tah | 10.4 | 0.296 |
| Tatoeba-test.eng-ton.eng.ton | 21.1 | 0.463 |
| Tatoeba-test.eng-tvl.eng.tvl | 29.3 | 0.500 |

opus-2020-07-06.zip

  • dataset: opus
  • model: transformer
  • source language(s): eng
  • target language(s): fij gil haw lkt mah mri nau niu rap smo tah ton tvl
  • pre-processing: normalization + SentencePiece (spm32k,spm32k)
  • a sentence initial language token is required in the form of >>id<< (id = valid target language ID)
  • download: opus-2020-07-06.zip
  • test set translations: opus-2020-07-06.test.txt
  • test set scores: opus-2020-07-06.eval.txt

Benchmarks

| testset | BLEU | chr-F |
|---------|------|-------|
| Tatoeba-test.eng-fij.eng.fij | 20.5 | 0.406 |
| Tatoeba-test.eng-gil.eng.gil | 31.4 | 0.607 |
| Tatoeba-test.eng-haw.eng.haw | 0.5 | 0.141 |
| Tatoeba-test.eng-lkt.eng.lkt | 0.5 | 0.077 |
| Tatoeba-test.eng-mah.eng.mah | 8.4 | 0.331 |
| Tatoeba-test.eng-mri.eng.mri | 7.9 | 0.300 |
| Tatoeba-test.eng.multi | 10.3 | 0.304 |
| Tatoeba-test.eng-nau.eng.nau | 0.5 | 0.083 |
| Tatoeba-test.eng-niu.eng.niu | 34.6 | 0.531 |
| Tatoeba-test.eng-rap.eng.rap | 2.1 | 0.148 |
| Tatoeba-test.eng-smo.eng.smo | 25.4 | 0.467 |
| Tatoeba-test.eng-tah.eng.tah | 8.9 | 0.263 |
| Tatoeba-test.eng-ton.eng.ton | 26.1 | 0.489 |
| Tatoeba-test.eng-tvl.eng.tvl | 28.9 | 0.520 |

opus-2020-07-14.zip

  • dataset: opus
  • model: transformer
  • source language(s): eng
  • target language(s): fij gil haw lkt mah mri nau niu rap smo tah ton tvl
  • pre-processing: normalization + SentencePiece (spm32k,spm32k)
  • a sentence initial language token is required in the form of >>id<< (id = valid target language ID)
  • download: opus-2020-07-14.zip
  • test set translations: opus-2020-07-14.test.txt
  • test set scores: opus-2020-07-14.eval.txt

Benchmarks

| testset | BLEU | chr-F |
|---------|------|-------|
| Tatoeba-test.eng-fij.eng.fij | 20.5 | 0.406 |
| Tatoeba-test.eng-gil.eng.gil | 31.4 | 0.607 |
| Tatoeba-test.eng-haw.eng.haw | 0.5 | 0.141 |
| Tatoeba-test.eng-lkt.eng.lkt | 0.5 | 0.077 |
| Tatoeba-test.eng-mah.eng.mah | 8.4 | 0.331 |
| Tatoeba-test.eng-mri.eng.mri | 7.9 | 0.300 |
| Tatoeba-test.eng.multi | 10.3 | 0.304 |
| Tatoeba-test.eng-nau.eng.nau | 0.5 | 0.083 |
| Tatoeba-test.eng-niu.eng.niu | 34.6 | 0.531 |
| Tatoeba-test.eng-rap.eng.rap | 2.1 | 0.148 |
| Tatoeba-test.eng-smo.eng.smo | 25.4 | 0.467 |
| Tatoeba-test.eng-tah.eng.tah | 8.9 | 0.263 |
| Tatoeba-test.eng-ton.eng.ton | 26.1 | 0.489 |
| Tatoeba-test.eng-tvl.eng.tvl | 28.9 | 0.520 |

opus-2020-07-20.zip

  • dataset: opus
  • model: transformer
  • source language(s): eng
  • target language(s): fij gil haw lkt mah mri nau niu rap smo tah ton tvl
  • pre-processing: normalization + SentencePiece (spm32k,spm32k)
  • a sentence initial language token is required in the form of >>id<< (id = valid target language ID)
  • download: opus-2020-07-20.zip
  • test set translations: opus-2020-07-20.test.txt
  • test set scores: opus-2020-07-20.eval.txt

Benchmarks

| testset | BLEU | chr-F |
|---------|------|-------|
| Tatoeba-test.eng-fij.eng.fij | 20.5 | 0.406 |
| Tatoeba-test.eng-gil.eng.gil | 31.4 | 0.607 |
| Tatoeba-test.eng-haw.eng.haw | 0.5 | 0.141 |
| Tatoeba-test.eng-lkt.eng.lkt | 0.5 | 0.077 |
| Tatoeba-test.eng-mah.eng.mah | 8.4 | 0.331 |
| Tatoeba-test.eng-mri.eng.mri | 7.9 | 0.300 |
| Tatoeba-test.eng.multi | 10.3 | 0.304 |
| Tatoeba-test.eng-nau.eng.nau | 0.5 | 0.083 |
| Tatoeba-test.eng-niu.eng.niu | 34.6 | 0.531 |
| Tatoeba-test.eng-rap.eng.rap | 2.1 | 0.148 |
| Tatoeba-test.eng-smo.eng.smo | 25.4 | 0.467 |
| Tatoeba-test.eng-tah.eng.tah | 8.9 | 0.263 |
| Tatoeba-test.eng-ton.eng.ton | 26.1 | 0.489 |
| Tatoeba-test.eng-tvl.eng.tvl | 28.9 | 0.520 |

opus-2020-07-27.zip

  • dataset: opus
  • model: transformer
  • source language(s): eng
  • target language(s): fij gil haw lkt mah mri nau niu rap smo tah ton tvl
  • pre-processing: normalization + SentencePiece (spm32k,spm32k)
  • a sentence initial language token is required in the form of >>id<< (id = valid target language ID)
  • download: opus-2020-07-27.zip
  • test set translations: opus-2020-07-27.test.txt
  • test set scores: opus-2020-07-27.eval.txt

Benchmarks

| testset | BLEU | chr-F |
|---------|------|-------|
| Tatoeba-test.eng-fij.eng.fij | 20.5 | 0.406 |
| Tatoeba-test.eng-gil.eng.gil | 31.4 | 0.607 |
| Tatoeba-test.eng-haw.eng.haw | 0.5 | 0.141 |
| Tatoeba-test.eng-lkt.eng.lkt | 0.5 | 0.077 |
| Tatoeba-test.eng-mah.eng.mah | 8.4 | 0.331 |
| Tatoeba-test.eng-mri.eng.mri | 7.9 | 0.300 |
| Tatoeba-test.eng.multi | 10.3 | 0.304 |
| Tatoeba-test.eng-nau.eng.nau | 0.5 | 0.083 |
| Tatoeba-test.eng-niu.eng.niu | 34.6 | 0.531 |
| Tatoeba-test.eng-rap.eng.rap | 2.1 | 0.148 |
| Tatoeba-test.eng-smo.eng.smo | 25.4 | 0.467 |
| Tatoeba-test.eng-tah.eng.tah | 8.9 | 0.263 |
| Tatoeba-test.eng-ton.eng.ton | 26.1 | 0.489 |
| Tatoeba-test.eng-tvl.eng.tvl | 28.9 | 0.520 |

opus2m-2020-08-01.zip

  • dataset: opus2m
  • model: transformer
  • source language(s): eng
  • target language(s): fij gil haw lkt mah mri nau niu rap smo tah ton tvl
  • pre-processing: normalization + SentencePiece (spm32k,spm32k)
  • a sentence initial language token is required in the form of >>id<< (id = valid target language ID)
  • download: opus2m-2020-08-01.zip
  • test set translations: opus2m-2020-08-01.test.txt
  • test set scores: opus2m-2020-08-01.eval.txt

Benchmarks

| testset | BLEU | chr-F |
|---------|------|-------|
| Tatoeba-test.eng-fij.eng.fij | 22.1 | 0.396 |
| Tatoeba-test.eng-gil.eng.gil | 41.9 | 0.673 |
| Tatoeba-test.eng-haw.eng.haw | 0.6 | 0.114 |
| Tatoeba-test.eng-lkt.eng.lkt | 0.5 | 0.075 |
| Tatoeba-test.eng-mah.eng.mah | 9.7 | 0.386 |
| Tatoeba-test.eng-mri.eng.mri | 7.7 | 0.301 |
| Tatoeba-test.eng.multi | 11.3 | 0.306 |
| Tatoeba-test.eng-nau.eng.nau | 0.5 | 0.071 |
| Tatoeba-test.eng-niu.eng.niu | 42.5 | 0.560 |
| Tatoeba-test.eng-rap.eng.rap | 3.3 | 0.122 |
| Tatoeba-test.eng-smo.eng.smo | 27.0 | 0.462 |
| Tatoeba-test.eng-tah.eng.tah | 11.3 | 0.307 |
| Tatoeba-test.eng-ton.eng.ton | 27.0 | 0.528 |
| Tatoeba-test.eng-tvl.eng.tvl | 29.3 | 0.513 |