Folders and files Name Name Last commit message
Last commit date
parent directory
View all files
dataset: opus
model: transformer
source language(s): ind pap
target language(s): ind pap
model: transformer
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence initial language token is required in the form of >>id<<
(id = valid target language ID)
download: opus-2020-07-19.zip
test set translations: opus-2020-07-19.test.txt
test set scores: opus-2020-07-19.eval.txt
testset
BLEU
chr-F
Tatoeba-test.multi.multi
21.1
0.369
dataset: opus
model: transformer
source language(s): ind pap
target language(s): ind pap
model: transformer
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence initial language token is required in the form of >>id<<
(id = valid target language ID)
download: opus-2020-07-26.zip
test set translations: opus-2020-07-26.test.txt
test set scores: opus-2020-07-26.eval.txt
testset
BLEU
chr-F
Tatoeba-test.msa-msa.msa.msa
0.7
0.149
Tatoeba-test.msa-pap.msa.pap
31.7
0.577
Tatoeba-test.multi.multi
21.1
0.369
Tatoeba-test.pap-msa.pap.msa
17.7
0.197
dataset: opus
model: transformer
source language(s): eng ind jak_Latn max_Latn min msa_Latn pap tmw_Latn zlm zlm_Latn zsm_Latn
target language(s): eng ind jak_Latn max_Latn min msa_Latn pap tmw_Latn zlm zlm_Latn zsm_Latn
model: transformer
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence initial language token is required in the form of >>id<<
(id = valid target language ID)
download: opus-2020-09-26.zip
test set translations: opus-2020-09-26.test.txt
test set scores: opus-2020-09-26.eval.txt
testset
BLEU
chr-F
Tatoeba-test.eng-msa.eng.msa
33.2
0.587
Tatoeba-test.eng-pap.eng.pap
45.1
0.638
Tatoeba-test.msa-eng.msa.eng
39.8
0.585
Tatoeba-test.msa-msa.msa.msa
14.8
0.353
Tatoeba-test.msa-pap.msa.pap
31.7
0.577
Tatoeba-test.multi.multi
37.0
0.586
Tatoeba-test.pap-eng.pap.eng
48.9
0.603
Tatoeba-test.pap-msa.pap.msa
17.7
0.197
dataset: opus
model: transformer
source language(s): eng ind jak_Latn max_Latn min msa_Latn pap tmw_Latn zlm zlm_Latn zsm_Latn
target language(s): eng ind jak_Latn max_Latn min msa_Latn pap tmw_Latn zlm zlm_Latn zsm_Latn
model: transformer
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence initial language token is required in the form of >>id<<
(id = valid target language ID)
download: opus-2020-10-04.zip
test set translations: opus-2020-10-04.test.txt
test set scores: opus-2020-10-04.eval.txt
testset
BLEU
chr-F
Tatoeba-test.eng-msa.eng.msa
33.2
0.587
Tatoeba-test.eng-pap.eng.pap
45.1
0.638
Tatoeba-test.msa-eng.msa.eng
39.8
0.585
Tatoeba-test.msa-msa.msa.msa
14.8
0.353
Tatoeba-test.msa-pap.msa.pap
31.7
0.577
Tatoeba-test.multi.multi
37.0
0.586
Tatoeba-test.pap-eng.pap.eng
48.9
0.603
Tatoeba-test.pap-msa.pap.msa
17.7
0.197
You can’t perform that action at this time.