
opus-2020-07-19.zip

dataset: opus
model: transformer
source language(s): ind pap
target language(s): ind pap
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence-initial language token is required in the form of >>id<< (id = valid target language ID)
download: opus-2020-07-19.zip
test set translations: opus-2020-07-19.test.txt
test set scores: opus-2020-07-19.eval.txt
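
The language-token requirement above can be illustrated with a short sketch. The helper name `add_target_token` and the example sentence are hypothetical, and the single space between the token and the sentence is an assumption rather than something this README specifies.

```python
def add_target_token(sentence: str, target_id: str) -> str:
    """Prefix a source sentence with the >>id<< target-language token."""
    # Assumption: one space separates the token from the sentence text.
    return f">>{target_id}<< {sentence.strip()}"


# Hypothetical input: an Indonesian sentence to be translated into Papiamento (pap).
print(add_target_token("Selamat pagi.", "pap"))
```

The tagged string is what would be passed on to the pre-processing step (normalization + SentencePiece) before translation.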

Benchmarks

testset	BLEU	chr-F
Tatoeba-test.multi.multi	21.1	0.369

opus-2020-07-26.zip

dataset: opus
model: transformer
source language(s): ind pap
target language(s): ind pap
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence-initial language token is required in the form of >>id<< (id = valid target language ID)
download: opus-2020-07-26.zip
test set translations: opus-2020-07-26.test.txt
test set scores: opus-2020-07-26.eval.txt

Benchmarks

testset	BLEU	chr-F
Tatoeba-test.msa-msa.msa.msa	0.7	0.149
Tatoeba-test.msa-pap.msa.pap	31.7	0.577
Tatoeba-test.multi.multi	21.1	0.369
Tatoeba-test.pap-msa.pap.msa	17.7	0.197

opus-2020-09-26.zip

dataset: opus
model: transformer
source language(s): eng ind jak_Latn max_Latn min msa_Latn pap tmw_Latn zlm zlm_Latn zsm_Latn
target language(s): eng ind jak_Latn max_Latn min msa_Latn pap tmw_Latn zlm zlm_Latn zsm_Latn
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence-initial language token is required in the form of >>id<< (id = valid target language ID)
download: opus-2020-09-26.zip
test set translations: opus-2020-09-26.test.txt
test set scores: opus-2020-09-26.eval.txt
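
Since this release lists many target languages, it may help to check an ID before building the token. The sketch below copies the IDs from the "target language(s)" line above; the function name `tag_for_target` is hypothetical, and treating that list as the complete set of accepted IDs is an assumption.

```python
# IDs copied from the "target language(s)" line of this release.
TARGET_IDS = frozenset(
    "eng ind jak_Latn max_Latn min msa_Latn pap tmw_Latn zlm zlm_Latn zsm_Latn".split()
)


def tag_for_target(sentence: str, target_id: str) -> str:
    """Return the sentence prefixed with >>id<<, rejecting unlisted IDs."""
    if target_id not in TARGET_IDS:
        raise ValueError(
            f"{target_id!r} is not a listed target language; "
            f"expected one of {sorted(TARGET_IDS)}"
        )
    # Assumption: one space separates the token from the sentence text.
    return f">>{target_id}<< {sentence.strip()}"


print(tag_for_target("Good morning.", "zsm_Latn"))
```

Note that the benchmark names use the plain code `msa`, while the list above contains `msa_Latn`, so this check would reject `msa` as written.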

Benchmarks

testset	BLEU	chr-F
Tatoeba-test.eng-msa.eng.msa	33.2	0.587
Tatoeba-test.eng-pap.eng.pap	45.1	0.638
Tatoeba-test.msa-eng.msa.eng	39.8	0.585
Tatoeba-test.msa-msa.msa.msa	14.8	0.353
Tatoeba-test.msa-pap.msa.pap	31.7	0.577
Tatoeba-test.multi.multi	37.0	0.586
Tatoeba-test.pap-eng.pap.eng	48.9	0.603
Tatoeba-test.pap-msa.pap.msa	17.7	0.197
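
A benchmark table in the layout shown above (test set name, BLEU, chr-F, separated by whitespace) can be read with a few lines of Python. This is only a sketch: `parse_scores` is a hypothetical helper, and it is assumed here, not confirmed, that the linked eval file uses the same three-column layout.

```python
def parse_scores(table: str) -> dict[str, tuple[float, float]]:
    """Map each test set name to its (BLEU, chr-F) pair."""
    scores = {}
    for line in table.strip().splitlines():
        fields = line.split()
        # Skip the header row and anything that is not a three-column row.
        if len(fields) != 3 or fields[0] == "testset":
            continue
        name, bleu, chrf = fields
        scores[name] = (float(bleu), float(chrf))
    return scores


# Three rows copied from the table above.
ROWS = """
testset	BLEU	chr-F
Tatoeba-test.msa-msa.msa.msa	14.8	0.353
Tatoeba-test.multi.multi	37.0	0.586
Tatoeba-test.pap-msa.pap.msa	17.7	0.197
"""

scores = parse_scores(ROWS)
# Pick the test set with the lowest chr-F among the rows given.
weakest = min(scores, key=lambda name: scores[name][1])
print(weakest, scores[weakest])
```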

opus-2020-10-04.zip

dataset: opus
model: transformer
source language(s): eng ind jak_Latn max_Latn min msa_Latn pap tmw_Latn zlm zlm_Latn zsm_Latn
target language(s): eng ind jak_Latn max_Latn min msa_Latn pap tmw_Latn zlm zlm_Latn zsm_Latn
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence-initial language token is required in the form of >>id<< (id = valid target language ID)
download: opus-2020-10-04.zip
test set translations: opus-2020-10-04.test.txt
test set scores: opus-2020-10-04.eval.txt

Benchmarks

testset	BLEU	chr-F
Tatoeba-test.eng-msa.eng.msa	33.2	0.587
Tatoeba-test.eng-pap.eng.pap	45.1	0.638
Tatoeba-test.msa-eng.msa.eng	39.8	0.585
Tatoeba-test.msa-msa.msa.msa	14.8	0.353
Tatoeba-test.msa-pap.msa.pap	31.7	0.577
Tatoeba-test.multi.multi	37.0	0.586
Tatoeba-test.pap-eng.pap.eng	48.9	0.603
Tatoeba-test.pap-msa.pap.msa	17.7	0.197
