Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

1 Cor. 1:14 contains unexpected output that does not match source verse #666

Open
bhartmoore opened this issue Feb 27, 2025 · 0 comments
Open
Labels
bug Something isn't working pipeline 6: infer Issue related to using a trained model to translate.

Comments

@bhartmoore
Copy link
Collaborator

1 Cor 1:14 includes very unexpected output in the bep-to-Indonesian back translation, despite the team being generally happy with the output (GEN, RUT, 1-2CO) otherwise. Model scores look good: BLEU 52.83, chrf3++ 68.55.

Of note is that the training set in almost entirely OT, with the exception of Eph 1-4.

bep (source) extract:

Rasiꞌnanto i oloꞌmi tiꞌarakou ara to kuriu, bateꞌraꞌ peá Kirisipu hai Gaiu,

In English (NIV),

I thank God that I did not baptize any of you except Crispus and Gaius,

Indonesian (target) inference:

Kepada pemimpin penyanyi. Dinyanyikan menurut lagu "Bunga Bakung." Lagu yang dikarang Daudiꞌ.
Translated Ind-Eng by Google Translate:

To the leader of the singers. Sung to the tune of "Daffodils." Song composed by Daudiꞌ.

The config for this model and the inference are in S:\MT\experiments\Indonesia\Behoa\NLLB.1.3B.bep-ABe.id-TBBe.

@ddaspit ddaspit added the bug Something isn't working label Feb 27, 2025
@ddaspit ddaspit moved this from 🆕 New to 🔖 Ready in SIL-NLP Research Feb 27, 2025
@ddaspit ddaspit added the pipeline 6: infer Issue related to using a trained model to translate. label Feb 27, 2025
@ddaspit ddaspit removed their assignment Feb 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working pipeline 6: infer Issue related to using a trained model to translate.
Projects
Status: 🔖 Ready
Development

No branches or pull requests

2 participants