You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
By using some combination of alignments, LLM's and Levenshtein distance, we should be able to determine the "proper form" of keywords in the target sentence, and tell NLLB-200 to include that specific surface form of the word. This could be done by doing the following:
Determining the matching of proper names for each verse from source to target
Giving this information to an LLM to say "when this word is used here in this context, the surface form looks like this" - give it a lot of examples. Put them in the context window
Ask the LLM: "For this new name in this context, what should the surface form be?"
Take those surface forms and feed them to the LLM to "guide" the decoding.
Implementing this relies upon the successful implementation of #178.
The text was updated successfully, but these errors were encountered:
By using some combination of alignments, LLM's and Levenshtein distance, we should be able to determine the "proper form" of keywords in the target sentence, and tell NLLB-200 to include that specific surface form of the word. This could be done by doing the following:
Implementing this relies upon the successful implementation of #178.
The text was updated successfully, but these errors were encountered: