Punctuation in Synonyms #468
Replies: 5 comments
-
Hi @nathanjfield 👋 I think it has to do with the fact that the soft/hard separators of the tokenizer are not also considered in the synonyms. @ManyTheFish can you confirm? Customizing the separators could be a solution so that Meilisearch can take
Thanks!
-
Hello @gmourier and @nathanjfield,
-
Hello everyone 👋 We just released a 🧪 prototype that allows customizing tokenization and we'd love your feedback.
How to get the prototype?
Using docker, use the following command:
From source, compile Meilisearch on the
How to use the prototype?
You can find all the details in the PR. Feedback and bug reporting when using this prototype are encouraged! Thanks in advance for your involvement. It means a lot to us ❤️
-
Hello everyone 👋 We have just released the first RC (release candidate) of Meilisearch containing this new feature! You can test it by using:
You are welcome to leave your feedback in this discussion. If you encounter any bugs, please report them here. 🎉 The official, stable release containing this change will be available on September 25th, 2023.
-
Hey folks 👋 v1.4.0 has been released! 🦓 You can now customize tokenization by adding or removing tokens from the separator tokens and non-separator tokens lists. ✨ Note:
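
For anyone landing here with the original 'm/c' problem, here is a minimal sketch (not taken from the release notes) of what such a settings update could look like. It assumes the `nonSeparatorTokens` and `synonyms` keys of the index settings route; the index name `reports`, the local URL, and the expansion of 'm/c' to 'machine' are placeholders made up for illustration. Please check the settings documentation for your version before relying on it.

```python
import json
import urllib.request


def build_settings_payload(non_separator_tokens, synonyms):
    """Build the JSON body for an index settings update.

    Assumption: the settings object accepts `nonSeparatorTokens` and `synonyms`.
    """
    return {
        "nonSeparatorTokens": sorted(set(non_separator_tokens)),
        "synonyms": {term: list(alts) for term, alts in synonyms.items()},
    }


def build_request(base_url, index_uid, payload, api_key=None):
    """Prepare (but do not send) a PATCH request to the index settings route."""
    headers = {"Content-Type": "application/json"}
    if api_key:
        headers["Authorization"] = f"Bearer {api_key}"
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/indexes/{index_uid}/settings",
        data=json.dumps(payload).encode("utf-8"),
        headers=headers,
        method="PATCH",
    )


# Hypothetical values: treat "/" as part of a word, then declare the shorthand.
payload = build_settings_payload(
    non_separator_tokens=["/"],
    synonyms={"m/c": ["machine"], "machine": ["m/c"]},
)
request = build_request("http://localhost:7700", "reports", payload)

# Sending requires a running instance, so it is left commented out:
# with urllib.request.urlopen(request) as response:
#     print(response.read().decode("utf-8"))
```

Tokenization settings generally take effect through reindexing, so it is probably safest to confirm the settings task has finished before testing the synonyms in search.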
-
I'm using Meilisearch in a live production environment for employee reports. As speed is essential when inputting reports, several people use shorthand when entering information. Examples include:
And so on.
The problem is that when synonyms are added to Meilisearch, the punctuation (forward slashes in this case) is removed, so 'm/c' becomes 'm c' and the synonyms no longer work in search.
I'd like the indexing/synonyms to include punctuation marks.
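
To illustrate the behaviour described above, here is a toy sketch of a separator-based tokenizer. It is only an approximation for discussion, not Meilisearch's actual tokenizer; the separator list and the sample text 'm/c fault' are made up for the example.

```python
import re


def tokenize(text, separators, non_separators=()):
    """Split `text` on every separator that is not listed in `non_separators`."""
    excluded = set(non_separators)
    active = [s for s in separators if s not in excluded]
    if not active:
        return [text] if text else []
    # Longest separators first so multi-character separators win over prefixes.
    pattern = "|".join(
        re.escape(s) for s in sorted(active, key=len, reverse=True)
    )
    return [token for token in re.split(pattern, text) if token]


# Hypothetical separator list for the illustration.
DEFAULT_SEPARATORS = [" ", "/", ",", "."]

# With "/" acting as a separator, the shorthand is split apart.
print(tokenize("m/c fault", DEFAULT_SEPARATORS))
# ['m', 'c', 'fault']

# With "/" excluded from the separators, the shorthand survives as one token.
print(tokenize("m/c fault", DEFAULT_SEPARATORS, non_separators=["/"]))
# ['m/c', 'fault']
```

In the first case a synonym keyed on 'm/c' has nothing to match against, which is the behaviour I'm seeing.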