You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
My team uses your yake extractor. We get pretty good results.
We would like to enable stemming. However, after studying the source code the stemming implementation seems to have two major shortcomings. implementation
The final weight calculation with stemming doesn't consider the term-frequency. In contrast, the plain-word implementation does that. (line 358, 390) Additionally, the word-implementation also differentiates between stopwords and non-stopwords. (line 365-383)
Is it possible to offer the same great possibilities with the stemmed-version? In general, should we go with the stem or non-stem version? What would you suggest?
My team and I are grateful for any help. :)
The text was updated successfully, but these errors were encountered:
Hi,
I tried some things. And I need to ensure the performance are the same as before, so I need to find a way to prove how the updated code is backwards compatible.
My team uses your yake extractor. We get pretty good results.
We would like to enable stemming. However, after studying the source code the stemming implementation seems to have two major shortcomings.
implementation
The final weight calculation with stemming doesn't consider the term-frequency. In contrast, the plain-word implementation does that. (line 358, 390) Additionally, the word-implementation also differentiates between stopwords and non-stopwords. (line 365-383)
Is it possible to offer the same great possibilities with the stemmed-version? In general, should we go with the stem or non-stem version? What would you suggest?
My team and I are grateful for any help. :)
The text was updated successfully, but these errors were encountered: