We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
请问分词结果怎么过滤单个字符呢?如果源词就只有一个字符那么就直接返回源词,如果原来的词是多个字符例如 “我是中国人”, 那么分词结果只保留 “我是中国人”, “我是”,“中国人”, “中国”,不再要“人”
The text was updated successfully, but these errors were encountered:
这个需求可以使用solr自带的 Length Filter 过滤器来实现。
Length Filter
示例如下:
<analyzer> <tokenizer class="solr.StandardTokenizerFactory"/> <filter class="solr.LengthFilterFactory" min="2" max="7"/> </analyzer>
将该过滤器配置在 ik 分词器的过滤器列表里即可。
Sorry, something went wrong.
十分感谢🙏
No branches or pull requests
请问分词结果怎么过滤单个字符呢?如果源词就只有一个字符那么就直接返回源词,如果原来的词是多个字符例如 “我是中国人”, 那么分词结果只保留 “我是中国人”, “我是”,“中国人”, “中国”,不再要“人”
The text was updated successfully, but these errors were encountered: