Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

如何使用ik分词器搜索emoji表情?IK分词器会自动过滤Emoji和特殊符号表情。 #1067

Open
yeliheng opened this issue Jul 3, 2024 · 2 comments

Comments

@yeliheng
Copy link

yeliheng commented Jul 3, 2024

Description

IK分词器会自动过滤Emoji和特殊符号表情,我希望所有emoji也能够被正常分词,请问应该如何解决这个问题?

Steps to reproduce

image
image

Expected behavior

所有Emoji表情都被过滤了。

Environment

  • Versions: Elasticsearch 8.11.3(Docker)
@kin122
Copy link

kin122 commented Jul 29, 2024

emoji表情包最好还是单独用icu分词器去处理吧,ik并不支持

@yangzhongke
Copy link
Contributor

新PR已经解决这个问题,请更新
#1071
请验证后close这个issue

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants