修改默认配置,搜索CJK,找到 如下去掉注释,改成如下:
# n-gram length to index, for CJK indexing
# only supports 0 and 1 for now, other lengths to be implemented
# optional, default is 0 (disable n-grams)
#
ngram_len = 1
# n-gram characters list, for CJK indexing
# optional, default is empty
#
ngram_chars = U+3000..U+2FA1F
# phrase boundary characters list
# optional, default is empty
#
phrase_boundary = ., ?, !, U+2026 # horizontal ellipsis
# phrase boundary word position increment
# optional, default is 0
#
# phrase_boundary_step = 100
本文介绍如何通过修改配置文件来优化CJK字符的索引功能,包括调整n-gram长度、设置n-gram字符范围及短语边界等参数。

6761

被折叠的 条评论
为什么被折叠?



