Elasticsearch: The Definitive Guide, Cross-Field Entity Search Explained

Article link: https://blog.youkuaiyun.com/gitblog_01111/article/details/148576086

elasticsearch-definitive-guide: The Definitive Guide to Elasticsearch. Project address: https://gitcode.com/gh_mirrors/el/elasticsearch-definitive-guide

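What Is Cross-Field Entity Search

Cross-field entity search is a common search pattern in Elasticsearch for handling entity information that is spread across multiple fields. It applies in particular to entities such as people, products, or addresses, whose identifying information is usually distributed over several fields.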

Typical Use Cases

Suppose we have the following two data structures:

Person:

{
    "firstname": "Peter",
    "lastname": "Smith"
}

Address:

{
    "street": "5 Poland Street",
    "city": "London",
    "country": "United Kingdom",
    "postcode": "W1V 3DG"
}

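When a user searches for "Peter Smith" or "Poland Street W1V", we need to query several fields at once to find a match, because no single field contains all of the search terms.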
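To make this concrete, here is a minimal Python sketch that checks which query terms land in which field of the address document. It assumes a toy analyzer (lowercase plus whitespace split) that only approximates what Elasticsearch's analyzers do; the helper names `tokenize` and `matched_terms_by_field` are hypothetical and exist only for this illustration.

```python
def tokenize(text):
    """Toy analyzer: lowercase and split on whitespace.

    This is an assumption for illustration, not Elasticsearch's real analysis chain.
    """
    return text.lower().split()


def matched_terms_by_field(doc, query):
    """For each field in doc, list the query terms that appear in that field."""
    query_terms = tokenize(query)
    result = {}
    for field, value in doc.items():
        field_terms = set(tokenize(value))
        result[field] = [term for term in query_terms if term in field_terms]
    return result


address = {
    "street": "5 Poland Street",
    "city": "London",
    "country": "United Kingdom",
    "postcode": "W1V 3DG",
}

matches = matched_terms_by_field(address, "Poland Street W1V")
for field, terms in matches.items():
    print(field, terms)
```

Under this toy analyzer, two of the terms are found only in street and the third only in postcode, so a query against any one field can match at most part of the search string.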

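Basic Implementation Approaches

Boolean Query Approach

The most straightforward solution is to run a match query against each field and wrap the queries in the should clause of a bool query: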

{
  "query": {
    "bool": {
      "should": [
        { "match": { "street": "Poland Street W1V" }},
        { "match": { "city": "Poland Street W1V" }},
        { "match": { "country": "Poland Street W1V" }},
        { "match": { "postcode": "Poland Street W1V" }}
      ]
    }
  }
}
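The repetition in this request body is easier to see when the body is generated. The following Python sketch builds the same structure from a list of field names; the function name `build_should_query` is a hypothetical helper, and the sketch only constructs the request body as a dictionary without sending it to a cluster.

```python
import json


def build_should_query(fields, query_string):
    """Build a bool query with one match clause per field.

    The same query string is repeated in every clause, which is the
    verbosity the bool/should approach suffers from.
    """
    return {
        "query": {
            "bool": {
                "should": [
                    {"match": {field: query_string}} for field in fields
                ]
            }
        }
    }


body = build_should_query(
    ["street", "city", "country", "postcode"],
    "Poland Street W1V",
)
print(json.dumps(body, indent=2))
```

Adding a fifth field means adding a fifth match clause that carries the same query string again.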

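Simplifying with a Multi-Match Query

To avoid repeating the query string, we can use a multi_match query with type set to most_fields, which combines the scores of all matching fields: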

{
  "query": {
    "multi_match": {
      "query": "Poland Street W1V",
      "type": "most_fields",
      "fields": ["street", "city", "country", "postcode"]
    }
  }
}
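As a rough sketch of how this form might be reused for both entities from the earlier examples, the Python snippet below builds the multi_match body for the address search and for the person search. The function name `build_multi_match_query` and its `match_type` parameter are hypothetical; the snippet only constructs the request body and leaves sending it to Elasticsearch to whatever client you use.

```python
import json


def build_multi_match_query(fields, query_string, match_type="most_fields"):
    """Build a multi_match query body; the query string appears only once."""
    return {
        "query": {
            "multi_match": {
                "query": query_string,
                "type": match_type,
                "fields": list(fields),
            }
        }
    }


# Address search across the four address fields.
address_query = build_multi_match_query(
    ["street", "city", "country", "postcode"],
    "Poland Street W1V",
)

# Person search across the two name fields.
person_query = build_multi_match_query(
    ["firstname", "lastname"],
    "Peter Smith",
)

print(json.dumps(address_query, indent=2))
print(json.dumps(person_query, indent=2))
```

Compared with the bool query, adding a field here only means appending one name to the fields list.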