Elasticsearch学习笔记(二) 重要的Query DSL

博客围绕数据查询展开,介绍了准备数据后,多种查询方式,如查询部分字段、match、match_phrase等,还阐述了must、should、must_not等条件的使用,以及组合多查询的SQL表示,最后提到不影响评分的filter的用法。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

 

0. 准备好数据

"hits": [
      {
        "_index": "pigg",
        "_type": "_doc",
        "_id": "1",
        "_score": null,
        "_source": {
          "name": "盐城冬冬",
          "age": 30,
          "hometown": "盐城",
          "gender": "male",
          "interesting": "watching TV"
        },
        "sort": [
          "1"
        ]
      },
      {
        "_index": "pigg",
        "_type": "_doc",
        "_id": "2",
        "_score": null,
        "_source": {
          "name": "珣爷",
          "age": 28,
          "hometown": "徐州",
          "gender": "female",
          "interesting": "watching movie"
        },
        "sort": [
          "2"
        ]
      },
      {
        "_index": "pigg",
        "_type": "_doc",
        "_id": "3",
        "_score": null,
        "_source": {
          "name": "米可",
          "age": 1,
          "hometown": "苏州",
          "gender": "female"
        },
        "sort": [
          "3"
        ]
      }
    ]

1. 查询部分字段

GET /pigg/_search
{
  "_source": ["name", "age"]
}

2. match

#查询interesting匹配"watching TV"
GET /pigg/_search
{
  "query": {
    "match": {
      "interesting": "watching  TV"
    }
  }
}

返回如下:

"hits": [
      {
        "_index": "pigg",
        "_type": "_doc",
        "_id": "1",
        "_score": 0.5753642,
        "_source": {
          "name": "盐城冬冬",
          "age": 30,
          "hometown": "盐城",
          "gender": "male",
          "interesting": "watching TV"
        }
      },
      {
        "_index": "pigg",
        "_type": "_doc",
        "_id": "2",
        "_score": 0.2876821,
        "_source": {
          "name": "珣爷",
          "age": 28,
          "hometown": "徐州",
          "gender": "female",
          "interesting": "watching movie"
        }
      }
    ]

看到结果也返回了"interesting"= "watching movie"的数据, 其中id=1的_score要比id=2的要高,
这个说明是匹配的程度,id=1的要比id=2的更加匹配

#查询interesting匹配"TV"或者"moive"
GET /pigg/_search
{
  "query": {
    "match": {
      "interesting": "TV movie"
    }
  }
}

返回结构如下:

"hits": {
    "total": 2,
    "max_score": 0.2876821,
    "hits": [
      {
        "_index": "pigg",
        "_type": "_doc",
        "_id": "2",
        "_score": 0.2876821,
        "_source": {
          "name": "珣爷",
          "age": 28,
          "hometown": "徐州",
          "gender": "femal",
          "interesting": "watching movie"
        }
      },
      {
        "_index": "pigg",
        "_type": "_doc",
        "_id": "1",
        "_score": 0.2876821,
        "_source": {
          "name": "盐城冬冬",
          "age": 30,
          "hometown": "盐城",
          "gender": "male",
          "interesting": "watching TV"
        }
      }
    ]

上面结果命中了2个人, "_score"都是0.2876821,说明匹配度两者相同

#查询age=30的
GET /pigg/_search
{
  "query": {
    "match": {
      "age": 30
    }
  }
}

3. match_phrase

#短语查询,这个会将"watching TV"作为一个短语去进行匹配查询
GET /pigg/_search
{
  "query": {
    "match_phrase": {
      "interesting": "watching TV"
    }
  }
}

返回结果如下:

"hits": [
      {
        "_index": "pigg",
        "_type": "_doc",
        "_id": "1",
        "_score": 0.5753642,
        "_source": {
          "name": "盐城冬冬",
          "age": 30,
          "hometown": "盐城",
          "gender": "male",
          "interesting": "watching TV"
        }
      }
    ]

4.must

查询interesting匹配"watching TV",并且gender匹配"female"

GET /pigg/_search
{
  "query": {
    "bool": {
      "must": [
        { "match": { "interesting": "watching TV" }},
        { "match": {"gender": "female" }}
      ]
    }
  }
}

返回结果如下:

    "hits": [
      {
        "_index": "pigg",
        "_type": "_doc",
        "_id": "2",
        "_score": 0.5753642,
        "_source": {
          "name": "珣爷",
          "age": 28,
          "hometown": "徐州",
          "gender": "female",
          "interesting": "watching movie"
        }
      }
    ]

{ “match”: { “interesting”: “watching TV” }}这条件语句能返回id=1或2的数据
{ “match”: {“gender”: “female” }}这条件语句能返回id=2或3的数据
这两条语句是且的关系,所有最后返回id=2的数据

5. should

5.1查询interesting匹配"watching mobile",gender匹配"female"

#查询interesting匹配"watching mobile",或gender匹配"female"
GET /pigg/_search
{
  "query": {
    "bool": {
      "should": [
        { "match": { "interesting": "watching mobile" }},
        { "match": {"gender": "female" }}
      ]
    }
  }
}

返回结果如下:

"hits": [
      {
        "_index": "pigg",
        "_type": "_doc",
        "_id": "2",
        "_score": 0.5753642,
        "_source": {
          "name": "珣爷",
          "age": 28,
          "hometown": "徐州",
          "gender": "female",
          "interesting": "watching movie"
        }
      },
      {
        "_index": "pigg",
        "_type": "_doc",
        "_id": "1",
        "_score": 0.2876821,
        "_source": {
          "name": "盐城冬冬",
          "age": 30,
          "hometown": "盐城",
          "gender": "male",
          "interesting": "watching TV"
        }
      },
      {
        "_index": "pigg",
        "_type": "_doc",
        "_id": "3",
        "_score": 0.2876821,
        "_source": {
          "name": "米可",
          "age": 1,
          "hometown": "苏州",
          "gender": "female"
        }
      }
    ]

从上面结果看,id=2的数据匹配得分最高,另外两个匹配度相同
注意这次查询的是"watching mobile",不是"watching TV"

5.2 minimum_should_match

这个是指或的条件,必须满足多少条,下面的minimum_should_match=2,所以一条都查不到

GET /pigg/_search
{
  "query": {
    "bool": {
      "should": [
        {
          "range": {
            "age": {
              "gte": 0,
              "lte": 3
            }
          }
        },
        {
         "match": { "hometown.keyword": "徐州" }
        }
      ],
      "minimum_should_match": 2
    }
  }
}

 

6. must_not

查询interesting不匹配 “watching TV”,
并且
gender不匹配 “female”

GET /pigg/_search
{
  "query": {
    "bool": {
      "must_not": [
        { "match": {"interesting": "watching movie"} },
        { "match": {"gender": "female"} }
      ]
    }
  }
}

查询结果是空的,没有匹配的数据

7. 组合多查询

用SQL表示如下
where gender != ‘male’ and ( (age >= 0 and age <= 3) or hometown = ‘徐州’ )

GET /pigg/_search
{
  "query": {
    "bool": {
      "must_not": [
        {
          "match": {
            "gender": "male"
          }
        }
      ],
      "should": [
        {
          "range": {
            "age": {
              "gte": 0,
              "lte": 3
            }
          }
        },
        {
         "match": {
           "hometown.keyword": "徐州"
         }
        }
      ]
    }
  }
}

返回结果如下:

"hits": [
      {
        "_index": "pigg",
        "_type": "_doc",
        "_id": "3",
        "_score": 1,
        "_source": {
          "name": "米可",
          "age": 1,
          "hometown": "苏州",
          "gender": "female"
        }
      },
      {
        "_index": "pigg",
        "_type": "_doc",
        "_id": "2",
        "_score": 0.2876821,
        "_source": {
          "name": "珣爷",
          "age": 28,
          "hometown": "徐州",
          "gender": "female",
          "interesting": "watching movie"
        }
      }
    ]

8.不影响评分的filter

如果不希望age的比较影响评分,可以放到filter里

GET /pigg/_search
{
  "query": {
    "bool": {
      "must": [
        {
          "match": {"interesting": "watching TV"}
        }
      ], 
      "filter": {
        "range": {
          "age": {
            "gte": 10,
            "lte": 29
          }
        }
      }
    }
  }
}

--------------------- 
作者:盐城三爷 
来源:优快云 
原文:https://blog.youkuaiyun.com/winterking3/article/details/82896738 
版权声明:本文为博主原创文章,转载请附上博文链接!

电动汽车数据集:2025年3K+记录 真实电动汽车数据:特斯拉、宝马、日产车型,含2025年电池规格和销售数据 关于数据集 电动汽车数据集 这个合成数据集包含许多品牌和年份的电动汽车和插电式车型的记录,捕捉技术规格、性能、定价、制造来源、销售和安全相关属性。每一行代表由vehicle_ID标识的唯一车辆列表。 关键特性 覆盖范围:全球制造商和车型组合,包括纯电动汽车和插电式混合动力汽车。 范围:电池化学成分、容量、续航里程、充电标准和速度、价格、产地、自主水平、排放、安全等级、销售和保修。 时间跨度:模型跨度多年(包括传统和即将推出的)。 数据质量说明: 某些行可能缺少某些字段(空白)。 几个分类字段包含不同的、特定于供应商的值(例如,Charging_Type、Battery_Type)。 各列中的单位混合在一起;注意kWh、km、hr、USD、g/km和额定值。 列 列类型描述示例 Vehicle_ID整数每个车辆记录的唯一标识符。1 制造商分类汽车品牌或OEM。特斯拉 型号类别特定型号名称/变体。型号Y 与记录关联的年份整数模型。2024 电池_类型分类使用的电池化学/技术。磷酸铁锂 Battery_Capacity_kWh浮充电池标称容量,单位为千瓦时。75.0 Range_km整数表示充满电后的行驶里程(公里)。505 充电类型主要充电接口或功能。CCS、NACS、CHAdeMO、DCFC、V2G、V2H、V2L Charge_Time_hr浮动充电的大致时间(小时),上下文因充电方法而异。7.5 价格_USD浮动参考车辆价格(美元).85000.00 颜色类别主要外观颜色或饰面。午夜黑 制造国_制造类别车辆制造/组装的国家。美国 Autonomous_Level浮点自动化能力级别(例如0-5),可能包括子级别的小
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值