Solo.io Gloo 请求重试机制深度解析-优快云博客

Solo.io Gloo 请求重试机制深度解析

gloo The Feature-rich, Kubernetes-native, Next-Generation API Gateway Built on Envoy 项目地址: https://gitcode.com/gh_mirrors/glo/gloo

什么是请求重试机制

在分布式系统中，网络请求失败是常见现象，特别是临时性网络错误（transient errors）。Solo.io Gloo 提供了强大的请求重试机制，允许开发者为特定路由配置重试策略，从而提高系统的健壮性和可靠性。

重试策略核心参数

Gloo 的重试机制主要通过三个核心参数进行配置：

retryOn：定义触发重试的条件，支持多种错误类型组合
numRetries：指定最大重试次数（默认1次）
perTryTimeout：设置每次重试的超时时间

retryOn 详解

retryOn 参数基于 Envoy 的重试机制，支持以下常见错误类型：

connect-failure：连接失败
refused-stream：上游拒绝流
unavailable：服务不可用
5xx：服务器返回5xx错误码
gateway-error：网关错误（502/503/504）
reset：连接被重置

多个条件可以用逗号分隔组合使用，例如："connect-failure,5xx"

配置示例解析

下面是一个完整的 VirtualService 配置示例，展示了如何为特定路由设置重试策略：

apiVersion: gateway.solo.io/v1
kind: VirtualService
metadata:
  name: 'default'
  namespace: 'gloo-system'
spec:
  virtualHost:
    domains:
    - '*'
    routes:
    - matchers:
       - prefix: '/petstore'
      routeAction:
        single:
          upstream:
            name: 'default-petstore-8080'
            namespace: 'gloo-system'
      options:
        retries:
          retryOn: 'connect-failure,5xx'
          numRetries: 3
          perTryTimeout: '5s'

在这个配置中：