目录
1. Metrics Server 与 Kubernetes 版本对照
1. Metrics Server 与 Kubernetes 版本对照
| Metrics Server | Metrics API group/version | Supported Kubernetes version |
|---|---|---|
| 0.8.x | metrics.k8s.io/v1beta1 | 1.31+ |
| 0.7.x | metrics.k8s.io/v1beta1 | 1.27+ |
| 0.6.x | metrics.k8s.io/v1beta1 | 1.25+ |
| 0.5.x | metrics.k8s.io/v1beta1 | *1.8+ |
| 0.4.x | metrics.k8s.io/v1beta1 | *1.8+ |
| 0.3.x | metrics.k8s.io/v1beta1 | 1.8-1.21 |
本文以社区版 v0.6.1 为例,说明安装流程和常见问题处理。
2. 下载部署清单
wget https://github.com/kubernetes-sigs/metrics-server/releases/download/v0.6.1/components.yaml
3. 修改镜像和配置(重要)
由于网络原因,直接拉取 k8s.gcr.io 的镜像可能失败,你需要替换部署文件中的镜像地址,并添加必要的启动参数。
3.1 替换镜像源
将原镜像 k8s.gcr.io/metrics-server/metrics-server:v0.6.1 替换为可访问的镜像源,例如阿里云镜像仓库。
# 修改 components.yaml 中的 image 字段
image: registry.aliyuncs.com/google_containers/metrics-server:v0.6.1
3.2 添加安全参数
在测试环境中,为避免 TLS 证书验证导致 metrics-server 无法启动,你需要在 Deployment 的 args 中添加以下参数。
- --kubelet-insecure-tls
kubectl edit deployment metrics-server -n kube-system
containers:
- args:
- --cert-dir=/tmp
- --secure-port=4443
# ... 可能还有其他参数 ...
- --kubelet-insecure-tls # 添加这一行
image: registry.k8s.io/metrics-server/metrics-server:vx.x.x # 您的镜像地址
name: metrics-server
不加参数- --kubelet-insecure-tls,会报错如下图所示:

4. 部署与验证
应用修改后的配置,并检查运行状态。
kubectl apply -f components.yaml
# 查看 metrics-server Pod 是否运行正常
kubectl get pods -n kube-system -l k8s-app=metrics-server
如果 Pod 状态为 Running,通常表示安装成功。你可以通过以下命令验证资源监控数据是否正常采集:
kubectl top nodes
kubectl top pods --all-namespaces
5. 完整的yaml文件
修改镜像和- --kubelet-insecure-tls参数后的yaml文件
apiVersion: v1
kind: ServiceAccount
metadata:
labels:
k8s-app: metrics-server
name: metrics-server
namespace: kube-system
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
labels:
k8s-app: metrics-server
rbac.authorization.k8s.io/aggregate-to-admin: "true"
rbac.authorization.k8s.io/aggregate-to-edit: "true"
rbac.authorization.k8s.io/aggregate-to-view: "true"
name: system:aggregated-metrics-reader
rules:
- apiGroups:
- metrics.k8s.io
resources:
- pods
- nodes
verbs:
- get
- list
- watch
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
labels:
k8s-app: metrics-server
name: system:metrics-server
rules:
- apiGroups:
- ""
resources:
- nodes/metrics
verbs:
- get
- apiGroups:
- ""
resources:
- pods
- nodes
verbs:
- get
- list
- watch
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
labels:
k8s-app: metrics-server
name: metrics-server-auth-reader
namespace: kube-system
roleRef:
apiGroup: rbac.authorization.k8s.io
kind: Role
name: extension-apiserver-authentication-reader
subjects:
- kind: ServiceAccount
name: metrics-server
namespace: kube-system
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
labels:
k8s-app: metrics-server
name: metrics-server:system:auth-delegator
roleRef:
apiGroup: rbac.authorization.k8s.io
kind: ClusterRole
name: system:auth-delegator
subjects:
- kind: ServiceAccount
name: metrics-server
namespace: kube-system
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
labels:
k8s-app: metrics-server
name: system:metrics-server
roleRef:
apiGroup: rbac.authorization.k8s.io
kind: ClusterRole
name: system:metrics-server
subjects:
- kind: ServiceAccount
name: metrics-server
namespace: kube-system
---
apiVersion: v1
kind: Service
metadata:
labels:
k8s-app: metrics-server
name: metrics-server
namespace: kube-system
spec:
ports:
- name: https
port: 443
protocol: TCP
targetPort: https
selector:
k8s-app: metrics-server
---
apiVersion: apps/v1
kind: Deployment
metadata:
labels:
k8s-app: metrics-server
name: metrics-server
namespace: kube-system
spec:
selector:
matchLabels:
k8s-app: metrics-server
strategy:
rollingUpdate:
maxUnavailable: 0
template:
metadata:
labels:
k8s-app: metrics-server
spec:
containers:
- args:
- --cert-dir=/tmp
- --secure-port=4443
- --kubelet-preferred-address-types=InternalIP,ExternalIP,Hostname
- --kubelet-use-node-status-port
- --metric-resolution=15s
- --kubelet-insecure-tls
image: registry.aliyuncs.com/google_containers/metrics-server:v0.6.1
imagePullPolicy: IfNotPresent
livenessProbe:
failureThreshold: 3
httpGet:
path: /livez
port: https
scheme: HTTPS
periodSeconds: 10
name: metrics-server
ports:
- containerPort: 4443
name: https
protocol: TCP
readinessProbe:
failureThreshold: 3
httpGet:
path: /readyz
port: https
scheme: HTTPS
initialDelaySeconds: 20
periodSeconds: 10
resources:
requests:
cpu: 100m
memory: 200Mi
securityContext:
allowPrivilegeEscalation: false
readOnlyRootFilesystem: true
runAsNonRoot: true
runAsUser: 1000
volumeMounts:
- mountPath: /tmp
name: tmp-dir
nodeSelector:
kubernetes.io/os: linux
priorityClassName: system-cluster-critical
serviceAccountName: metrics-server
volumes:
- emptyDir: {}
name: tmp-dir
---
apiVersion: apiregistration.k8s.io/v1
kind: APIService
metadata:
labels:
k8s-app: metrics-server
name: v1beta1.metrics.k8s.io
spec:
group: metrics.k8s.io
groupPriorityMinimum: 100
insecureSkipTLSVerify: true
service:
name: metrics-server
namespace: kube-system
version: v1beta1
versionPriority: 100
6. 注意事项
6.1 版本兼容性
不同版本的 metrics-server 对 Kubernetes 集群有兼容性要求。选择版本时,请务必参考官方或云服务商提供的兼容性矩阵。
6.2 生产环境安全
在上述示例中,我们使用了 --kubelet-insecure-tls 参数来跳过证书验证,这仅在测试环境中推荐使用。在生产环境中,为了安全起见,你应该配置和使用有效的 TLS 证书。
6.3 问题排查
如果安装后 metrics-server 无法正常运行,一个有效的排查方法是查看其 Pod 的日志
kubectl logs -n kube-system [metrics-server-pod-name]
1790

被折叠的 条评论
为什么被折叠?



