hchen-gogogo-优快云博客

原创 Resource Limits、QOS、Priority 以及Evction整理

Kubernetes 通过资源请求（Requests）和限制（Limits）来管理 Pod 的资源使用；通过 QoS 类别来管理 Pod 的资源分配和驱逐策略；通过 PriorityClass，可以为 Pod 设置不同的优先级，从而在资源竞争时决定哪些 Pod 应该被优先调度或保留。

2025-02-19 13:32:41 674

原创 apt --fix-broken install错误处理

【代码】apt --fix-broken install错误处理。

2024-12-09 11:24:03 579

原创 Kubernetes 污点和容忍

kubernetes Taint 和 Toleration 注意点

2024-12-07 16:15:19 703

原创通过 kubernetes 的 pod 提供 samba 服务

通过 kubernetes 的 pod 提供 samba 服务

2024-12-05 15:57:27 225

在Kubernetes中，Horizontal Pod Autoscaler (HPA) 根据Pod中所有容器的总资源使用情况来自动扩展Pod的数量。如果打算基于CPU使用率设置HPA，需要为每个容器指定CPU的请求（requests）和限制（limits）。资源请求（requests）：这告诉Kubernetes调度器在进行Pod调度时，每个容器至少需要保证的CPU资源。这是HPA计算资源使用率和决定是否扩展或缩减的基础。资源限制（limits）：这定义了容器可以使用的最大CPU资源。

2024-11-21 14:59:47 778 1

原创白话Linux Namespace

Linux Container namespace docker kubernetes

2024-08-14 11:22:13 967

原创扩容 LVM

没有基础“姿势”，直接快进到实操。

2024-07-11 10:45:39 301 1

原创 pod 禁止执行的 dmesg

在 Kubernetes 环境中，尽管 Pod 提供了与主机系统隔离的运行环境，但它们仍然与宿主机共享某些底层资源和环境，如内核。这种设计使得在任何 Pod 内部执行的 dmesg 命令显示的是宿主机的内核日志，而不是特定于 Pod 的日志。内核共享：所有在同一宿主机上的容器（包括 Pod 内的容器）都运行在同一个操作系统内核之上。dmesg 命令显示的是内核的环环缓冲区消息，这些消息是系统级别的，不是隔离在单独的容器或 Pod 中的。

2024-04-27 16:29:57 601

原创 node-driver-registrar 与 csi-plugin 的关联

node driver registrar 和 csi plugin 的关联，明白 kubelet 调用 csi 的大体流程。

2024-04-02 17:51:31 523

原创 prometheus federate ingesting out-of-order samples error

prometheus federate msg="Error on ingesting out-of-order samples" num_dropped

2024-01-17 15:24:40 1073 1

原创多节点 docker 部署 elastic 集群

在多个节点中通过 docker 部署 elk 集群

2024-01-12 11:32:17 1104

原创申请免费的泛域名证书

申请 google 的泛域名证书,域名托管在 cloudflare, 自动更新证书。

2023-12-06 11:33:26 842

原创 OverlayFS

OverlayFS 的这种设计非常适合容器化的环境，它提供了一种轻量级、高效和灵活的方式来管理容器的文件系统。

2023-12-04 15:19:28 540

原创 grafana alert state error

grafana alert 状态为 error 的处理

2023-11-27 17:26:18 772

原创 k8s 利用 root-ceph 使用外部 ceph 集群

kubernetes 使用外部 ceph cluster 作为 volume.

2023-11-14 16:34:55 383

原创 cephadm 创建 ceph cluster

默认情况下， ceph.conf 文件和 client.admin 密钥环的副本保留在具有 _admin 标签的所有主机上的 /etc/ceph 中。我们通常建议为一台或多台其他主机指定 _admin 标签，以便可以在多台主机上轻松访问 Ceph CLI（例如，通过 cephadm shell ）。fsid 可以通过 cephadm ls, docker ps, cat /etc/ceph/ceph.conf 等来查找。3 个 monitor， 3 个 osd，3 个 mds， 3 个 rgw。

2023-11-09 18:03:53 266

原创 grafana ui 自动添加特殊字符

通过 grafana(当前使用的古董版本: v6.5.0) 编辑现有的查询语句，光标自动移动到最后了，直接复制编辑完后的语句到 prometheus 执行报错如下如果不看 markdown 原文，完全看不出差别。

2023-10-30 11:31:17 558 1

原创 Containerd container_fs 缺失

containerd 缺乏 container_fs 指标

2023-10-25 17:56:05 501 3

原创 ceph 创建 static pvc

静态 static pvc

2023-10-24 18:36:16 195

原创 HEALTH_ERR 1 filesystem is degraded, 1 filesystem is offline , 1 mds daemon damaged - Monitors have

HEALTH_ERR 1 filesystem is degraded, 1 filesystem is offline , 1 mds daemon damaged - Monitors have assigned me to become a standbyceph health detailceph mds statceph fs dumpceph fs statusceph fs ls修复不知其然，不知其所以然。。。

2023-10-23 15:50:21 1141

原创 rook-ceph rbd image is still being used

Warning FailedMount 5m6s (x208 over 9h) kubelet MountVolume.MountDevice failed for volume "pvc-xxxxxxx" : rpc error: code = Internal desc = rbd image replicapool/csi-vol-xxxxxxxx is still being used

2023-09-04 15:28:30 412

原创 elastic 删除 index 错误action [indices:admin/delete] is unauthorized for user [admin] with roles

security_exception: action [indices:admin/delete] is unauthorized for user [admin] with roles [superuser] on indices [.kibana_task_manager_8.2.3_001], this action is granted by the index privileges [delete_index,manage,all]

2023-09-02 11:12:05 1734 1

原创 k8s 从 1.19.10 升级至 1.25.12

跨度很大的成功升级 k8s(从v1.19.10 至 v1.25.12)

2023-07-26 18:52:56 481

原创 Grafana - Alert 时间/状态

在 Grafana Alert 中，Evaluate every 和 Evaluate for 是两个重要的配置选项，用于定义告警的评估周期和持续时间。

2023-03-20 11:35:53 4633

原创 LXCFS 测试遇到的一些问题汇总

lxcfs kubernetes docker container volumes

2022-11-29 17:07:56 786

原创使用 Postgres 作为 Grafana 后端，并迁移数据

单纯的使用 PG 用来当 Grafana 的后端存储，这个倒是简单，难的是怎么样把现有数据迁移过去。网络上的方案千千万，N 种方式失败之后终寻得成功。

2022-11-14 15:35:05 973

原创 Node Problem Detector

NPD 是一个可监控节点的健康状况并检测常见节点问题(例如硬件、内核或容器运行时问题等等)的开源项目

2022-10-21 14:35:53 1011

原创 Containerd 安装及基础使用

containerd docker kubernetes

2022-10-10 09:42:04 1214

原创 Docker, Kubernetes, CRI, OCI, Containerd, Runc 之间的渊源

Docker, Kubernetes, CRI, OCI, Containerd, Runc 之间的渊源

2022-10-09 10:48:28 627

原创 ETCD替换成员

ETCD 替换节点，需要基础概念还是需要先看看官文。很多的操作官文写的很清楚。只是根据实际情况需要变通一下。

2022-07-02 14:06:46 943

原创 Grafana导入 json 文件的 dashboard 错误 Templating Failed to upgrade legacy queries Datasource xxx not found

Templating Failed to upgrade legacy queries Datasource xxx not found

2022-06-14 19:39:57 9277

原创 ceph osd 占用太多的内存

OSD and MON memory consumptionCeph OSD Pod memory consumption very highCeph Cluster CRDCeph HardWare Recommendations

2022-06-09 13:39:18 2223

原创 rook-ceph 迁移 mon 到其他 node

在 kubernetes 环境种优化的、无损的迁移 ceph monitor 到其他节点

2022-05-30 17:59:25 660

原创 Grafana 迁移配置到新的环境

Grafana 默认数据库 sqlite3 迁移到新的环境

2022-05-27 16:25:00 1912

原创 Kubernetes CSI

kubernetes csi volume storage linux

2022-05-09 17:30:17 891

原创 zalando postgres-operator backup/clone

这家伙忒懒了，撒都没有留下。

2022-03-02 18:13:51 587

原创 prometheus operator servicemonitor 添加 label

serviceMonitorSepc.[]endpoint.relabelconfighttps://github.com/prometheus-operator/prometheus-operator/blob/main/Documentation/api.md#servicemonitorspechttps://github.com/prometheus-operator/prometheus-operator/blob/main/Documentation/api.md#servicemonito

2022-02-17 16:27:03 2899

原创 prometheus operator federation

prometheus 通过 prometheus-operator 配置的，怎么样配置联邦呢？先来个官网看看什么是联邦：Federation | Prometheushttps://prometheus.io/docs/prometheus/latest/federation/operator 添加 prometheus 的配置prometheus-operator/additional-scrape-config.md at main · prometheus-operator/prome

2022-02-17 16:11:45 822

原创 POD QoS

Configure Quality of Service for Pods | KubernetesThis page shows how to configure Pods so that they will be assigned particular Quality of Service (QoS) classes. Kubernetes uses QoS classes to make decisions about scheduling and evicting Pods.Before you b

2022-02-16 18:35:00 441

原创 Kubelet: Pod Lifecycle Event Generator (PLEG)

design-proposals-archive/pod-lifecycle-event-generator.md at main · kubernetes/design-proposals-archive · GitHubPod Lifecycle Event Generator: Understanding the "PLEG is not healthy" issue in Kubernetes | Red Hat Developerhttps://developers.redhat.com/

2022-02-16 17:08:09 509

空空如也

TA创建的收藏夹 TA关注的收藏夹

TA关注的人

sqlite 中文手册

程序员面试宝典（好东西）

c/c++程序员面试宝典

空空如也