故障描述
下午家里停电,恢复供电之后发现K8S集群不能自动启动了,检查发现3个ETCD节点服务不能启动,报错日志如下:
Jun 24 19:45:05 etcd1 systemd[1]: Starting Etcd Server...
-- Subject: Unit etcd.service has begun start-up
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
--
-- Unit etcd.service has begun starting up.
Jun 24 19:45:05 etcd1 etcd[1453]: recognized and used environment variable ETCD_ADVERTISE_CLIENT_URLS=https://192.168.50.60:2379
Jun 24 19:45:05 etcd1 etcd[1453]: recognized and used environment variable ETCD_CERT_FILE=/etc/etcd/ssl/etcd.pem
Jun 24 19:45:05 etcd1 etcd[1453]: recognized and used environment variable ETCD_INITIAL_ADVERTISE_PEER_URLS=https://192.168.50.60:2380
Jun 24 19:45:05 etcd1 etcd[1453]: recognized and used environment variable ETCD_INITIAL_CLUSTER=etcd1=https://192.168.50.60:2380,etcd2=https://192.168.50.61:2380,etcd3=https://192.168.50.62:2380
Jun 24 19:45:05 etcd1 etcd[1453]: recognized and used environment variable

在一次停电事故后,K8S集群中的ETCD服务无法启动,显示数据库文件损坏。通过对错误日志的分析,发现第2节点的数据库文件是最新的且无错误。通过将2节点的文件复制到其他节点,成功恢复ETCD集群正常运行。该事件强调了在处理问题时谨慎操作以及数据库备份的重要性。
最低0.47元/天 解锁文章
7299

被折叠的 条评论
为什么被折叠?



