Reinstalling a KubeSphere cluster fails with an etcd health check error


Operating system
e.g. virtual machine, CentOS 7.9, 8C/16G

Kubernetes version
kubectl version
Client Version: version.Info{Major:"1", Minor:"25", GitVersion:"v1.25.4", GitCommit:"872a965c6c6526caa949f0c6ac028ef7aff3fb78", GitTreeState:"clean", BuildDate:"2022-11-09T13:36:36Z", GoVersion:"go1.19.3", Compiler:"gc", Platform:"linux/amd64"}

Kustomize Version: v4.5.7

Server Version: version.Info{Major:"1", Minor:"22", GitVersion:"v1.22.12", GitCommit:"b058e1760c79f46a834ba59bd7a3486ecf28237d", GitTreeState:"clean", BuildDate:"2022-07-13T14:53:39Z", GoVersion:"go1.16.15", Compiler:"gc", Platform:"linux/amd64"}

Container runtime
Paste the output of docker version / crictl version / nerdctl version below

KubeSphere version
e.g. v3.3.0. Offline installation. Installed on an existing K8s cluster or with kk.

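What is the problem
1. The first installation with KubeKey went through without problems, but I then noticed that Gluster storage had not been configured, so I tore the environment down (the kubesphere-master and kubesphere-node nodes). The kubekey node was kept as it was and not reinstalled.

2. When I run the installation again on the kubekey node (./kk create cluster -f config-sample.yaml), it keeps reporting an etcd certificate problem. Reinstalling the operating system on the kubesphere-master and kubesphere-node nodes once more did not make the error go away.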


    The error is as follows:

    etcd health check failed: Failed to exec command: sudo -E /bin/bash -c "export ETCDCTL_API=2;export ETCDCTL_CERT_FILE='/etc/ssl/etcd/ssl/admin-ks-master.pem';export ETCDCTL_KEY_FILE='/etc/ssl/etcd/ssl/admin-ks-master-key.pem';export ETCDCTL_CA_FILE='/etc/ssl/etcd/ssl/ca.pem';/usr/local/bin/etcdctl --endpoints=https://172.26.160.18:2379 cluster-health | grep -q 'cluster is healthy'"
    : Process exited with status 1
    17:23:54 CST retry: [ks-master]
    17:23:59 CST message: [ks-master]
    etcd health check failed: Failed to exec command: sudo -E /bin/bash -c "export ETCDCTL_API=2;export ETCDCTL_CERT_FILE='/etc/ssl/etcd/ssl/admin-ks-master.pem';export ETCDCTL_KEY_FILE='/etc/ssl/etcd/ssl/admin-ks-master-key.pem';export ETCDCTL_CA_FILE='/etc/ssl/etcd/ssl/ca.pem';/usr/local/bin/etcdctl --endpoints=https://172.26.160.18:2379 cluster-health | grep -q 'cluster is healthy'"
    : Process exited with status 1
    17:23:59 CST retry: [ks-master]
    17:24:04 CST message: [ks-master]

3. Running this command on its own on the kubesphere-master node produces no error.
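One thing worth ruling out when rerunning the check by hand: the pipeline ends in `grep -q`, which prints nothing whether or not it finds a match, so "no error on screen" does not by itself mean the check passed. A minimal sketch of that behaviour follows. The helper name `check_health_output` and both sample inputs are made up for illustration; they are not real output from this cluster.

```shell
# Hypothetical helper: mirrors only the last stage of the pipeline in the log,
# i.e. `... cluster-health | grep -q 'cluster is healthy'`.
check_health_output() {
  grep -q 'cluster is healthy'
}

# Neither call prints anything by itself; only the exit status differs.
# The input strings are illustrative samples, not captured etcdctl output.
printf 'member 1 is healthy\ncluster is healthy\n' | check_health_output
echo "healthy sample -> exit status $?"

printf 'cluster is unavailable or misconfigured\n' | check_health_output
echo "unhealthy sample -> exit status $?"
```

When running the real command on the master node, appending `; echo $?` (or dropping the `| grep -q ...` part to see what etcdctl actually prints) shows whether the manual run really succeeded.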

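Could anyone give me some pointers? Thanks.

After reinstalling all of the systems and running the installation again, it now reports that etcd cannot start: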

Job for etcd.service failed because the control process exited with error code. See "systemctl status etcd.service" and "journalctl -xe" for details.

22:02:00 CST message: [ks-master]

start etcd failed: Failed to exec command: sudo -E /bin/bash -c "systemctl daemon-reload && systemctl restart etcd && systemctl enable etcd"

Job for etcd.service failed because the control process exited with error code. See "systemctl status etcd.service" and "journalctl -xe" for details.: Process exited with status 1

22:02:00 CST retry: [ks-master]

1 year later

Did you ever get this solved? I have run into the same problem.

5 months later
1 month later

First, check whether the etcd service is running:
systemctl status etcd.service

Check whether the firewall has ports 2379 and 2380 open.
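A rough sketch of the port check, assuming the node uses firewalld (the default on CentOS 7) and that the ports were opened individually rather than as a range. `missing_etcd_ports` is a hypothetical helper, and the sample port list is made up.

```shell
# Hypothetical helper: prints each required etcd port that is absent from a
# space-separated list such as the one `firewall-cmd --list-ports` prints.
missing_etcd_ports() {
  for _port in 2379/tcp 2380/tcp; do
    case " $1 " in
      *" $_port "*) ;;          # port is listed, nothing to report
      *) echo "$_port" ;;       # port is not listed
    esac
  done
}

# Sample input; on a real node you might pass "$(sudo firewall-cmd --list-ports)".
missing_etcd_ports "22/tcp 2379/tcp"
```

The helper does not understand ranges such as `2379-2380/tcp`, and an empty result only says something if firewalld is actually running; with the firewall stopped, the ports are not filtered at all.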

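For a fresh deployment, the key point is the etcd configuration file: take a look at /etc/etcd.env.

initial-cluster-state: if this is a new cluster, make sure it is set to new. If the node is a member of an existing cluster, it should be existing.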

--initial-cluster-state=new
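In an env file, etcd reads this flag from the `ETCD_INITIAL_CLUSTER_STATE` variable (etcd maps each flag to an `ETCD_`-prefixed upper-case name). The sketch below assumes /etc/etcd.env uses plain `KEY=value` lines; `env_file_value` is a hypothetical helper, and the demo file contents are invented so the snippet can run anywhere.

```shell
# Hypothetical helper: prints the value of KEY ($2) from a KEY=value file ($1).
# If the key appears more than once, the last occurrence wins.
env_file_value() {
  sed -n "s/^$2=//p" "$1" | tail -n 1
}

# Demo against a throwaway file; on a node the file would be /etc/etcd.env.
demo_env="$(mktemp)"
printf 'ETCD_NAME=etcd-ks-master\nETCD_INITIAL_CLUSTER_STATE=existing\n' > "$demo_env"

state="$(env_file_value "$demo_env" ETCD_INITIAL_CLUSTER_STATE)"
if [ "$state" = "new" ]; then
  echo "initial-cluster-state is new"
else
  echo "initial-cluster-state is '$state', expected new for a fresh cluster"
fi
rm -f "$demo_env"
```

On the master node the same check would be `env_file_value /etc/etcd.env ETCD_INITIAL_CLUSTER_STATE`, run with enough privileges to read the file.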