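Yesterday, with help from our group owner, I successfully completed the All-in-one single-node installation. Today I removed the single node and then installed Multi-node, and I'm again hitting the same two kinds of problems I ran into before: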

  1. Nodes cannot be found. The error output is:
    2020-05-15 18:59:33,204 p=23124 u=root | fatal: [k8sphere02]: FAILED! => {
    "changed": true,
    "cmd": "/usr/local/bin/kubectl annotate node k8sphere02 alpha.kubernetes.io/provided-node-ip=192.168.108.73",
    "delta": "0:00:00.301397",
    "end": "2020-05-15 18:59:33.176589",
    "failed_when_result": true,
    "rc": 1,
    "start": "2020-05-15 18:59:32.875192"
    }
    STDERR:
    Error from server (NotFound): nodes "k8sphere02" not found
    MSG:
    non-zero return code

    2020-05-15 18:59:33,349 p=23124 u=root | fatal: [k8sphere03]: FAILED! => {
    "changed": true,
    "cmd": "/usr/local/bin/kubectl annotate node k8sphere03 alpha.kubernetes.io/provided-node-ip=192.168.108.45",
    "delta": "0:00:00.377296",
    "end": "2020-05-15 18:59:33.316986",
    "failed_when_result": true,
    "rc": 1,
    "start": "2020-05-15 18:59:32.939690"
    }
    STDERR:
    Error from server (NotFound): nodes "k8sphere03" not found
    MSG:
    non-zero return code

  2. The ks-installer installation fails:
    fatal: [k8sphere01]: FAILED! => {
    "attempts": 30,
    "changed": true,
    "cmd": "/usr/local/bin/kubectl get pod -n kubesphere-system | grep Running | grep ks-installer",
    "delta": "0:00:00.072270",
    "end": "2020-05-15 19:15:20.891646",
    "rc": 1,
    "start": "2020-05-15 19:15:20.819376"
    }
    MSG:
    non-zero return code

Running the following command produces this output:

    [root@k8sphere01 ]# kubectl describe pod ks-installer-7d9fb945c7-b5bkp -n kubesphere-system
    Name:           ks-installer-7d9fb945c7-b5bkp
    Namespace:      kubesphere-system
    Priority:       0
    Node:           <none>
    Labels:         app=ks-install
                    pod-template-hash=7d9fb945c7
    Annotations:    <none>
    Status:         Pending
    IP:
    IPs:            <none>
    Controlled By:  ReplicaSet/ks-installer-7d9fb945c7
    Containers:
      installer:
        Image:        kubesphere/ks-installer:v2.1.1
        Port:         <none>
        Host Port:    <none>
        Environment:  <none>
        Mounts:
          /var/run/secrets/kubernetes.io/serviceaccount from ks-installer-token-bvkvn (ro)
    Conditions:
      Type           Status
      PodScheduled   False
    Volumes:
      ks-installer-token-bvkvn:
        Type:        Secret (a volume populated by a Secret)
        SecretName:  ks-installer-token-bvkvn
        Optional:    false
    QoS Class:       BestEffort
    Node-Selectors:  <none>
    Tolerations:     node.kubernetes.io/not-ready:NoExecute for 300s
                     node.kubernetes.io/unreachable:NoExecute for 300s
    Events:
      Type     Reason            Age                 From               Message
      Warning  FailedScheduling  30s (x21 over 28m)  default-scheduler  0/1 nodes are available: 1 node(s) had taints that the pod didn't tolerate.
    [root@k8sphere01 ]#

Doesn't this already state the cause? 0/1 nodes are available: 1 node(s) had taints that the pod didn't tolerate.
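
To see which taints the scheduler is complaining about, the registered nodes and their taints can be listed on the master. This is only a minimal sketch, assuming `kubectl` is configured on that machine; the function name `show_node_taints` is made up for illustration.

```shell
# Hypothetical helper: list every registered node with the keys and
# effects of its taints. An empty TAINT_KEYS column prints as <none>.
show_node_taints() {
  kubectl get nodes \
    -o 'custom-columns=NAME:.metadata.name,TAINT_KEYS:.spec.taints[*].key,EFFECTS:.spec.taints[*].effect'
}

# Usage (on the master):
#   show_node_taints
```

If only the master shows up and it carries a `NoSchedule` taint, that would be consistent with the `0/1 nodes are available` message above.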

And there is also this: Error from server (NotFound): nodes "k8sphere02" not found
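
A quick way to confirm which of the expected workers never registered is to compare the hostnames against what the API server reports. The sketch below assumes `kubectl` works on the master; `missing_nodes` is a hypothetical helper name, and the hostnames passed to it are the ones from this thread.

```shell
# Hypothetical helper: print each expected hostname that is NOT
# registered as a node in the cluster.
missing_nodes() {
  # "kubectl get nodes -o name" prints one line per node, e.g. node/k8sphere01
  registered=$(kubectl get nodes -o name)
  for host in "$@"; do
    # -x: match the whole line, -F: treat the name as a fixed string
    printf '%s\n' "$registered" | grep -qxF "node/$host" || echo "$host"
  done
}

# Usage (on the master):
#   missing_nodes k8sphere01 k8sphere02 k8sphere03
```

Any hostname it prints is one the `kubectl annotate node ...` step would fail on with `NotFound`.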

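    rayzhou2017 I understand that both errors come down to the two nodes not being found, but I've gone through a lot of articles and still can't find what is causing it! How have others who hit this problem solved it?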

    rayzhou2017 This problem has come up several times in my deployments:

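    1. The first time, with two nodes (k8sphere02, k8sphere03), one of them, k8sphere02, could not be found;
    2. The second time, I swapped in a new node (k8sphere03, glusterfs-client), and it worked normally;
    3. Later, after uninstalling everything and reinstalling with 1 master + 2 nodes (k8sphere02, k8sphere03), neither node could be found.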

      Feynman The master node can SSH to the other nodes. Does the SSH requirement mean that every node has to be able to SSH to every other node?
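
Whether mutual SSH is actually required is not settled in this thread, but it is easy to check which hosts a given machine can reach without a password prompt. A minimal sketch, assuming key-based login is what the installer relies on; `check_ssh` is a hypothetical helper name.

```shell
# Hypothetical helper: try a non-interactive SSH login to each host
# and report whether it succeeded.
check_ssh() {
  for host in "$@"; do
    # BatchMode=yes makes ssh fail instead of prompting for a password
    if ssh -o BatchMode=yes -o ConnectTimeout=5 "$host" true 2>/dev/null; then
      echo "$host ok"
    else
      echo "$host FAILED"
    fi
  done
}

# Usage (repeat on each machine to test every direction):
#   check_ssh k8sphere01 k8sphere02 k8sphere03
```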