• Installation & Deployment
  • All-in-One install: etcd port 2379 keeps getting connection refused

[root@ks-allinone scripts]# systemctl status etcd
● etcd.service - Etcd Server
Loaded: loaded (/usr/lib/systemd/system/etcd.service; disabled; vendor preset: disabled)
Active: active (running) since Mon 2019-12-16 14:11:11 CST; 53s ago
Main PID: 8947 (etcd)
Memory: 17.4M
CGroup: /system.slice/etcd.service
└─8947 /usr/bin/etcd --name=default --data-dir=/var/lib/etcd/default.etcd --listen-client-urls=http://localhost:2379

Dec 16 14:11:11 ks-allinone etcd[8947]: raft.node: 8e9e05c52164694d elected leader 8e9e05c52164694d at term 2
Dec 16 14:11:11 ks-allinone etcd[8947]: setting up the initial cluster version to 3.3
Dec 16 14:11:11 ks-allinone etcd[8947]: set the initial cluster version to 3.3
Dec 16 14:11:11 ks-allinone etcd[8947]: enabled capabilities for version 3.3
Dec 16 14:11:11 ks-allinone etcd[8947]: published {Name:default ClientURLs:[http://localhost:2379]} to cluster cdf818194e3a8c32
Dec 16 14:11:11 ks-allinone etcd[8947]: ready to serve client requests
Dec 16 14:11:11 ks-allinone systemd[1]: Started Etcd Server.
Dec 16 14:11:11 ks-allinone etcd[8947]: serving insecure client requests on 127.0.0.1:2379, this is strongly discouraged!
Dec 16 14:11:12 ks-allinone bash[8947]: proto: no coders for int
Dec 16 14:11:12 ks-allinone bash[8947]: proto: no encoder for ValueSize int [GetProperties]
I ran ./uninstall to remove KubeSphere, then reinstalled etcd, and it started successfully. But after running ./install again, the KubeSphere installation still cannot get past port 2379.

Before installing KubeSphere it looked like this:
[root@ks-allinone ]# etcdctl cluster-health
cluster may be unhealthy: failed to list members
Error: client: etcd cluster is unavailable or misconfigured; error #0: malformed HTTP response "\x15\x03\x01\x00\x02\x02"
; error #1: dial tcp 127.0.0.1:4001: getsockopt: connection refused

error #0: malformed HTTP response "\x15\x03\x01\x00\x02\x02"

error #1: dial tcp 127.0.0.1:4001: getsockopt: connection refused
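
The "malformed HTTP response" bytes above are not HTTP at all: they are the start of a TLS record, which typically means a client spoke plain HTTP to a TLS-enabled port. A small sketch decoding them with od (0x15 = TLS alert record type, 0x03 0x01 = record version TLS 1.0, 0x00 0x02 = payload length 2, 0x02 = fatal alert level):

```shell
# Dump the raw bytes from the error message as hex.
# They form a TLS alert record header, not an HTTP response.
printf '\x15\x03\x01\x00\x02\x02' | od -An -tx1
```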

Midway through the KubeSphere install:
[root@ks-allinone ]# etcdctl cluster-health
cluster may be unhealthy: failed to list members
Error: client: etcd cluster is unavailable or misconfigured; error #0: dial tcp 127.0.0.1:4001: getsockopt: connection refused
; error #1: dial tcp 127.0.0.1:2379: getsockopt: connection refused

error #0: dial tcp 127.0.0.1:4001: getsockopt: connection refused
error #1: dial tcp 127.0.0.1:2379: getsockopt: connection refused
RUNNING HANDLER [etcd : wait for etcd up] *********************************************************************************************
Monday 16 December 2019 14:24:06 +0800 (0:00:02.726) 0:04:11.915 *******
FAILED - RETRYING: wait for etcd up (60 retries left).

FAILED - RETRYING: wait for etcd up (1 retries left).
fatal: [ks-allinone]: FAILED! => {
"attempts": 60,
"changed": false,
"content": "",
"redirected": false,
"status": -1,
"url": "https://172.18.248.238:2379/health"
}

MSG:

Status code was -1 and not [200]: Request failed: <urlopen error [Errno 111] Connection refused>
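
The installer's "wait for etcd up" task is an HTTPS GET against /health, so it can be reproduced by hand. A sketch, where probe_health is a hypothetical helper, the URL is the node IP from the log above, and -k skips certificate verification (for a quick connectivity test only):

```shell
# Minimal stand-in for the installer's health probe.
probe_health() { curl -sk "$1"; }

# On the node, run e.g.:
#   probe_health https://172.18.248.238:2379/health
# A healthy etcd returns {"health": "true"}; "connection refused"
# means nothing is listening on 2379 at all.
```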

Earlier runs always failed at the etcd health check. On one run it got past that step, and restart etcd passed as well, but then port 2379 was refused again during reload etcd.

`TASK [etcd : Configure | Check if etcd cluster is healthy] ****************************************************************************
Monday 16 December 2019 14:43:42 +0800 (0:00:00.197) 0:04:11.017 *******
FAILED - RETRYING: Configure | Check if etcd cluster is healthy (4 retries left).
FAILED - RETRYING: Configure | Check if etcd cluster is healthy (3 retries left).
FAILED - RETRYING: Configure | Check if etcd cluster is healthy (2 retries left).
FAILED - RETRYING: Configure | Check if etcd cluster is healthy (1 retries left).
fatal: [ks-allinone -> ks-allinone]: FAILED! => {
"attempts": 4,
"changed": false,
"cmd": "/usr/local/bin/etcdctl --no-sync --endpoints=https://172.18.248.238:2379 cluster-health | grep -q 'cluster is healthy'",
"delta": "0:00:00.240749",
"end": "2019-12-16 14:43:57.047583",
"rc": 1,
"start": "2019-12-16 14:43:56.806834"
}

STDERR:

Error: client: etcd cluster is unavailable or misconfigured; error #0: EOF

error #0: EOF

MSG:

non-zero return code`

[root@ks-allinone etcd]# etcdctl cluster-health
member 8e9e05c52164694d is healthy: got healthy result from http://localhost:2379
cluster is healthy

    Forest-L
    In the reopened window, make sure the firewall is turned off. What kind of machine are you installing on?

    Forest-L I've uninstalled and reinstalled several times now. The current problem: `etcdctl cluster-health` reports "cluster is healthy", but the installer's etcd health validation says it is unhealthy, with exactly the errors shown above: only an EOF.
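
One likely source of this discrepancy: with no --endpoints flag, etcdctl (v2 API) defaults to http://127.0.0.1:2379 and the legacy http://127.0.0.1:4001, so a bare `etcdctl cluster-health` only exercises the localhost HTTP listener, while the installer probes https://<node-ip>:2379 with client certs and greps the output. A sketch of the grep half of the installer's check, using the healthy output quoted above as sample input:

```shell
# Sample cluster-health output, as pasted earlier in this thread.
health_output='member 8e9e05c52164694d is healthy: got healthy result from http://localhost:2379
cluster is healthy'

# The installer pipes the real cluster-health output through this grep.
printf '%s\n' "$health_output" | grep -q 'cluster is healthy' && echo PASS || echo FAIL
```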

      Forest-L That image from this morning just would not download, so I imported it from another machine, but the installer is still stuck there and does not continue. Do I need to reinstall?
      [root@ks-allinone logs]# docker images | grep cni
      calico/cni v3.7.3 1a6ade52d471 6 months ago 135MB

      TASK [download : download_container | Download image if required ( calico/cni:v3.7.3 )] ***********************************************
      Tuesday 17 December 2019 11:46:10 +0800 (0:00:00.219) 0:22:46.205 ******
      FAILED - RETRYING: download_container | Download image if required ( calico/cni:v3.7.3 ) (4 retries left).

          Forest-L Thanks for your help this morning. After reinstalling, port 2379 is still refused. This machine may just be in too messy a state, so I've asked our server team to provision a fresh one and will retry once it's ready. Does the platform's all-in-one mode have any special machine requirements, such as a separately mounted disk of a certain size? The current box has 8 cores, 32 GB RAM, and an 80 GB disk.

          Forest-L [root@ks-allinone ~]# df -hT
          Filesystem Type Size Used Avail Use% Mounted on
          /dev/mapper/centos-root ext4 127G 5.4G 116G 5% /
          devtmpfs devtmpfs 16G 0 16G 0% /dev
          tmpfs tmpfs 16G 264K 16G 1% /dev/shm
          tmpfs tmpfs 16G 8.9M 16G 1% /run
          tmpfs tmpfs 16G 0 16G 0% /sys/fs/cgroup
          /dev/sda1 ext4 190M 147M 30M 84% /boot
          /dev/mapper/centos-home ext4 5.6G 24M 5.3G 1% /home
          tmpfs tmpfs 3.2G 12K 3.2G 1% /run/user/0

          My disk is configured now and I've enlarged the root partition. The install is running again but is still stuck on the Docker image step; disk usage grew by only about 2 MB in half an hour, so the download may simply be very slow. Thanks for the guidance.

          1 year later

          Forest-L After running the step below, my etcd will not start at all anymore. What should I do?

          systemctl stop etcd.service && systemctl disable etcd.service && rm /var/lib/etcd -rf
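
Note that wiping /var/lib/etcd alone can leave etcd unable to start: a member whose config says --initial-cluster-state=existing cannot bootstrap from an empty data dir. A dry-run sketch of a fuller cleanup before re-running the installer; the run() wrapper only prints each command (remove it to execute for real), and the /etc/etcd.env and /etc/ssl/etcd paths are assumptions based on the installer's conventions and may differ on your node:

```shell
# Dry-run wrapper: prints the command instead of executing it.
run() { echo "+ $*"; }

run systemctl stop etcd
run systemctl disable etcd
# Data dir, env file, and certs (paths assumed - verify before deleting).
run rm -rf /var/lib/etcd /etc/etcd.env /etc/ssl/etcd
run systemctl daemon-reload
```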

          Forest-L Error output:

          ...... ##### (omitted)
          [master3] Downloading image: harbor.dockerregistry.local/calico/node:v3.15.1
          [master1] Downloading image: harbor.dockerregistry.local/calico/node:v3.15.1
          [master3] Downloading image: harbor.dockerregistry.local/calico/pod2daemon-flexvol:v3.15.1
          [master2] Downloading image: harbor.dockerregistry.local/calico/pod2daemon-flexvol:v3.15.1
          [master1] Downloading image: harbor.dockerregistry.local/calico/pod2daemon-flexvol:v3.15.1
          INFO[22:33:43 CST] Generating etcd certs
          INFO[22:33:49 CST] Synchronizing etcd certs
          INFO[22:33:49 CST] Creating etcd service
          [master1 10.3.1.16] MSG:
          Created symlink from /etc/systemd/system/multi-user.target.wants/etcd.service to /etc/systemd/system/etcd.service.
          [master2 10.3.1.17] MSG:
          Created symlink from /etc/systemd/system/multi-user.target.wants/etcd.service to /etc/systemd/system/etcd.service.
          [master3 10.3.1.18] MSG:
          Created symlink from /etc/systemd/system/multi-user.target.wants/etcd.service to /etc/systemd/system/etcd.service.
          INFO[22:33:54 CST] Starting etcd cluster
          [master1 10.3.1.16] MSG:
          Configuration file already exists
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          2 months later

          all-in-one install on a single machine:
          `[centos8] Downloading image: calico/pod2daemon-flexvol:v3.16.3
          INFO[15:08:55 HKT] Generating etcd certs
          INFO[15:08:57 HKT] Synchronizing etcd certs
          INFO[15:08:57 HKT] Creating etcd service
          [centos8 192.168.31.79] MSG:
          etcd already exists
          INFO[15:09:19 HKT] Starting etcd cluster
          [centos8 192.168.31.79] MSG:
          Configuration file already exists
          [centos8 192.168.31.79] MSG:
          v3.4.13
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          Waiting for etcd to start
          WARN[15:10:57 HKT] Task failed ...
          WARN[15:10:57 HKT] error: Failed to start etcd cluster: Failed to exec command: sudo -E /bin/sh -c "export ETCDCTL_API=2;export ETCDCTL_CERT_FILE='/etc/ssl/etcd/ssl/admin-centos8.pem';export ETCDCTL_KEY_FILE='/etc/ssl/etcd/ssl/admin-centos8-key.pem';export ETCDCTL_CA_FILE='/etc/ssl/etcd/ssl/ca.pem';/usr/local/bin/etcdctl --endpoints=https://192.168.31.79:2379 cluster-health | grep -q 'cluster is healthy'"
          Error: client: etcd cluster is unavailable or misconfigured; error #0: dial tcp 192.168.31.79:2379: connect: connection refused

          error #0: dial tcp 192.168.31.79:2379: connect: connection refused: Process exited with status 1
          Error: Failed to start etcd cluster: Failed to start etcd cluster: Failed to exec command: sudo -E /bin/sh -c "export ETCDCTL_API=2;export ETCDCTL_CERT_FILE='/etc/ssl/etcd/ssl/admin-centos8.pem';export ETCDCTL_KEY_FILE='/etc/ssl/etcd/ssl/admin-centos8-key.pem';export ETCDCTL_CA_FILE='/etc/ssl/etcd/ssl/ca.pem';/usr/local/bin/etcdctl --endpoints=https://192.168.31.79:2379 cluster-health | grep -q 'cluster is healthy'"
          Error: client: etcd cluster is unavailable or misconfigured; error #0: dial tcp 192.168.31.79:2379: connect: connection refused

          error #0: dial tcp 192.168.31.79:2379: connect: connection refused: Process exited with status 1
          Usage:
          kk create cluster [flags]

          Flags:
          --download-cmd string The user defined command to download the necessary binary files. The first param '%s' is output path, the second param '%s', is the URL (default "curl -L -o %s %s")
          -f, --filename string Path to a configuration file
          -h, --help help for cluster
          --skip-pull-images Skip pre pull images
          --with-kubernetes string Specify a supported version of kubernetes (default "v1.19.8")
          --with-kubesphere Deploy a specific version of kubesphere (default v3.1.0)
          --with-local-storage Deploy a local PV provisioner
          -y, --yes Skip pre-check of the installation

          Global Flags:
          --debug Print detailed information (default true)
          --in-cluster Running inside the cluster

          Failed to start etcd cluster: Failed to start etcd cluster: Failed to exec command: sudo -E /bin/sh -c "export ETCDCTL_API=2;export ETCDCTL_CERT_FILE='/etc/ssl/etcd/ssl/admin-centos8.pem';export ETCDCTL_KEY_FILE='/etc/ssl/etcd/ssl/admin-centos8-key.pem';export ETCDCTL_CA_FILE='/etc/ssl/etcd/ssl/ca.pem';/usr/local/bin/etcdctl --endpoints=https://192.168.31.79:2379 cluster-health | grep -q 'cluster is healthy'"
          Error: client: etcd cluster is unavailable or misconfigured; error #0: dial tcp 192.168.31.79:2379: connect: connection refused

          error #0: dial tcp 192.168.31.79:2379: connect: connection refused: Process exited with status 1`
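
The failing health check, reconstructed from the error message above so it can be run by hand on the node (API v2; cert paths and endpoint are taken verbatim from the log). The etcdctl line is left commented since it only makes sense on the node itself:

```shell
# Same environment the installer sets up before its health check.
export ETCDCTL_API=2
export ETCDCTL_CERT_FILE=/etc/ssl/etcd/ssl/admin-centos8.pem
export ETCDCTL_KEY_FILE=/etc/ssl/etcd/ssl/admin-centos8-key.pem
export ETCDCTL_CA_FILE=/etc/ssl/etcd/ssl/ca.pem

# On the node, run:
#   /usr/local/bin/etcdctl --endpoints=https://192.168.31.79:2379 cluster-health
# If the connection is refused, check whether anything is listening and
# why etcd may have exited, e.g. `ss -tlnp | grep 2379` and
# `journalctl -u etcd --no-pager -n 50`.
```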