• 监控日志
  • dashbord面板node-exporter显示未发现监控数据

操作系统信息
CentOS Linux release 7.9.2009 (Core) 物理机

Kubernetes版本信息
v1.22.10

容器运行时
docker

KubeSphere版本信息
3.3.0

问题是新加入的几个work节点,其中有两台,未发现监控数据,其他正常。

kubectl get pods -nkubesphere-monitoring-system pod运行正常

NAME READY STATUS RESTARTS AGE

alertmanager-main-0 2/2 Running 0 19m

alertmanager-main-1 2/2 Running 0 19m

alertmanager-main-2 2/2 Running 0 19m

kube-state-metrics-7bdc7484cf-7wmjj 3/3 Running 0 19m

node-exporter-4m9lq 2/2 Running 0 19m

node-exporter-6b7cr 2/2 Running 0 19m

node-exporter-6gqlf 2/2 Running 0 19m

node-exporter-859m8 2/2 Running 0 19m

node-exporter-b6n25 2/2 Running 0 19m

node-exporter-dwqs7 2/2 Running 0 19m

node-exporter-jd9×8 2/2 Running 0 19m

node-exporter-jftz5 2/2 Running 0 19m

node-exporter-jskqj 2/2 Running 0 19m

node-exporter-kfjlh 2/2 Running 0 19m

node-exporter-pqcg8 2/2 Running 0 19m

node-exporter-sqb9b 2/2 Running 0 19m

node-exporter-vcbmh 2/2 Running 0 19m

node-exporter-vh2d8 2/2 Running 0 19m

node-exporter-wvxjg 2/2 Running 0 19m

node-exporter-zrp5m 2/2 Running 0 19m

notification-manager-deployment-78664576cb-sl2fw 2/2 Running 0 19m

notification-manager-deployment-78664576cb-w2j26 2/2 Running 0 19m

notification-manager-operator-7d44854f54-9shvz 2/2 Running 0 19m

prometheus-k8s-0 2/2 Running 0 19m

prometheus-k8s-1 2/2 Running 0 19m

prometheus-operator-8955bbd98-j4nbw 2/2 Running 0 19m

thanos-ruler-kubesphere-0 2/2 Running 0 19m

thanos-ruler-kubesphere-1 2/2 Running 0 19m

    shouziws wangyu1717
    这个页面的CPU、内存的监控是对当前这个Pod 的监控数据,而不是node-exporter 没有采集到监控数据。可以先查看Prometheus targets 页面,看看是否有不健康的采集项,针对进一步排查。

    wangyu1717

    可以继续排查确认下是某个node-exporter 的一个pod的容器监控缺失,还是某个节点的全部Pod 监控数据缺失,定位是某个Pod的特例或者是 采集节点Kubelet 的metrics 出现了问题。

      3 个月 后

      frezes

      大佬我的exporter总是异常down。重启就好了,不一会又down了,看日志有这个报错, 不知道怎么解决

        2 年 后