The error message is as follows:
rpc error: code = Unavailable desc = all SubConns are in TransientFailure, latest connection error: connection error: desc = "transport: Error while dialing dial tcp 10.243.93.210:9104: connect: connection refused"
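
To see which component the error points at, the address in the message can be matched against the containerPort list in the Deployment posted further down in this thread. This is only a minimal Python sketch that assumes the message format shown above. The helper names `dial_target` and `port_names` and the `PORTS` table are illustrative and not part of OpenPitrix.

```python
import re

# (name, containerPort) pairs copied from the Deployment posted in this thread.
PORTS = [
    ("api-gateway", 9100), ("app-manager", 9102), ("cateogory-mgr", 9108),
    ("attachment-mgr", 9122), ("cluster-mgr", 9104), ("isv-mgr", 9118),
    ("job-mgr", 9106), ("repo-mgr", 9101), ("repo-indexer", 9108),
    ("rp-mgr", 9121), ("runtime-mgr", 9103), ("task-mgr", 9107),
]

def dial_target(message):
    """Return (host, port) from a 'dial tcp HOST:PORT' fragment, or None."""
    match = re.search(r"dial tcp (\d+\.\d+\.\d+\.\d+):(\d+)", message)
    if match is None:
        return None
    return match.group(1), int(match.group(2))

def port_names(port):
    """Return every port name in PORTS that uses the given port number."""
    return [name for name, number in PORTS if number == port]

error = (
    "rpc error: code = Unavailable desc = all SubConns are in TransientFailure, "
    "latest connection error: connection error: desc = \"transport: Error while "
    "dialing dial tcp 10.243.93.210:9104: connect: connection refused\""
)

host, port = dial_target(error)
print(host, port, port_names(port))
```

For the message above this gives port 9104, which the manifest lists as cluster-mgr. "connection refused" usually means that nothing was accepting connections on that port at the time.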




Re-running the task also produces an error message.



Could you please take a look at how to resolve this?

Feynman I took a look, and the problem appears to be in this pod.

Pod startup log:

The ports that should be listening, or the services that should be running, did not start successfully.

Because the services in this pod failed to start, the job and another pod cannot start properly either.

The configuration is as follows; it has hardly been modified:

kind: Deployment
apiVersion: apps/v1
metadata:
  name: openpitrix-hyperpitrix-deployment
  namespace: openpitrix-system
  labels:
    app: openpitrix
    component: openpitrix-hyperpitrix
    version: v0.4.8
  annotations:
    deployment.kubernetes.io/revision: '3'
spec:
  replicas: 1
  selector:
    matchLabels:
      app: openpitrix
      component: openpitrix-hyperpitrix
  template:
    metadata:
      creationTimestamp: null
      labels:
        app: openpitrix
        component: openpitrix-hyperpitrix
        version: v0.4.8
    spec:
      initContainers:
        - name: wait-mysql
          image: 'alpine:3.10.4'
          command:
            - sh
            - '-c'
            - >-
              until nc -z mysql.kubesphere-system.svc 3306; do echo "waiting for
              mysql"; sleep 2; done;
          resources: {}
          terminationMessagePath: /dev/termination-log
          terminationMessagePolicy: File
          imagePullPolicy: IfNotPresent
        - name: wait-etcd
          image: 'alpine:3.10.4'
          command:
            - sh
            - '-c'
            - >-
              until nc -z etcd.kubesphere-system.svc 2379; do echo "waiting for
              etcd"; sleep 2; done;
          resources: {}
          terminationMessagePath: /dev/termination-log
          terminationMessagePolicy: File
          imagePullPolicy: IfNotPresent
        - name: openpitrix-db-init
          image: 'openpitrix/openpitrix:flyway-v0.4.8'
          command:
            - /flyway/sql/ddl/ddl_init.sh
            - '-hmysql.kubesphere-system.svc'
            - '-uroot'
            - '--connect-timeout=5'
          env:
            - name: PASSWORD
              value: password
          resources: {}
          terminationMessagePath: /dev/termination-log
          terminationMessagePolicy: File
          imagePullPolicy: IfNotPresent
        - name: openpitrix-app-db-ctrl
          image: 'openpitrix/openpitrix:flyway-v0.4.8'
          command:
            - /flyway/flyway
            - '-X'
            - '-url=jdbc:mysql://mysql.kubesphere-system.svc/app'
            - '-user=root'
            - '-validateOnMigrate=false'
            - '-locations=filesystem:/flyway/sql/app'
            - migrate
          env:
            - name: FLYWAY_PASSWORD
              value: password
          resources: {}
          terminationMessagePath: /dev/termination-log
          terminationMessagePolicy: File
          imagePullPolicy: IfNotPresent
        - name: openpitrix-attachment-db-ctrl
          image: 'openpitrix/openpitrix:flyway-v0.4.8'
          command:
            - /flyway/flyway
            - '-X'
            - '-url=jdbc:mysql://mysql.kubesphere-system.svc/attachment'
            - '-user=root'
            - '-validateOnMigrate=false'
            - '-locations=filesystem:/flyway/sql/attachment'
            - migrate
          env:
            - name: FLYWAY_PASSWORD
              value: password
          resources: {}
          terminationMessagePath: /dev/termination-log
          terminationMessagePolicy: File
          imagePullPolicy: IfNotPresent
        - name: openpitrix-cluster-db-ctrl
          image: 'openpitrix/openpitrix:flyway-v0.4.8'
          command:
            - /flyway/flyway
            - '-X'
            - '-url=jdbc:mysql://mysql.kubesphere-system.svc/cluster'
            - '-user=root'
            - '-validateOnMigrate=false'
            - '-locations=filesystem:/flyway/sql/cluster'
            - migrate
          env:
            - name: FLYWAY_PASSWORD
              value: password
          resources: {}
          terminationMessagePath: /dev/termination-log
          terminationMessagePolicy: File
          imagePullPolicy: IfNotPresent
        - name: openpitrix-isv-db-ctrl
          image: 'openpitrix/openpitrix:flyway-v0.4.8'
          command:
            - /flyway/flyway
            - '-X'
            - '-url=jdbc:mysql://mysql.kubesphere-system.svc/isv'
            - '-user=root'
            - '-validateOnMigrate=false'
            - '-locations=filesystem:/flyway/sql/isv'
            - migrate
          env:
            - name: FLYWAY_PASSWORD
              value: password
          resources: {}
          terminationMessagePath: /dev/termination-log
          terminationMessagePolicy: File
          imagePullPolicy: IfNotPresent
        - name: openpitrix-job-db-ctrl
          image: 'openpitrix/openpitrix:flyway-v0.4.8'
          command:
            - /flyway/flyway
            - '-X'
            - '-url=jdbc:mysql://mysql.kubesphere-system.svc/job'
            - '-user=root'
            - '-validateOnMigrate=false'
            - '-locations=filesystem:/flyway/sql/job'
            - migrate
          env:
            - name: FLYWAY_PASSWORD
              value: password
          resources: {}
          terminationMessagePath: /dev/termination-log
          terminationMessagePolicy: File
          imagePullPolicy: IfNotPresent
        - name: openpitrix-repo-db-ctrl
          image: 'openpitrix/openpitrix:flyway-v0.4.8'
          command:
            - /flyway/flyway
            - '-X'
            - '-url=jdbc:mysql://mysql.kubesphere-system.svc/repo'
            - '-user=root'
            - '-validateOnMigrate=false'
            - '-locations=filesystem:/flyway/sql/repo'
            - migrate
          env:
            - name: FLYWAY_PASSWORD
              value: password
          resources: {}
          terminationMessagePath: /dev/termination-log
          terminationMessagePolicy: File
          imagePullPolicy: IfNotPresent
        - name: openpitrix-runtime-db-ctrl
          image: 'openpitrix/openpitrix:flyway-v0.4.8'
          command:
            - /flyway/flyway
            - '-X'
            - '-url=jdbc:mysql://mysql.kubesphere-system.svc/runtime'
            - '-user=root'
            - '-validateOnMigrate=false'
            - '-locations=filesystem:/flyway/sql/runtime'
            - migrate
          env:
            - name: FLYWAY_PASSWORD
              value: password
          resources: {}
          terminationMessagePath: /dev/termination-log
          terminationMessagePolicy: File
          imagePullPolicy: IfNotPresent
        - name: openpitrix-task-db-ctrl
          image: 'openpitrix/openpitrix:flyway-v0.4.8'
          command:
            - /flyway/flyway
            - '-X'
            - '-url=jdbc:mysql://mysql.kubesphere-system.svc/task'
            - '-user=root'
            - '-validateOnMigrate=false'
            - '-locations=filesystem:/flyway/sql/task'
            - migrate
          env:
            - name: FLYWAY_PASSWORD
              value: password
          resources: {}
          terminationMessagePath: /dev/termination-log
          terminationMessagePolicy: File
          imagePullPolicy: IfNotPresent
      containers:
        - name: hyperpitrix
          image: 'openpitrix/openpitrix:v0.4.8'
          command:
            - hyperpitrix
          ports:
            - name: api-gateway
              containerPort: 9100
              protocol: TCP
            - name: app-manager
              containerPort: 9102
              protocol: TCP
            - name: cateogory-mgr
              containerPort: 9108
              protocol: TCP
            - name: attachment-mgr
              containerPort: 9122
              protocol: TCP
            - name: cluster-mgr
              containerPort: 9104
              protocol: TCP
            - name: isv-mgr
              containerPort: 9118
              protocol: TCP
            - name: job-mgr
              containerPort: 9106
              protocol: TCP
            - name: repo-mgr
              containerPort: 9101
              protocol: TCP
            - name: repo-indexer
              containerPort: 9108
              protocol: TCP
            - name: rp-mgr
              containerPort: 9121
              protocol: TCP
            - name: runtime-mgr
              containerPort: 9103
              protocol: TCP
            - name: task-mgr
              containerPort: 9107
              protocol: TCP
          env:
            - name: OPENPITRIX_ATTACHMENT_BUCKET_NAME
              value: openpitrix-attachment
            - name: OPENPITRIX_ATTACHMENT_ENDPOINT
              value: 'http://minio.kubesphere-system.svc:9000'
            - name: OPENPITRIX_ETCD_ENDPOINTS
              value: 'etcd.kubesphere-system.svc:2379'
            - name: OPENPITRIX_GRPC_SHOW_ERROR_CAUSE
              value: 'true'
            - name: OPENPITRIX_LOG_LEVEL
              value: debug
            - name: OPENPITRIX_MYSQL_HOST
              value: mysql.kubesphere-system.svc
            - name: OPENPITRIX_MYSQL_PASSWORD
              value: password
          resources:
            limits:
              cpu: 500m
              memory: 500Mi
            requests:
              cpu: 100m
              memory: 100Mi
          terminationMessagePath: /dev/termination-log
          terminationMessagePolicy: File
          imagePullPolicy: IfNotPresent
      restartPolicy: Always
      terminationGracePeriodSeconds: 30
      dnsPolicy: ClusterFirst
      securityContext: {}
      schedulerName: default-scheduler
      hostAliases:
        - ip: 127.0.0.1
          hostnames:
            - openpitrix-task-manager
            - openpitrix-runtime-manager
            - openpitrix-repo-indexer
            - openpitrix-repo-manager
            - openpitrix-job-manager
            - openpitrix-isv-manager
            - openpitrix-cluster-manager
            - openpitrix-attachment-manager
            - openpitrix-category-manager
            - openpitrix-app-manager
            - openpitrix-api-gateway
            - openpitrix-rp-manager
  strategy:
    type: RollingUpdate
    rollingUpdate:
      maxUnavailable: 25%
      maxSurge: 25%
  revisionHistoryLimit: 10
  progressDeadlineSeconds: 600
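
One quick consistency check on a manifest like this is to look for a containerPort number that appears under more than one name. The sketch below hard-codes the ports of the hyperpitrix container above, and `duplicate_ports` is a hypothetical helper. It only flags entries worth a second look and is not presented as the cause of the failure.

```python
from collections import defaultdict

# (name, containerPort) pairs copied from the hyperpitrix container above.
CONTAINER_PORTS = [
    ("api-gateway", 9100), ("app-manager", 9102), ("cateogory-mgr", 9108),
    ("attachment-mgr", 9122), ("cluster-mgr", 9104), ("isv-mgr", 9118),
    ("job-mgr", 9106), ("repo-mgr", 9101), ("repo-indexer", 9108),
    ("rp-mgr", 9121), ("runtime-mgr", 9103), ("task-mgr", 9107),
]

def duplicate_ports(ports):
    """Map each port number used by more than one entry to the names using it."""
    by_number = defaultdict(list)
    for name, number in ports:
        by_number[number].append(name)
    return {number: names for number, names in by_number.items() if len(names) > 1}

print(duplicate_ports(CONTAINER_PORTS))
```

In this manifest, 9108 is listed under both cateogory-mgr and repo-indexer. Whether that has anything to do with the startup failure is an open question, since the thread does not say which port each component actually binds.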

How should this problem be resolved?
I read the post you just sent me. Is redeploying the App Store the only way to fix it?

    zhangpfly I tested this: the services it needs to call (mysql, etcd, and minio) are also working normally.

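    If reinstalling the App Store service is the only option, will the services previously deployed through the App Store be affected? Can those existing services be managed through the newly installed App Store?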

    No need to reinstall. Run kubectl get svc -n openpitrix-system and check whether the services listed there are all healthy. Right now it looks like the job cannot find app-manager.

      zheng1
      The services look normal, but telnet cannot connect to them.

      Inside the pod backing the service, I can see that the corresponding ports and services have not started.
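
The reachability check described here (telnet against each port) can also be scripted. This is only a rough sketch, assuming Python 3 is available wherever it runs (the container image may not ship it). The function names `port_open` and `closed_ports` are made up for illustration, and the port list is taken from the containerPort entries in the Deployment posted in this thread.

```python
import socket

def port_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and unreachable hosts.
        return False

def closed_ports(host, ports, timeout=2.0):
    """Return the subset of ports on host that do not accept connections."""
    return [port for port in ports if not port_open(host, port, timeout)]

if __name__ == "__main__":
    # containerPort values from the hyperpitrix container in this thread.
    ports = [9100, 9101, 9102, 9103, 9104, 9106, 9107, 9108, 9118, 9121, 9122]
    # 127.0.0.1 assumes the script runs inside the pod; from elsewhere,
    # substitute the pod IP or the service address.
    for port in closed_ports("127.0.0.1", ports):
        print(f"nothing accepting connections on port {port}")
```

Running it once from inside the pod and once against the service address helps separate "the process never bound the port" from "the service is not routing to the pod".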

      I deleted the pod to force a restart several times, but the services still fail to start.
      How can this be fixed?