• DevOps
  • DevOps pipeline not running

Jason If devops.password in devops-config already has the correct x.y.z format, there is no need to restart devops-controller-manager.

    8 days later

    zhaojun-xj The YAML error occurs because the token must be base64-encoded. Encode the password and it will work, for example:

    echo -n "password" | base64
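When encoding a value for a Secret, it is worth making sure no trailing newline sneaks in: `echo` without `-n` appends one, and that newline then becomes part of the stored password. The sketch below is a minimal illustration; the helper names `encode_secret` and `decode_secret` are made up for this example and are not part of KubeSphere.

```shell
#!/bin/sh
# Hypothetical helpers (names are illustrative only).

# Encode a value for a Secret's `data` field without a trailing newline.
encode_secret() {
  # printf '%s' writes the argument exactly as given; tr removes the line
  # breaks that base64 adds to its own output.
  printf '%s' "$1" | base64 | tr -d '\n'
}

# Decode a base64 value to check what is really stored.
decode_secret() {
  printf '%s' "$1" | base64 --decode
}

echo "$(encode_secret 'password')"   # prints cGFzc3dvcmQ=
echo 'password' | base64             # prints cGFzc3dvcmQK (newline was encoded too)
```

Decoding the value already stored in the Secret with `decode_secret` is a quick way to see whether it matches the password you expect.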

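    25 days later

    Same problem here: pipelines cannot run and pipeline projects cannot be created, while S2I builds work fine. After following the three steps above it still fails. Can I base64-encode the Jenkins login account's password and use that to replace the password?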

    W0531 09:24:18.126595 1 jwt.go:53] jwt: token is expired by 16h28m30s
    W0531 09:24:18.126670 1 jwt.go:53] jwt: token is expired by 16h28m30s
    W0531 09:24:18.126815 1 jwt.go:53] jwt: token is expired by 16h28m30s
    E0531 09:24:21.949083 1 token.go:143] dial tcp 10.43.147.75:6379: connect: connection refused
    E0531 09:24:21.949110 1 token.go:100] dial tcp 10.43.147.75:6379: connect: connection refused
    I0531 09:24:21.949155 1 apiserver.go:599] ::ffff:10.42.34.128 - "POST /oauth/token HTTP/1.1" 500 109 1118ms
    E0531 09:24:24.014172 1 token.go:143] dial tcp 10.43.147.75:6379: connect: connection refused
    E0531 09:24:24.014210 1 token.go:100] dial tcp 10.43.147.75:6379: connect: connection refused
    I0531 09:24:24.014278 1 apiserver.go:599] ::ffff:10.42.34.128 - "POST /oauth/token HTTP/1.1" 500 109 1143ms
    E0531 09:24:26.497278 1 token.go:143] dial tcp 10.43.147.75:6379: connect: connection refused
    E0531 09:24:26.497315 1 token.go:100] dial tcp 10.43.147.75:6379: connect: connection refused
    I0531 09:24:26.497373 1 apiserver.go:599] ::ffff:10.42.34.128 - "POST /oauth/token HTTP/1.1" 500 109 1091ms
    W0531 09:26:01.175135 1 jwt.go:53] jwt: token is expired by 16h36m53s
    W0531 09:26:01.175167 1 jwt.go:53] jwt: token is expired by 16h36m53s
    W0531 09:26:01.175296 1 jwt.go:53] jwt: token is expired by 16h36m53s
    E0531 09:26:06.557269 1 token.go:143] dial tcp 10.43.147.75:6379: connect: connection refused
    E0531 09:26:06.557312 1 token.go:100] dial tcp 10.43.147.75:6379: connect: connection refused
    I0531 09:26:06.557384 1 apiserver.go:599] ::ffff:10.42.34.128 - "POST /oauth/token HTTP/1.1" 500 109 1150ms
    E0531 09:26:08.965813 1 token.go:143] dial tcp 10.43.147.75:6379: connect: connection refused
    E0531 09:26:08.965877 1 token.go:100] dial tcp 10.43.147.75:6379: connect: connection refused
    I0531 09:26:08.965992 1 apiserver.go:599] ::ffff:10.42.34.128 - "POST /oauth/token HTTP/1.1" 500 109 1142ms
    E0531 09:29:00.769422 1 token.go:143] dial tcp 10.43.147.75:6379: connect: connection refused
    E0531 09:29:00.769479 1 token.go:100] dial tcp 10.43.1
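A note on the log above: the refused connections that are visible all target port 6379, which is the conventional Redis port, so it may be worth confirming that the Redis service used for token storage is reachable before changing any tokens. This is an inference from the port number, not something the log states. The sketch below summarises which endpoints refuse connections; the function name `refused_endpoints` and the file name in the usage comment are hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: read log text on stdin and print each refused
# "host:port" endpoint followed by how many times it appears.
refused_endpoints() {
  sed -n 's/.*dial tcp \([0-9.]*:[0-9]*\): connect: connection refused.*/\1/p' \
    | sort | uniq -c | awk '{ print $2, $1 }'
}

# Usage (apiserver.log is an assumed file holding the saved log):
#   refused_endpoints < apiserver.log
```

If a single endpoint accounts for every refusal, the service behind it is the first thing to check.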

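      igor Is this log from ks-apiserver? Is it the latest one?

      • igor replied to this post

        What is this password used for? It does not seem to be the login password either.

        I base64-decoded the token configured in kubesphere-secret and compared it: it does already match the password in jenkins-config.

        2 months later

        I followed the three steps from the original post, but the pipeline still does not run.

        walle I changed the token and the password following the steps, but the pipeline still does not run. Same situation as yours.

          13 days later

          zhangzl419 If the pipeline still does not run after the three steps above, please try restarting the devops-controller service. If that does not help either, check the devops-controller logs for related errors.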

            7 days later

            yudong

            I ran into a similar problem:
            1. I first noticed the pipeline showing "Not running" and checked the devops-jenkins log:

            2023-08-21 03:07:25.422+0000 [id=20295] WARNING i.k.j.d.a.KubesphereApiTokenAuthenticator#authenticate: API token matched for user liding2 but the impersonation failed
            org.acegisecurity.userdetails.UsernameNotFoundException: User liding2 not found in directory.
                at org.acegisecurity.ldap.search.FilterBasedLdapUserSearch.searchForUser(FilterBasedLdapUserSearch.java:126)
                at hudson.security.LDAPSecurityRealm$LDAPUserDetailsService.loadUserByUsername(LDAPSecurityRealm.java:1320)
                at hudson.security.LDAPSecurityRealm$LDAPUserDetailsService.loadUserByUsername(LDAPSecurityRealm.java:1273)
                at org.acegisecurity.userdetails.UserDetailsService.lambda$toSpring$1(UserDetailsService.java:52)
            Caused: org.springframework.security.core.userdetails.UsernameNotFoundException
                at org.acegisecurity.userdetails.UsernameNotFoundException.toSpring(UsernameNotFoundException.java:51)
                at org.acegisecurity.userdetails.UsernameNotFoundException.toSpring(UsernameNotFoundException.java:34)
                at org.acegisecurity.userdetails.UserDetailsService.lambda$toSpring$1(UserDetailsService.java:54)
                at jenkins.security.ImpersonatingUserDetailsService2.loadUserByUsername(ImpersonatingUserDetailsService2.java:29)
                at hudson.model.User.getUserDetailsForImpersonation2(User.java:406)
                at hudson.model.User.getUserDetailsForImpersonation(User.java:429)
            Caused: org.acegisecurity.userdetails.UsernameNotFoundException: org.springframework.security.core.userdetails.UsernameNotFoundException: org.acegisecurity.userdetails.UsernameNotFoundException: User liding2 not found in directory.; nested exception is org.springframework.security.core.userdetails.UsernameNotFoundException: org.acegisecurity.userdetails.UsernameNotFoundException: User liding2 not found in directory.

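            2. I followed the three steps you described earlier and the pipeline ran successfully again, but a while later it was back to "Not running".
            3. Checked role-strategy/assign-roles in Jenkins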


            4. Checked the devops-controller log:
            E0821 03:07:25.426300 1 pipelinerun_controller.go:205] pipelinerun-controller "msg"="unable to run pipeline" "error"="unexpected status code: 500" "Pipeline"="elf" "PipelineRun"={"Namespace":"tools6rwrl","Name":"elf-gqmc7"} "namespace"="tools6rwrl" "pipeline"="elf"
            E0821 03:07:25.426407 1 controller.go:326] "msg"="Reconciler error" "error"="unexpected status code: 500" "controller"="pipelinerun" "controllerGroup"="devops.kubesphere.io" "controllerKind"="PipelineRun" "name"="elf-gqmc7" "namespace"="tools6rwrl" "pipelineRun"={"name":"elf-gqmc7","namespace":"tools6rwrl"} "reconcileID"="6d87a5d2-3543-40d7-94d3-ec3d14fd0bef"
            <nil>
            <nil>
            E0821 03:09:32.357775 1 devopscredential_controller.go:294] unexpected error type: not found resources, should be *restful.ServiceError
            E0821 03:09:32.357826 1 devopscredential_controller.go:167] error syncing 'cssgzndq/gitlab-token': failed to remove devops credential finalizer due to bad communication with Jenkins, requeuingcould not reconcile devopsProject
            E0821 03:09:32.357834 1 devopscredential_controller.go:168] error syncing 'cssgzndq/gitlab-token': failed to remove devops credential finalizer due to bad communication with Jenkins, requeuing
            E0821 03:09:32.428121 1 devopscredential_controller.go:294] unexpected error type: not found resources, should be *restful.ServiceError
            E0821 03:09:32.428175 1 devopscredential_controller.go:167] error syncing 'cssgzndq/k8s-token': failed to remove devops credential finalizer due to bad communication with Jenkins, requeuingcould not reconcile devopsProject
            E0821 03:09:32.428183 1 devopscredential_controller.go:168] error syncing 'cssgzndq/k8s-token': failed to remove devops credential finalizer due to bad communication with Jenkins, requeuing
            E0821 03:09:32.449208 1 devopscredential_controller.go:294] unexpected error type: not found resources, should be *restful.ServiceError
            E0821 03:09:32.449252 1 devopscredential_controller.go:167] error syncing 'uosjmdbk/gitlab-token': failed to remove devops credential finalizer due to bad communication with Jenkins, requeuingcould not reconcile devopsProject
            E0821 03:09:32.449260 1 devopscredential_controller.go:168] error syncing 'uosjmdbk/gitlab-token': failed to remove devops credential finalizer due to bad communication with Jenkins, requeuing
            E0821 03:09:32.461410 1 devopscredential_controller.go:294] unexpected error type: not found resources, should be *restful.ServiceError
            E0821 03:09:32.461456 1 devopscredential_controller.go:167] error syncing 'uosjmdbk/k8s-token': failed to remove devops credential finalizer due to bad communication with Jenkins, requeuingcould not reconcile devopsProject
            E0821 03:09:32.461465 1 devopscredential_controller.go:168] error syncing 'uosjmdbk/k8s-token': failed to remove devops credential finalizer due to bad communication with Jenkins, requeuing
            <nil>
            <nil>
            <nil>
            <nil>
            E0821 03:14:01.416187 1 pipelinerun_controller.go:205] pipelinerun-controller "msg"="unable to run pipeline" "error"="unexpected status code: 500" "Pipeline"="css" "PipelineRun"={"Namespace":"tools6rwrl","Name":"css-vc4d8"} "namespace"="tools6rwrl" "pipeline"="css"
            E0821 03:14:01.416736 1 controller.go:326] "msg"="Reconciler error" "error"="unexpected status code: 500" "controller"="pipelinerun" "controllerGroup"="devops.kubesphere.io" "controllerKind"="PipelineRun" "name"="css-vc4d8" "namespace"="tools6rwrl" "pipelineRun"={"name":"css-vc4d8","namespace":"tools6rwrl"} "reconcileID"="7917f670-219b-4738-b3fa-311dedcd39fe"
            E0821 03:14:25.682510 1 pipelinerun_controller.go:205] pipelinerun-controller "msg"="unable to run pipeline" "error"="unexpected status code: 500" "Pipeline"="elf" "PipelineRun"={"Namespace":"tools6rwrl","Name":"elf-rhgbf"} "namespace"="tools6rwrl" "pipeline"="elf"
            E0821 03:14:25.682684 1 controller.go:326] "msg"="Reconciler error" "error"="unexpected status code: 500" "controller"="pipelinerun" "controllerGroup"="devops.kubesphere.io" "controllerKind"="PipelineRun" "name"="elf-rhgbf" "namespace"="tools6rwrl" "pipelineRun"={"name":"elf-rhgbf","namespace":"tools6rwrl"} "reconcileID"="1489e14b-bb48-4ca7-8472-2a6ca2ee2ad6"

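            5. Compared devops-config, kubesphere-config, and kubesphere-secret

            6. What puzzles me is that the jenkins-admin-password defined in kubesphere-devops-system/secrets/devops-jenkins cannot be used to log in to Jenkins. Restarting Jenkins did not help either; in the end I logged in with the default password.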

              BombLing Thanks for the feedback. Which KubeSphere version is this environment running? Also, please check whether the jwtSecret in kubesphere-config and the one in devops-config are the same.
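The jwtSecret comparison can be scripted instead of done by eye. The sketch below is only an illustration: the helper names are made up, and the `kubectl` lines in the comments assume that both ConfigMaps keep their settings under a `kubesphere.yaml` data key, which should be verified on your own cluster.

```shell
#!/bin/sh
# Hypothetical helper: print the value of the first `jwtSecret:` key found
# in YAML text on stdin, with surrounding quotes removed.
extract_jwt_secret() {
  sed -n 's/^[[:space:]]*jwtSecret:[[:space:]]*//p' | head -n 1 | tr -d "\"'\r"
}

# Hypothetical helper: succeed only if both YAML documents carry the same
# non-empty jwtSecret. The subshell body keeps the variables local.
same_jwt_secret() (
  a=$(printf '%s\n' "$1" | extract_jwt_secret)
  b=$(printf '%s\n' "$2" | extract_jwt_secret)
  [ -n "$a" ] && [ "$a" = "$b" ]
)

# Usage (namespaces and data key are assumptions, adjust as needed):
#   ks=$(kubectl -n kubesphere-system get cm kubesphere-config \
#          -o jsonpath='{.data.kubesphere\.yaml}')
#   devops=$(kubectl -n kubesphere-devops-system get cm devops-config \
#          -o jsonpath='{.data.kubesphere\.yaml}')
#   same_jwt_secret "$ks" "$devops" && echo "jwtSecret matches" || echo "jwtSecret differs"
```

A plain text match like this ignores YAML subtleties such as multi-line values, so treat a mismatch as a prompt to inspect both files by hand.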

                yudong

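                1. The jwtSecret in kubesphere-config and the one in devops-config are the same.

                2. KubeSphere is 3.4.0, upgraded from 3.2.1.
                3. Looking back at the timeline: most likely a pipeline was deleted from a DevOps project, or a DevOps project itself was deleted, after which users with permissions on that project started seeing "Not running".

                4. I just found that a DevOps project had failed to delete. The cause was that deleting its secrets got stuck; they could only be removed with a patch.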

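                  BombLing I see. If jwtSecret and the password (the one in the screenshot) are the same and the ks-controller-manager service has been restarted, then that part is fine.
                  As for DevOps projects or pipelines that get stuck while being deleted, you can delete them manually from a terminal on a k8s node.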
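For resources that hang on deletion, the usual manual step is to clear `metadata.finalizers` with `kubectl patch`; whether that is exactly what was done in this thread is an assumption. The sketch below only prints the command by default (`DRY_RUN=1`), since removing a finalizer skips whatever cleanup the controller would have done in Jenkins. The function name `clear_finalizers` is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: clear finalizers on a stuck resource so that a
# pending delete can complete. Arguments: kind, name, namespace.
# With DRY_RUN=1 (the default) the kubectl command is printed, not run.
clear_finalizers() (
  kind=$1
  name=$2
  namespace=$3
  patch='{"metadata":{"finalizers":null}}'
  if [ "${DRY_RUN:-1}" = "1" ]; then
    printf "kubectl patch %s %s -n %s --type=merge -p '%s'\n" \
      "$kind" "$name" "$namespace" "$patch"
  else
    kubectl patch "$kind" "$name" -n "$namespace" --type=merge -p "$patch"
  fi
)

# Usage (kind, name, and namespace are placeholders for your own resource):
#   clear_finalizers secret <secret-name> <namespace>
#   DRY_RUN=0 clear_finalizers secret <secret-name> <namespace>
```

Review the printed command first, and run it with `DRY_RUN=0` only once you are sure the resource is meant to be gone.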

                    yudong

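                    I compared the secret in kubesphere-secret, the jwtSecret in kubesphere-config, and the jwtSecret in devops-config: all three match. I also restarted ks-controller-manager, but Jenkins shows the users as follows:

                    Under what circumstances is an update of role-strategy/assign-roles in Jenkins triggered?

                      BombLing I see. User information is synchronized when a user's password is changed, so you could try changing the user's password.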

                        @yudong
                        ks: 3.3.2
                        k8s: 1.24.9
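                        The symptom is that, among multi-branch pipelines, individual pipelines often stay in "Not running" while the others work normally. Restarting the related components did not fix it; deleting and recreating the pipeline makes it run again.
                        Another symptom is that a pipeline has finished executing and has also ended in Jenkins, but its status in KubeSphere stays at "Running".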

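                          littlejiancc Possibly too many pipelines are running at the same time and these ones are waiting in the queue. Could you check whether there are multiple pods in the kubesphere-devops-worker namespace?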
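One way to check the worker namespace is to count its pods by status. The sketch below is a minimal illustration; the function name `count_pods_by_status` is made up, and it assumes the default `kubectl get pods --no-headers` layout for a single namespace, where STATUS is the third column.

```shell
#!/bin/sh
# Hypothetical helper: read `kubectl get pods --no-headers` output on stdin
# and print one "STATUS COUNT" line per status, sorted by status name.
count_pods_by_status() {
  awk 'NF >= 3 { n[$3]++ } END { for (s in n) print s, n[s] }' | sort
}

# Usage:
#   kubectl -n kubesphere-devops-worker get pods --no-headers | count_pods_by_status
```

Many agent pods stuck in Pending would be consistent with pipelines waiting in a queue, though it does not prove that is the cause.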