Rancher Server 设置
- Rancher 版本:2.6.9
- 安装选项 (Docker install/Helm Chart):
- 如果是 Helm Chart 安装,需要提供 Local 集群的类型(RKE1, RKE2, k3s, EKS, 等)和版本:
- 在线或离线部署:
在线安装
下游集群信息
- Kubernetes 版本: v1.24.17-rancher1-1
- Cluster Type (Local/Downstream):
- 如果 Downstream,是什么类型的集群?(自定义/导入或为托管 等): 自定义
用户信息
- 登录用户的角色是什么? (管理员/集群所有者/集群成员/项目所有者/项目成员/自定义):admin
- 如果自定义,自定义权限集:
主机操作系统:
ubuntu 22.04
问题描述:
安装纳入自定义集群报错,[ERROR] [controlPlane] Failed to upgrade Control Plane: [[host master209 not ready]]
重现步骤:
结果:
预期结果:
截图:
其他上下文信息:
日志
[INFO ] Initiating Kubernetes cluster
[INFO ] Successfully Deployed state file at [management-state/rke/rke-2484433011/cluster.rkestate]
[INFO ] Building Kubernetes cluster
[INFO ] [dialer] Setup tunnel for host [10.10.10.209]
[INFO ] [network] No hosts added existing cluster, skipping port check
[INFO ] [certificates] Deploying kubernetes certificates to Cluster nodes
[INFO ] [reconcile] Rebuilding and updating local kube config
[INFO ] Successfully Deployed local admin kubeconfig at [management-state/rke/rke-2484433011/kube_config_cluster.yml]
[INFO ] [reconcile] host [10.10.10.209] is a control plane node with reachable Kubernetes API endpoint in the cluster
[INFO ] [certificates] Successfully deployed kubernetes certificates to Cluster nodes
[INFO ] [file-deploy] Deploying file [/etc/kubernetes/kube-api-authn-webhook.yaml] to node [10.10.10.209]
[INFO ] Successfully started [file-deployer] container on host [10.10.10.209]
[INFO ] Waiting for [file-deployer] container to exit on host [10.10.10.209]
[INFO ] Waiting for [file-deployer] container to exit on host [10.10.10.209]
[INFO ] Container [file-deployer] is still running on host [10.10.10.209]: stderr: [], stdout: []
[INFO ] [remove/file-deployer] Successfully removed container on host [10.10.10.209]
[INFO ] [/etc/kubernetes/kube-api-authn-webhook.yaml] Successfully deployed authentication webhook config Cluster nodes
[INFO ] [file-deploy] Deploying file [/etc/kubernetes/audit-policy.yaml] to node [10.10.10.209]
[INFO ] Successfully started [file-deployer] container on host [10.10.10.209]
[INFO ] Waiting for [file-deployer] container to exit on host [10.10.10.209]
[INFO ] Waiting for [file-deployer] container to exit on host [10.10.10.209]
[INFO ] Container [file-deployer] is still running on host [10.10.10.209]: stderr: [], stdout: []
[INFO ] [remove/file-deployer] Successfully removed container on host [10.10.10.209]
[INFO ] [/etc/kubernetes/audit-policy.yaml] Successfully deployed audit policy file to Cluster control nodes
[INFO ] [reconcile] Reconciling cluster state
[INFO ] [reconcile] Check etcd hosts to be deleted
[INFO ] [reconcile] Check etcd hosts to be added
[INFO ] [reconcile] Rebuilding and updating local kube config
[INFO ] Successfully Deployed local admin kubeconfig at [management-state/rke/rke-2484433011/kube_config_cluster.yml]
[INFO ] [reconcile] host [10.10.10.209] is a control plane node with reachable Kubernetes API endpoint in the cluster
[INFO ] [reconcile] Reconciled cluster state successfully
[INFO ] Pre-pulling kubernetes images
[INFO ] Kubernetes images pulled successfully
[INFO ] [etcd] Building up etcd plane..
[INFO ] Successfully started [etcd-fix-perm] container on host [10.10.10.209]
[INFO ] Waiting for [etcd-fix-perm] container to exit on host [10.10.10.209]
[INFO ] Waiting for [etcd-fix-perm] container to exit on host [10.10.10.209]
[INFO ] [remove/etcd-fix-perm] Successfully removed container on host [10.10.10.209]
[INFO ] [etcd] Successfully started [rke-log-linker] container on host [10.10.10.209]
[INFO ] [remove/rke-log-linker] Successfully removed container on host [10.10.10.209]
[INFO ] [etcd] Successfully started etcd plane.. Checking etcd cluster health
[INFO ] [controlplane] Building up Controller Plane..
[INFO ] [sidekick] Sidekick container already created on host [10.10.10.209]
[INFO ] [healthcheck] Start Healthcheck on service [kube-apiserver] on host [10.10.10.209]
[INFO ] [healthcheck] service [kube-apiserver] on host [10.10.10.209] is healthy
[INFO ] [controlplane] Successfully started [rke-log-linker] container on host [10.10.10.209]
[INFO ] [remove/rke-log-linker] Successfully removed container on host [10.10.10.209]
[INFO ] [healthcheck] Start Healthcheck on service [kube-controller-manager] on host [10.10.10.209]
[INFO ] [healthcheck] service [kube-controller-manager] on host [10.10.10.209] is healthy
[INFO ] [controlplane] Successfully started [rke-log-linker] container on host [10.10.10.209]
[INFO ] [remove/rke-log-linker] Successfully removed container on host [10.10.10.209]
[INFO ] [healthcheck] Start Healthcheck on service [kube-scheduler] on host [10.10.10.209]
[INFO ] [healthcheck] service [kube-scheduler] on host [10.10.10.209] is healthy
[INFO ] [controlplane] Successfully started [rke-log-linker] container on host [10.10.10.209]
[INFO ] [remove/rke-log-linker] Successfully removed container on host [10.10.10.209]
[INFO ] [controlplane] Successfully started Controller Plane..
[INFO ] [worker] Building up Worker Plane..
[INFO ] [sidekick] Sidekick container already created on host [10.10.10.209]
[INFO ] [healthcheck] Start Healthcheck on service [kubelet] on host [10.10.10.209]
[INFO ] [healthcheck] service [kubelet] on host [10.10.10.209] is healthy
[INFO ] [worker] Successfully started [rke-log-linker] container on host [10.10.10.209]
[INFO ] [remove/rke-log-linker] Successfully removed container on host [10.10.10.209]
[INFO ] [healthcheck] Start Healthcheck on service [kube-proxy] on host [10.10.10.209]
[INFO ] [healthcheck] service [kube-proxy] on host [10.10.10.209] is healthy
[INFO ] [worker] Successfully started [rke-log-linker] container on host [10.10.10.209]
[INFO ] [remove/rke-log-linker] Successfully removed container on host [10.10.10.209]
[INFO ] [worker] Successfully started Worker Plane..
[INFO ] [controlplane] Processing controlplane hosts for upgrade 1 at a time
[INFO ] Processing controlplane host master209
[ERROR] [controlPlane] Failed to upgrade Control Plane: [[host master209 not ready]]