Environment information:
RKE2 version:
Rancher version: v2.7.9
Node CPU architecture, operating system, and version:
3.10.0-1160.99.1.el7.x86_64 #1 SMP Thu Aug 10 10:46:21 EDT 2023 x86_64 x86_64 x86_64 GNU/Linux
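Cluster configuration:
This is a test node; there is only one server node.
Problem description:
When I create a Kubernetes cluster in Rancher, the cluster reports the error: Waiting for agent to check in and apply initial plan. I am also not sure whether this is related to Rancher v2.7.9.
Steps to reproduce:
- RKE2 install command:
Logs: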
“[Applyinator] No image provided, creating empty working directory /var/lib/rancher/agent/work/20231117-153252/c4ee63cd896420f19cbd88a05165f5d212dd07ce1625ba584fd0693a9e6d7e87_0”
“[Applyinator] Running command: sh [-c rke2 etcd-snapshot list --etcd-s3=false 2>/dev/null]”
“[Applyinator] Command sh [-c rke2 etcd-snapshot list --etcd-s3=false 2>/dev/null] finished with err: and exit code: 1”
“error loading x509 client cert/key for probe kube-apiserver (/var/lib/rancher/rke2/server/tls/client-kube-apiserver.crt//var/lib/rancher/rke2/server/tls/client-kube-apiserver.key): open /var/lib/rancher/rke2/server/tls/client-kube-apiserver.crt: no such file or directory”
“error loading CA cert for probe (kube-scheduler) /var/lib/rancher/rke2/server/tls/kube-scheduler/kube-scheduler.crt: open /var/lib/rancher/rke2/server/tls/kube-scheduler/kube-scheduler.crt: no such file or directory”
“error while appending ca cert to pool for probe kube-scheduler”
level=error msg=“error loading CA cert for probe (kube-controller-manager) /var/lib/rancher/rke2/server/tls/kube-controller-manager/kube-controller-manager.crt: open /var/lib/rancher/rke2/server/tls/kube-controller-manager/kube-controller-manager.crt: no such file or directory”
“error while appending ca cert to pool for probe kube-controller-manager”
“error loading CA cert for probe (kube-apiserver) /var/lib/rancher/rke2/server/tls/server-ca.crt: open /var/lib/rancher/rke2/server/tls/server-ca.crt: no such file or directory”
“error while appending ca cert to pool for probe kube-apiserver”
rke2-server.service holdoff time over, scheduling restart.
Stopped Rancher Kubernetes Engine v2 (server).
Starting Rancher Kubernetes Engine v2 (server)…
- /usr/bin/systemctl is-enabled --quiet nm-cloud-setup.service
Failed to get unit file state for nm-cloud-setup.service: No such file or directory
“missing required: user: unknown user etcd\nmissing required: group: unknown group etcd\ninvalid kernel parameter value vm.overcommit_memory=0 - expected 1\ninvalid kernel parameter value kernel.panic=0 - expected 10\n”
rke2-server.service: main process exited, code=exited, status=1/FAILURE
Failed to start Rancher Kubernetes Engine v2 (server).
Unit rke2-server.service entered failed state.
rke2-server.service failed.
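
The "missing required ... invalid kernel parameter value" line in the log above packs four separate failed checks into one string, which makes it easy to misread. These look like the host checks RKE2 runs when a CIS profile is enabled on the cluster; that is an assumption, since the cluster configuration is not shown here. Below is a minimal Python sketch that only splits that exact message into its individual items. The function name `parse_failures` is hypothetical and is not part of RKE2 or Rancher.

```python
import re

# The failure message copied from the log above (literal "\n" separators kept).
MESSAGE = (
    r"missing required: user: unknown user etcd\n"
    r"missing required: group: unknown group etcd\n"
    r"invalid kernel parameter value vm.overcommit_memory=0 - expected 1\n"
    r"invalid kernel parameter value kernel.panic=0 - expected 10\n"
)


def parse_failures(message):
    """Split the message into missing accounts and sysctl mismatches.

    Hypothetical helper for reading the log; it does not change the host.
    """
    missing = []   # list of (kind, name), e.g. ("user", "etcd")
    sysctls = {}   # parameter name -> (current value, expected value)
    # The log shows the separators as a literal backslash + "n", so convert
    # them to real newlines before splitting.
    for line in message.replace("\\n", "\n").splitlines():
        m = re.match(r"missing required: (user|group): unknown \1 (\S+)", line)
        if m:
            missing.append((m.group(1), m.group(2)))
            continue
        m = re.match(
            r"invalid kernel parameter value (\S+)=(\S+) - expected (\S+)", line
        )
        if m:
            sysctls[m.group(1)] = (m.group(2), m.group(3))
    return missing, sysctls


missing, sysctls = parse_failures(MESSAGE)
print("missing accounts:", missing)
print("sysctl mismatches:", sysctls)
```

Read this way, the message says the node has no `etcd` user or group, and that `vm.overcommit_memory` and `kernel.panic` are currently 0 while 1 and 10 are expected.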