-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Kind 0.20.0 pod create error with 0.20.0 node images #3309
Comments
/assign |
/remove-kind bug |
/kind support |
@bpfoster
Is there any specific pod which is failing ? |
Thanks @kundan2707 This is specific to a pod running one of our in-house images. I have been able to whittle down the real dockerfile to the following minimal reproduction image. Understand that it is not an efficient Dockerfile - I'll be looking at cleaning it up - but the real dockerfile is much more complex and this is essentially the end-result of run steps:
To reproduce:
Step 2 fails with kind node image This is beyond my understanding, but the errors seem to relate to whiteout files created during the last
Perhaps this is then a containerd issue..? |
Forget all the apt-get steps, I get a similar whiteout convert error with an image as simple as
|
This sounds like a containerd bug, thanks for debugging this so far. |
If I run containerd locally, things work fine. So my guess is it's something with being run within a container, and my uneducated guess is related to the overlay mounts. This looks to be the problematic commit in containerd: Prior to that commit it works, after it I get the error. I don't know enough here to say if it's a bug in containerd or something that kind needs to change to handle. |
Had some time to dig around, and it does seem to be a containerd bug. I've opened an issue with them: containerd/containerd#8851 |
Thanks! containerd/containerd#8851 (comment) Looks like we'll need to upgrade containerd to pick up this fix. |
This is now in the 1.7 branch containerd/containerd@2eaeb32 (since 5 days ago), given 1.7.3 released a week ago we may need to pick up a pre-release commit for a bit. |
Yeah unfortunately I wasn't familiar with their process and didn't request the 1.7 cherry-pick until after 1.7.3 had been released. |
@BenTheElder - looks like containerd 1.7.4 was released with this fix, with a 1.7.5 release shortly after. 1.7.4 also bumps runc to 1.1.9. |
We didn't bump containerd further w/ the k8s release hitting code freeze etc. I'm going to figure out go update in #3335 then bump everything and get reading for a v0.21 which will include this. Particularly long lifecycle this time, if workarounds weren't available we would have pressed forward with something sooner. |
Great, thanks for the update! |
What happened:
Just upgraded to kind 0.20.0. If I specify any of the node images that are listed in the release (e.g.
kindest/node:v1.22.17@sha256:f5b2e5698c6c9d6d0adc419c0deae21a425c07d81bbf3b6a6834042f25d4fba2
orkindest/node:v1.27.3@sha256:3966ac761ae0136263ffdb6cfd4db23ef8a83cba8a463690e98317add2c9ba72
) one of my pods (the only statefulset fwiw), fails to create with the following error:Interestingly if I switch the node image to one specified in the kind 0.19.0 release while still running kind 0.20.0 (for example
kindest/node:v1.22.17@sha256:9af784f45a584f6b28bce2af84c494d947a05bd709151466489008f80a9ce9d5
orkindest/node:v1.27.1@sha256:b7d12ed662b873bd8510879c1846e87c7e676a79fefc93e17b2a52989d3ff42b
), it works.What you expected to happen: Pods run without error
How to reproduce it (as minimally and precisely as possible):
kind create cluster --image=<image mentioned above>
helm install my-app
Anything else we need to know?: Running on rootless podman via systemd user scope
Environment:
kind version
):kind v0.20.0 go1.20.4 linux/amd64
docker info
orpodman info
):/etc/os-release
):kubectl version
):Kind config:
The text was updated successfully, but these errors were encountered: