Bad Gateway during Helm Upgrade/Rollout

Souvent22 · May 21, 2022, 1:29pm

Hello,

Using K8 Rollout strategy and using it with Helm ( or by itself), I get a small "blip" of 502 bad gateway. We really need 0 downtime deployments. We have tried the following:

Thinking Traefik needs time to pick up the new rolling out pods, lengthed the "pauseTime" to give Traefik more time to recognize them
Using a higher maxSurge

Is there something we are missing here? Our test rig send a request every .5 seconds and then we do a helm upgrade/rollout. That's how we're seeing the blip or a 502. I have some other thoughts:

Maybe a pod is being taken down in the middle of a request?
Traefik looks at the service, so perhaps there's something at the K8 level where I need to either deploy a new service as the current service may not know those pods are being taken out of rotation?

Thanks.

Souvent22 · May 21, 2022, 9:47pm

We think we located the issue. We are running Nginx, and when SIGTERM is sent, our version of Nginx does a fast shutdown instead of a safe shutdown. We updated our docker configs for our containers to set the STOPSIGNAL to SIGQUIT and seems to have fixed the issue.

So it wasn't Traefik, it was our containers/nginx.

jakubhajek · May 23, 2022, 11:19am

Thanks for the update @Souvent22

Topic		Replies	Views
502 blip when updating/restarting traefik deployment in Kubernetes Traefik v3 (latest) docker , kubernetes-crd , kubernetes-ingress	1	69	April 16, 2025
Bad Gateway on Kubernetes cluster Traefik v2 kubernetes-ingress	0	802	October 8, 2020
I access my service through traefik when the service rolling update in kubernetes Traefik v1 kubernetes-ingress	1	428	August 15, 2019
Problem with 404s when upgrading Helm Chart Traefik v2 kubernetes-ingress	11	1337	May 4, 2023
Upgrading Traefik from 2.4.8 to 2.5.6 returns 404 on all services Traefik v2 kubernetes-ingress	6	782	October 20, 2022

Bad Gateway during Helm Upgrade/Rollout

Related topics