Problem with 404s when upgrading Helm Chart

_cole · April 24, 2023, 2:49am

Hello,

I have recently upgraded the helm chart from v12.0.0 to v21.2.1, corresponding to traefik version 2.9.1 to 2.9.9.

Unfortunately, after upgrade all of my Ingress and IngressRoutes are returning nothing but 404.

I have:

Fixed service/deployment labels as described in traefik helm chart upgrade notes
Turned on the Traefik Dashboard and confirmed that these routes exist
Ensured that CRDs are up to date
tried using ingressClassName: traefik on the ingress as well as what I currently had in place (the kubernetes.io/ingress.class: traefik annotation)
Checked logging and turned up verbosity. It looks like Ingresses are getting picked up properly

time="2023-04-24T02:14:30Z" level=debug msg="Adding route for prom.staging.url with TLS options default" entryPointName=websecure
time="2023-04-24T02:14:30Z" level=debug msg="Adding route for cost.staging.url with TLS options default" entryPointName=websecure

But then curl -i https://cost.staging.url returns 404

confirmed that requests are reaching traefik (access logs, packet tracing, etc.)
Access log entries look like this:

my.ip.address - - [24/Apr/2023:02:45:19 +0000] "GET / HTTP/1.1" - - "-" "-" 379 "-" "-" 0ms

Does anyone have ideas about what could be done to gather some more diagnostic information? Do I need to start from scratch on this cluster / delete and recreate traefik and its load balancer? I'd rather avoid this since it creates the need to redefine DNS and whatnot. Moreover, I am not optimistic that this will solve the problem... (again, the routes all exist as expected in the Traefik Dashboard!). In fact, the helm-chart created ingressRoute is the only one that I can get to work... is it possible that the fact that I almost exclusively use Host() rules is breaking things for some reason?

I'm hopeful I am missing something obvious

svx · April 24, 2023, 1:04pm

Hi @_cole! Thanks for your interest in Traefik!

Could you run Traefik in DEBUG mode and post the output?

Thanks!

_cole · April 24, 2023, 2:17pm

Howdy @svx !

Am I understanding right that this is just running traefik with DEBUG log level? Or is there something more than that?

Thanks!

svx · April 24, 2023, 2:30pm

Hey @_cole,

Yes, that is right! Just run Traefik with log level set to DEBUG.

!

_cole · April 24, 2023, 3:09pm

I will admit, it is a bit much I can try to create a simpler example if that would be helpful?

In any case, it was 10x the length of what I was allowed to post, so I put it into a gist

gist.github.com

https://gist.github.com/colearendt/00d5e4e47b1ba8fc482aa8d8f49f8007

traefik-debug-clean.log

time="2023-04-24T14:43:28Z" level=info msg="Configuration loaded from flags."
time="2023-04-24T14:43:28Z" level=info msg="Traefik version 2.9.9 built on 2023-03-21T15:52:28Z"
time="2023-04-24T14:43:28Z" level=debug msg="Static configuration loaded {\"global\":{\"checkNewVersion\":true,\"sendAnonymousUsage\":true},\"serversTransport\":{\"maxIdleConnsPerHost\":200},\"entryPoints\":{\"metrics\":{\"address\":\":9100/tcp\",\"transport\":{\"lifeCycle\":{\"graceTimeOut\":\"10s\"},\"respondingTimeouts\":{\"idleTimeout\":\"3m0s\"}},\"forwardedHeaders\":{},\"http\":{},\"http2\":{\"maxConcurrentStreams\":250},\"udp\":{\"timeout\":\"3s\"}},\"psql\":{\"address\":\":5432/tcp\",\"transport\":{\"lifeCycle\":{\"graceTimeOut\":\"10s\"},\"respondingTimeouts\":{\"idleTimeout\":\"3m0s\"}},\"forwardedHeaders\":{},\"http\":{},\"http2\":{\"maxConcurrentStreams\":250},\"udp\":{\"timeout\":\"3s\"}},\"traefik\":{\"address\":\":9000/tcp\",\"transport\":{\"lifeCycle\":{\"graceTimeOut\":\"10s\"},\"respondingTimeouts\":{\"idleTimeout\":\"3m0s\"}},\"forwardedHeaders\":{},\"http\":{},\"http2\":{\"maxConcurrentStreams\":250},\"udp\":{\"timeout\":\"3s\"}},\"web\":{\"address\":\":8000/tcp\",\"transport\":{\"lifeCycle\":{\"graceTimeOut\":\"10s\"},\"respondingTimeouts\":{\"idleTimeout\":\"3m0s\"}},\"forwardedHeaders\":{},\"http\":{\"redirections\":{\"entryPoint\":{\"to\":\":443\",\"scheme\":\"https\",\"permanent\":true,\"priority\":2147483646}}},\"http2\":{\"maxConcurrentStreams\":250},\"udp\":{\"timeout\":\"3s\"}},\"websecure\":{\"address\":\":8443/tcp\",\"transport\":{\"lifeCycle\":{\"graceTimeOut\":\"10s\"},\"respondingTimeouts\":{\"idleTimeout\":\"3m0s\"}},\"forwardedHeaders\":{},\"http\":{\"tls\":{}},\"http2\":{\"maxConcurrentStreams\":250},\"udp\":{\"timeout\":\"3s\"}}},\"providers\":{\"providersThrottleDuration\":\"2s\",\"kubernetesIngress\":{\"ingressEndpoint\":{\"publishedService\":\"soleng/traefik\"}},\"kubernetesCRD\":{\"allowExternalNameServices\":true}},\"api\":{\"dashboard\":true},\"metrics\":{\"prometheus\":{\"buckets\":[0.1,0.3,1.2,5],\"addEntryPointsLabels\":true,\"addRoutersLabels\":true,\"addServicesLabels\":true,\"entryPoint\":\"metrics\"}},\"ping\":{\"entryPoint\":\"traefik\",\"terminatingStatusCode\":503},\"log\":{\"level\":\"TRACE\",\"format\":\"common\"},\"accessLog\":{\"format\":\"common\",\"filters\":{},\"fields\":{\"defaultMode\":\"keep\",\"headers\":{\"defaultMode\":\"drop\"}}},\"experimental\":{\"localPlugins\":{\"rewriteHeaders\":{\"moduleName\":\"github.com/XciD/traefik-plugin-rewrite-headers\"},\"templateHeaders\":{\"moduleName\":\"github.com/colearendt/traefik-plugin-template-headers\"}}}}"
time="2023-04-24T14:43:28Z" level=info msg="Stats collection is enabled."
time="2023-04-24T14:43:28Z" level=info msg="Many thanks for contributing to Traefik's improvement by allowing us to receive anonymous information from your configuration."
time="2023-04-24T14:43:28Z" level=info msg="Help us improve Traefik by leaving this feature on :)"
time="2023-04-24T14:43:28Z" level=info msg="More details on: https://doc.traefik.io/traefik/contributing/data-collection/"
time="2023-04-24T14:43:28Z" level=debug msg="Configured Prometheus metrics" metricsProviderName=prometheus
time="2023-04-24T14:43:28Z" level=info msg="Starting provider aggregator aggregator.ProviderAggregator"
time="2023-04-24T14:43:28Z" level=debug msg="Starting TCP Server" entryPointName=websecure

This file has been truncated. show original

I'm wondering if something related to my plugin setup may be the problem. Just strange b/c it all worked pre-upgrade.

svx · April 25, 2023, 12:55pm

Hi @_cole,
We can't find something in the log about your configuration.
There should be a line "Configuration received ..."

Did you remove this part from the log file?

_cole · April 25, 2023, 1:10pm

I didn't. Is Configuration loaded from flags not sufficient? I am using the helm chart, so I believe just about everything is either from flags or env vars - no config map or anything?

_cole · April 26, 2023, 2:28am

Maybe another way of attacking this problem - does anyone have an example of a deployment to EKS with service type LoadBalancer that uses Ingresses to route traffic? A fairly vanilla deployment to my same cluster is also struggling, all without any clear logging I'll keep digging - there is still something funky about this first non-default ingress class with a label selector.

svx · April 26, 2023, 1:09pm

Hi @_cole,
Just an observation from your log.
Are you sure that you're reaching Traefik?

We don't see anything in the logs.

_cole · April 26, 2023, 2:37pm

Merp. Sorry about that. I must have forgotten to hit the endpoint after restarting

I will generate a new log here in a bit. Basically, the only line I missed was the access log line that I mentioned above. The lack of a status is confounding - maybe that's just how things work if they don't match a router? But why it is not matching a router is the most surprising and befuddling:

my.ip.addr.90 - - [26/Apr/2023:14:33:27 +0000] "GET /ping HTTP/1.1" 200 2 "-" "-" 51 "ping@internal" "-" 0ms
my.ip.addr.90 - - [26/Apr/2023:14:33:27 +0000] "GET /ping HTTP/1.1" 200 2 "-" "-" 52 "ping@internal" "-" 0ms
my.ip.addr.90 - - [26/Apr/2023:14:33:37 +0000] "GET /ping HTTP/1.1" 200 2 "-" "-" 53 "ping@internal" "-" 0ms
my.ip.addr.90 - - [26/Apr/2023:14:33:37 +0000] "GET /ping HTTP/1.1" 200 2 "-" "-" 54 "ping@internal" "-" 0ms
my.ip.addr.90 - - [26/Apr/2023:14:33:47 +0000] "GET /ping HTTP/1.1" 200 2 "-" "-" 56 "ping@internal" "-" 0ms
my.ip.addr.90 - - [26/Apr/2023:14:33:47 +0000] "GET /ping HTTP/1.1" 200 2 "-" "-" 55 "ping@internal" "-" 0ms
my.ip.addr.32 - - [26/Apr/2023:14:33:50 +0000] "GET / HTTP/1.1" - - "-" "-" 57 "-" "-" 0ms
my.ip.addr.32 - - [26/Apr/2023:14:33:51 +0000] "GET / HTTP/1.1" - - "-" "-" 58 "-" "-" 0ms
my.ip.addr.32 - - [26/Apr/2023:14:33:52 +0000] "GET / HTTP/1.1" - - "-" "-" 59 "-" "-" 0ms

svx · April 27, 2023, 3:27pm

Could do a GET request to the /api/rawdata endpoint of Traefik and share the output?

Could you also share an example curl command of a request so that we can see which/if/how an entrypoint was reached?

_cole · May 4, 2023, 5:09pm

~ curl -i https://staging.domain.com/
HTTP/1.1 404 Not Found
Content-Type: text/plain; charset=utf-8
X-Content-Type-Options: nosniff
Date: Thu, 04 May 2023 17:01:54 GMT
Content-Length: 19

404 page not found

my.ip.addr.57 - - [04/May/2023:17:01:54 +0000] "GET / HTTP/1.1" - - "-" "-" 145165 "-" "-" 0ms

And the rawdata endpoint:

gist.github.com

https://gist.github.com/colearendt/c032d86cecdeaeab917be110884f1f1b

content.json

{
  "routers": {
    "connect-daily-company-connect-daily-staging-statenm-new-co-daily-connect@kubernetes": {
      "entryPoints": [
        "metrics",
        "psql",
        "web"
      ],
      "middlewares": [
        "connect-daily-add-templated-headers@kubernetescrd",

This file has been truncated. show original

Unfortunately, this is still a very complicated example as I have been pulled into other teams and will probably have to revert out of this upgrade for now. This is a super useful debugging note, though, thank you!! If you see anything obvious, please do let me know It is a bit much though

Topic		Replies	Views
Dashboard returns 404 after installing with the latest helm chart (traefik-10.9.1) Traefik v2 kubernetes-ingress , dashboard-api	0	677	January 21, 2022
Upgrading Traefik from 2.4.8 to 2.5.6 returns 404 on all services Traefik v2 kubernetes-ingress	6	762	October 20, 2022
Deploying Traefik via Helm - 404s for Dashboard and API Traefik v2 kubernetes-crd , dashboard-api	2	1053	August 13, 2020
Having issue while migrating from 2.11.x to 3.2.0 Traefik v3 (latest) kubernetes-ingress , middleware	5	119	November 30, 2024
404 on some ingress route Traefik v2 kubernetes-crd , kubernetes-ingress	1	2318	September 16, 2021

Problem with 404s when upgrading Helm Chart

Related topics