10s delays in forwarding requests

robert.pankowecki.t · March 10, 2021, 7:41am

Hey,

My team is using traefik:v2.2.1 and we observe unusual behavior in our end-to-end tests. Sometimes requests are not forwarded for 10s.

What we see in logs:

172.23.0.1 - - [09/Mar/2021:16:12:31 +0000] "POST /staff?operationNames=GetChrome HTTP/1.1" 200 379 "-" "-" 2724 "gg@file" "http://gg:5000/" 10046ms

What we observe in the downstream service (gg) is that it is hit with the request exactly 10s later than it should. Not at 16:12:31 but at 16:12:41 . The request processing took a few miliseconds, so I can imagine that the whole roundtrip took 46ms, and the 10000ms is some artificial delay...

Around 5 months ago we observed a very similar behavior but the requests were stuck even way longer. We worked around it by changing log level from DEBUG to ERROR. Reading the logs I even managed to asses that the problem must lie in some code here oxy/rr.go at master · vulcand/oxy · GitHub but I could not find anything obvious to nail the bug. It seemed like due to some kind of deadlock/timeout or a similar mechanism in either traefik or go itself, everything stops for some requests for 10s.

Did any of you observe any similar behavior in the past? Are there any known issues related to DEBUG log-level?

jakubhajek · March 11, 2021, 9:43am

Hey Robert,

Would it possible to upgrade Traefik to the latest version 2.4.7 and validate whether the issue still occurred?

robert.pankowecki.t · March 12, 2021, 10:43am

@jakubhajek We are in the process of doing this.

In the meantime, I think we found a potential candidate:

nginx - TCP connections between docker containers timeout after ~10000 connections - Server Fault
Logs stop consuming for container, blocking writes to stderr · Issue #6018 · docker/compose · GitHub

It seems that just too much logs causes these issues in docker-compose, and the problem is not specific to traefik. But it might be increased by using DEBUG log level which is quite verbose indeed.

robert.pankowecki.t · April 12, 2021, 7:13am

Upgrading docker-compose to version >= 1.28.5 fixed the issue for us.

system · April 15, 2021, 7:13am

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Slow response time from docker host #7941 Traefik v2 docker , metrics , tracing , tcp	1	2959	March 3, 2021
504 Timeouts and slow requests Traefik v2 docker , docker-swarm , middleware	1	1902	October 18, 2022
Huge bandwith/download performance issue Traefik v2 docker , docker-swarm	19	9186	February 1, 2022
Request duration discrepancy Traefik v2 docker , docker-swarm	5	457	April 12, 2024
Slow transfer speeds (when running in docker swarm cluster) Traefik v2 docker-swarm	4	1517	December 7, 2024

10s delays in forwarding requests

Related topics