How to achieve better network isolation on Docker Swarm?

jerrac · March 11, 2024, 5:00pm

I've been experimenting with ways to try and get better isolation between stacks in my Docker Swarm Currently every service that needs Traefik to route traffic to it has to live on the same network. Thus, service Alpha can lookup dns and see service Gamma.

Not the end of the world, but I like defense in depth, and would like to not allow Alpha and Gamma know the other one exists.

I did find Isolating traffic between containers proxied by traefik which did not end up at a solution. (Kube is not in the cards.)

My current attempt is to have multiple instances of Traefik. A global instance that is accessible to the outside world, and then an instance per app stack. The idea being that only the Traefik instances will be able to see each other, and the app containers will not have to share the global Traefik network.

Currently that isn't working, so I thought I'd ask if anyone else has figured something out while I try and work around the problems...

So, anyone have any ideas they'd be willing to share?

kayson · March 11, 2024, 5:36pm

I asked the same question, and didn't get a great answer: Isolating traffic between containers proxied by traefik - #2 by daniel.tomcej

You definitely don't need multiple instances of traefik. You'd be much better off by just having a single network per service/stack/container. It's a giant pain to maintain, though, so I wrote this to solve your exact problem: GitHub - kaysond/trafficjam: A Docker firewall for your reverse proxy network

The idea is that you have all containers that traefik needs to talk to on a single docker network, then trafficjam will dynamically add iptables rules to prevent all the containers on that network from talking to each other (except traefik, which is whitelisted and can talk to all of them). It works on swarm too, and its a relatively simple bash script so it should be pretty secure.

bluepuma77 · March 11, 2024, 5:46pm

Wouldn’t you just need a different Docker network for every service, and Traefik attached to all of them?

Neuroforge has some interesting swarm related projects, like swarmgate.

jerrac · March 11, 2024, 5:51pm

I've been operating on the idea that you can only attach Traefik to one network, and then you attach your services to that network if you need Traefik to route traffic to them.

But a reread of Traefik Docker Documentation - Traefik makes me wonder if maybe you just don't need to set the main network setting at all, and Traefik will figure it out from your individual service labels.

Is that what you were thinking of?

bluepuma77 · March 11, 2024, 6:06pm

We have Traefik attached to 10+ own services, all in the same "proxy" Docker network.

Next security improvement will be a docker-socker-proxy, simple self-made, not an unknown 3 year old image from the Internet. Then dedicated non-root user and group. Traefik is the first point of contact and has in my view a high security risk.

If we want to separate networks, my first thought would be 10 Docker networks, one for each service, and attach Traefik to all of them. I assume the Traefik container would not route between the networks. And set docker.network for ever target service in the labels - but that’s probably not even required if the target service has no other networks (like one to DB).

jerrac · March 11, 2024, 9:08pm

So I did some testing. If you make sure that every stack's individual network is external and added to the Traefik stack/service, then Traefik doesn't need all the apps to share the same network like I thought. So, if you put anet and bnet on Traefik, and anet on stacka, and bnet on stackb, then Traefik will route traffic properly, but containers in stacka won't see containers in stackb.

If you don't put anet and bnet on the Traefik stack, it won't work. I tried.

The one thing I did find that is confusing me is that running Traefik outside of Docker doesn't seem to work. It is supposed to, right? I'm pretty sure it's a misconfiguration on my end, but it just isn't routing traffic from outside the Swarm to any of the containers in the stacks.

I'd like to run it outside of Docker so that it can pick up the new networks and stacks as they come in automatically instead of making me add them to the stack configuration and have to redeploy Traefik every time I add a stack to my Swarm.

jerrac · March 11, 2024, 9:15pm

FYI I took a look at trafficjam. It's an interesting idea, but my iptables skills aren't up to the task of evaluating it right now.

kayson · March 11, 2024, 10:47pm

Well the idea is that you can use it without knowing a thing about iptables but the rules are deliberately simple nonetheless: just basic drops/returns on subnets/ip's pulled from docker. Feel free to ask any questions if you want to dive in, either here or on github.

bluepuma77 · March 12, 2024, 6:56am

I am sure you can set Swarm services to expose their used port on the host with a random port (when running multiple).

Then you can create a script to inspect the stack/services to pull the IPs+ports and create a dynamic config file with routers and services, maybe even with labels (or env) for the URL, to make everything dynamic.

That should just not be done on a cloud VM with public IP - except if you can set a real firewall in front of it to block all ports, so no external party can directly access your internal services.

Never mind. That solves the issue of Traefik not being attached to individual Docker networks. But it does not improve isolation.

TommyEsteban · November 8, 2024, 7:26pm

I solved this scenario with the following procedure if anyone is interested.

create the overlay network for traefik, let's call it traefik-net
init the swarm and join the other nodes
create a dummy global service (--global flag) with normal traefik flags and attached to the traefik-net.
Create iptables rules in every worker node to avoid inter-container communication in the traefik-net namespace. In modern docker versions the network namespaces are located at /var/run/docker/netns. For example,

nsenter --net="$NETNS" -- iptables -I FORWARD -s 10.0.1.3/32 -j ACCEPT
nsenter --net="$NETNS" -- iptables -I FORWARD -d 10.0.1.3/32 -j ACCEPT
nsenter --net="$NETNS" -- iptables -I FORWARD 3 -s 10.0.1.0/24 -d 10.0.1.0/24 -j REJECT

NETNS is a variable containing the full path of the traefik-net namespace, for example /var/run/docker/netns/1-abcdefghij and abcdefghij is part of the traefik-net id.

The first and second iptable rules allow traffic from/to the traefik load balancer
The third rule prevent inter-container communication.

The limitation is that if no tasks (containers) are running in one node, the network namespace is destroyed with all iptables rules, but with the global dummy service running almost forever, the chances are very low. (I did use the traefik/whoami image). If you want something automatic to create the iptable rules then try the trafficjam project. (I did take this idea from there, thank you @kayson )

winter · January 14, 2025, 8:18pm

This is pretty much what I'm planning to move to. It's a bit annoying to have to manually add the traefik container to each of the new networks though.

Topic		Replies	Views
Isolating traffic between containers proxied by traefik Traefik v2 docker	2	1981	October 23, 2019
Deny access from docker networks Traefik v2 docker , docker-swarm	2	777	April 13, 2021
Docker Networking Traefik v1 docker	2	508	November 11, 2019
Can containers access each other through Traefik? Traefik v2 docker-swarm	0	536	February 14, 2020
Traefik setup design / is it possible to run traffic outside a docker swarm on a seperat docker host? Traefik v2 docker-swarm	2	406	August 23, 2022

How to achieve better network isolation on Docker Swarm?

Related topics