I'm trying to load test a Go microservice using Locust.
We've been exploring Railway's horizontal scaling because we suspect a distributed approach might yield better results.
I tried a 4-replica setup with 2 vCPU and 2 GB RAM. The performance is definitely better than vertical scaling alone (as tested on 8 vCPU, 8 GB RAM).
However, based on the metrics, 2 of the 4 replicas don't seem to be doing any work at all. Their usage stays at 0 vCPU and only ~50 MB RAM, even under load.
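
To confirm whether requests are really reaching only some of the replicas, I'm thinking of tallying responses per replica. Below is a minimal sketch of that check, not something I've run against the real service. It assumes the Go service is changed to return a hypothetical `X-Replica-Id` response header (for example, filled from the container hostname); that header, the function names, and the URL are all placeholders of mine, not anything Railway or Locust provides.

```python
from collections import Counter
from urllib.request import urlopen


def tally_replicas(replica_ids):
    """Count how many responses each replica ID served."""
    return Counter(replica_ids)


def idle_replicas(counts, expected_ids):
    """Return the expected replica IDs that served no requests, sorted."""
    return sorted(r for r in expected_ids if counts.get(r, 0) == 0)


def sample(url, n=200, header="X-Replica-Id"):
    """Send n GET requests and collect the replica header from each response.

    Assumption: the service sets `header` itself. Responses without it
    are recorded as "unknown". urlopen opens a new connection per call,
    so no connection is reused between requests.
    """
    ids = []
    for _ in range(n):
        with urlopen(url) as resp:
            ids.append(resp.headers.get(header, "unknown"))
    return ids


# Example usage (hypothetical URL and replica names):
#   counts = tally_replicas(sample("https://my-service.example/health"))
#   print(counts)
#   print(idle_replicas(counts, ["replica-1", "replica-2", "replica-3", "replica-4"]))
```

If the tally only ever shows two distinct IDs, that would match what the metrics suggest.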