Expected Behavior
When Elafros becomes overloaded with too many requests, it should immediately return a 503 Service Unavailable response.
Actual Behavior
There is no limit on the number of requests allowed into the user code.
Steps to Reproduce the Problem
- Send a large burst of concurrent requests.
- Observe that latency climbs sharply and requests start timing out.
Additional Info
The first place to implement this is in the queue-proxy on the revision pods. We must:
- enforce a maximum number of requests allowed into the user code at once
- create a queue for any requests beyond that amount
- limit the queue to a maximum length and return 503s for any requests received beyond that limit
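
The three requirements above could be sketched roughly as follows. This is only an illustration, not the queue-proxy's actual code: the `limiter` type, its methods, the limits (10 active, 100 queued), and the `:8080` port are all hypothetical. It uses two buffered channels as semaphores, one bounding requests inside the user code and one bounding active plus queued requests together.

```go
package main

import (
	"log"
	"net/http"
)

// limiter is a hypothetical helper, not an existing Elafros type.
type limiter struct {
	pending chan struct{} // capacity maxActive+maxQueued: running or waiting
	active  chan struct{} // capacity maxActive: currently inside user code
}

func newLimiter(maxActive, maxQueued int) *limiter {
	return &limiter{
		pending: make(chan struct{}, maxActive+maxQueued),
		active:  make(chan struct{}, maxActive),
	}
}

// reserve claims a place in the system without blocking.
// It returns false when both the active slots and the queue are full.
func (l *limiter) reserve() bool {
	select {
	case l.pending <- struct{}{}:
		return true
	default:
		return false
	}
}

// unreserve gives back the place claimed by reserve.
func (l *limiter) unreserve() { <-l.pending }

// enter blocks until an active slot is free or done is closed.
// It returns true only if the slot was obtained.
func (l *limiter) enter(done <-chan struct{}) bool {
	select {
	case l.active <- struct{}{}:
		return true
	case <-done:
		return false
	}
}

// exit frees the active slot obtained by enter.
func (l *limiter) exit() { <-l.active }

// wrap applies the limits in front of next (standing in for the user code).
func (l *limiter) wrap(next http.Handler) http.Handler {
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		if !l.reserve() {
			// Queue is full: fail fast instead of adding latency.
			http.Error(w, "overloaded", http.StatusServiceUnavailable)
			return
		}
		defer l.unreserve()
		if !l.enter(r.Context().Done()) {
			return // request was cancelled while it waited in the queue
		}
		defer l.exit()
		next.ServeHTTP(w, r)
	})
}

func main() {
	userCode := http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		w.Write([]byte("ok\n"))
	})
	// Assumed limits: 10 concurrent requests, 100 queued.
	log.Fatal(http.ListenAndServe(":8080", newLimiter(10, 100).wrap(userCode)))
}
```

Sizing `pending` as the sum of both limits means a request holds one `pending` slot for its whole lifetime, so the number of waiting requests never exceeds `maxQueued` once all active slots are taken.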