Skip to content

Return 503 on overload #602

@josephburnett

Description

@josephburnett

Expected Behavior

When Elafros becomes overloaded with too many requests, it should immediately return a 503 Service Unavailable response.

Actual Behavior

There is no limit on the request allowed into the user code.

Steps to Reproduce the Problem

  1. Send a lot of requests at once.
  2. Notice latency blows up. Requests start timing out.

Additional Info

The first place to implement this is in the queue-proxy on the revision pods. We must

  1. enforce a maximum number of requests to be allow into the user code
  2. create a queue for any requests beyond that amount
  3. limit the queue to a maximum length and return 503's for any requests received beyond that limit

Metadata

Metadata

Assignees

No one assigned

    Labels

    area/autoscalekind/specDiscussion of how a feature should be exposed to customers.

    Type

    No type

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions