Skip to content

Router liveness probe failures #1504

@brenton

Description

@brenton

I hate filing issues like this (ie we don't yet know exactly how to reproduce the problem) but @sosiouxme, @sdodson, @thoraxe and I have hit an edge case where the router liveness check will fail which results in a tight loop of "destroy the router / launch the router".

Right now we mostly want to raise awareness of this problem to see if anyone is hitting it upstream. The code we're testing with is rarely ever more than a half a day old or so. I'm fairly certain we're dealing with an odd edge case. Any advice for debugging the problem the next time we see it would be appreciated. For starters:

  • How is the default router port 80 tcp liveness probe run? Does it run from the Master or the Node?
  • What ip does it hit? (I'm assuming it's using the published endpoints but I'm not sure where it's being run from).

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions