
@tmakita (Contributor) commented Mar 16, 2017

When an LB backend is removed and the corresponding ipvs destination is
deleted, ipvs keeps dangling connections pointing to the destination if
those connections were established before the destination was removed.
Packets sent over such a connection are then dropped in the kernel,
because the connection no longer has a destination. This continues until
the connection expires (e.g. 60 seconds for the SYN_RECV (TCP initial)
state). In some cases this causes a TCP connection timeout, if the
connection is initiated between the container failure and the ipvs
destination deletion.

ipvs provides a parameter, "expire_nodest_conn", to reduce the downtime.
When the option is enabled, a stale connection expires immediately upon
receiving a packet on that connection.

Although this option is not suitable if flapping can happen, I think the
LB user should prevent flapping by waiting long enough before
determining that a container is unreachable.

Reference: https://www.kernel.org/doc/Documentation/networking/ipvs-sysctl.txt
Signed-off-by: Toshiaki Makita <makita.toshiaki@lab.ntt.co.jp>
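
For context, a minimal, self-contained Go sketch of what enabling this sysctl amounts to; the read-back verification at the end is purely illustrative and not part of the patch:

package main

import (
	"bytes"
	"io/ioutil"
	"log"
)

// Path of the ipvs sysctl discussed in this patch.
const expireNodestConn = "/proc/sys/net/ipv4/vs/expire_nodest_conn"

func main() {
	// Enable expire_nodest_conn so stale ipvs connections expire as soon
	// as a packet arrives for a removed destination.
	if err := ioutil.WriteFile(expireNodestConn, []byte("1\n"), 0644); err != nil {
		log.Fatalf("failed to write %s: %v", expireNodestConn, err)
	}

	// Illustrative read-back check, not part of the patch.
	val, err := ioutil.ReadFile(expireNodestConn)
	if err != nil {
		log.Fatalf("failed to read %s: %v", expireNodestConn, err)
	}
	if !bytes.Equal(bytes.TrimSpace(val), []byte("1")) {
		log.Fatalf("unexpected value for expire_nodest_conn: %q", val)
	}
}

(Requires Linux with ipvs loaded and root privileges to run.)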

@mavenugo (Contributor)

@tmakita thanks for the patch, and yes, I agree. The only flapping scenario I can think of is during HEALTHCHECK failures, but a HEALTHCHECK failure will also result in the endpoint being removed.
So, I think this is a safe fix.

service_linux.go (Outdated)
err = ioutil.WriteFile("/proc/sys/net/ipv4/vs/expire_nodest_conn", []byte{'1', '\n'}, 0644)
if err != nil {
	logrus.Errorf("Failed to write to /proc/sys/net/ipv4/vs/expire_nodest_conn: %v", err)
	os.Exit(8)
}
Contributor

This one should be os.Exit(9)

@tmakita (Contributor, Author)

Thanks for the feedback.
I couldn't find documentation for the meaning of the exit codes, but if they are assigned incrementally, it should be 10, shouldn't it? Since 9 is already used.

@aboch (Contributor) commented Mar 16, 2017

Thanks @tmakita

Small comment, otherwise LGTM

@tmakita force-pushed the reduce-lb-downtime branch from 1229132 to 9af59fd on March 17, 2017 01:35
@tmakita (Contributor, Author) commented Mar 17, 2017

Updated with exit code 10, presuming the codes are assigned incrementally.
Correct me if I misunderstood the meaning of the return value.
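
For clarity, the updated snippet would then read as follows (a sketch reconstructed from the review discussion above, not the exact diff):

err = ioutil.WriteFile("/proc/sys/net/ipv4/vs/expire_nodest_conn", []byte{'1', '\n'}, 0644)
if err != nil {
	logrus.Errorf("Failed to write to /proc/sys/net/ipv4/vs/expire_nodest_conn: %v", err)
	// Exit code bumped from 8 to 10, presuming codes are assigned incrementally.
	os.Exit(10)
}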

@aboch (Contributor) commented Mar 17, 2017

LGTM

@tmakita (Contributor, Author) commented Apr 11, 2017

@mavenugo @aboch
What is the status of this PR? Is anything left to do?

@mavenugo (Contributor)

@tmakita I think @sanimej wanted to test these changes in different scenarios to make sure they don't cause any regressions.

@tmakita (Contributor, Author) commented Apr 11, 2017

@mavenugo thx!

@thaJeztah (Member) commented Mar 12, 2020

Looks like this was taken care of by #2154, so let me close it, but feel free to comment if you think something is left to be done 👍

@thaJeztah closed this on Mar 12, 2020