-
Notifications
You must be signed in to change notification settings - Fork 886
[backport 17.06] Adding a recovery mechanism for a split gossip cluster #2169
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: Dani Louca <dani.louca@docker.com> (cherry picked from commit 744334d)
|
Still WIP the backport was not clean, just checked that compiles at the moment |
|
@dani-docker can you take a look too? I did the backport of your fix on 17.06, there were a bunch of conflicts so better to have another pair of eyes on this |
|
@dani-docker cannot backport that part here is why: meaning that in this code base the rejoin of a node that lose connectivity but do no change identity has to happen through the logic of |
dani-docker
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
|
To test this PR: The last step should fail without this PR, and pass with this PR. Another way to test is: The two netPeers should not match with this PR, and match with this PR. |
|
@fcrisciani I tested this PR, and nslookup still fails with this PR. |
|
@fcrisciani I now managed to confirm this patch fixes the original issue. |
|
LGTM (IANAM) |
|
Ping @abhi |
abhi
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Backport bugfix #2134
Signed-off-by: Dani Louca dani.louca@docker.com
(cherry picked from commit 744334d)