Do not trust pending peers by default

We should not blindly trust pending peers. This makes the cluster vulnerable during membership changes to node failures. The way Riak uses `riak_ensemble` largely avoids this issue in practice, but there are corner cases where it can still be an issue.

Pending peers should check the backend module for trust the same as normal peers do, and move to `sync` if necessary. As written, there are a few things that would need to be handled.

First, a pending peer can leave the `pending` state from either a `commit` or `prepare` message. A `commit` message moves the peer straight to `following`. A `prepare` moves to `prefollow`, and ultimately to `following` via a future `commit`. Regardless of the path, the peer should move to `sync` before `following` if it is untrusted. Perhaps the easiest approach is to make `following(init)` check that the peer is in-fact trusted, and bail out to `sync` if not. This shouldn't matter for normal peers where we already check this property (`trust == true`) in `maybe_follow`.

Second, the `check_quorum, count_quorum, etc` support API calls that are used for support, debugging, and to make deterministic tests currently rely upon `commit` messages to determine peer count. Untrusted peers should not count. As currently written, only trusted peers accept commit messages (since pending peers are considered trusted by default). Once we make pending peers accept commits before being trusted, this is incorrect. Therefore, we need to change the support API calls from using `commit` to using a new message that is similar except that untrusted peers nack instead of accept.

/cc basho/riak#536

---

FYI, this issue is mentioned in a source code TODO at [riak_ensemble_peer.erl#L324-L327](https://github.com/basho/riak_ensemble/blob/2.0.0beta1/src/riak_ensemble_peer.erl#L324-L327):

``` erlang
%% TODO: Trusting pending peers makes ensemble vulnerable to concurrent
%%       node failures during membership changes. Change to move to
%%       syncing state before moving to following.
{next_state, pending, State2#state{trust=true}};
```


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Do not trust pending peers by default #17

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Do not trust pending peers by default #17

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions