Quick successive failovers can leave the master in a half-promoted state

Description

The acceptance test for this is test_replication[3] in system-tests, but unfortunately it reproduces the problem only intermittently.

What can happen is:

  • a node starts trying to follow the master

  • the master dies before the follow process is completed

  • that node is then elected the new master

  • a retry of the previous follow runs at the same time or after the promote, breaking that node's nginx config
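The sequence above can be illustrated with a toy model. This is only a sketch of the race and of one possible guard (an epoch counter that turns a stale follow retry into a no-op); it is not the actual Cloudify cluster code, and the `Node` class, its methods, and the `nginx_upstream` attribute are all hypothetical names invented for illustration.

```python
import threading


class Node:
    """Toy model of a cluster node (hypothetical, not Cloudify's code)."""

    def __init__(self):
        self._lock = threading.Lock()
        self.epoch = 0              # bumped on every role change request
        self.role = "replica"
        self.nginx_upstream = None  # stands in for the node's nginx config

    def begin_follow(self):
        """Register a follow request and return the epoch it belongs to."""
        with self._lock:
            self.epoch += 1
            return self.epoch

    def apply_follow(self, master, epoch):
        """Apply (or retry) a follow; skipped if a newer role change happened."""
        with self._lock:
            if epoch != self.epoch:
                return False  # stale retry: leave the config untouched
            self.role = "replica"
            self.nginx_upstream = master
            return True

    def promote(self):
        """Promote this node to master, invalidating any pending follow."""
        with self._lock:
            self.epoch += 1
            self.role = "master"
            self.nginx_upstream = "localhost"


node = Node()
follow_epoch = node.begin_follow()   # the node starts trying to follow the master
# ... the master dies before the follow completes ...
node.promote()                       # the node is elected the new master
applied = node.apply_follow("old-master", follow_epoch)  # late retry of the follow
print(node.role, node.nginx_upstream, applied)
```

Without the epoch check in `apply_follow`, the late retry would overwrite the promoted node's configuration, which corresponds to the half-promoted state described above.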

Relevant log (from "that node") - attached as "cloudify-cluster.log"

Status

Assignee

Łukasz Maksymczuk

Reporter

Łukasz Maksymczuk

Labels

Severity

None

Bug Type

None

Target Version

None

Sprint

None

Fix versions

Affects versions

4.2