TLDR: What Happens When a Master or Replica Fails
Date: 2021-09-07 Source: https://arpitbhayani.me/blogs/handling-outages-master-replica
Overview
Explore the Master-Replica architecture: handling node outages, replica recovery, and master failover strategies for robust systems. Master-Replica is one of the most common high-level architectural patterns in distributed systems.
Key Points
- Discovering the crashed Master: The first step in recovering from a Master crash is detecting that the Master has crashed.
- Setting up the new Master: When the Master crashes, there are two common ways of setting up the new Master: manually or automatically.
- Announcing the new Master: Once the new Master is set up, whether promoted manually or elected from among the Replicas, its identity must be conveyed to the end clients connecting to the Master.
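
The three steps above (discover, set up, announce) can be sketched as a small automated failover routine. This is only an illustration under stated assumptions, not the approach of any particular database. The heartbeat timeout used for detection, the rule of promoting the live Replica with the highest replicated offset, and the `Registry` that clients consult are all hypothetical choices made for this example.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    last_heartbeat: float       # time of the most recent heartbeat, in seconds
    replicated_offset: int = 0  # how much of the Master's log this node has applied


def is_crashed(node: Node, now: float, timeout: float = 5.0) -> bool:
    """Step 1 (assumed mechanism): a node is treated as crashed once its
    heartbeat is older than `timeout` seconds."""
    return now - node.last_heartbeat > timeout


def elect_new_master(replicas, now: float, timeout: float = 5.0):
    """Step 2 (assumed rule): promote the live Replica that has applied the
    most data, so the least is lost. Returns None if no Replica is live."""
    live = [r for r in replicas if not is_crashed(r, now, timeout)]
    if not live:
        return None
    return max(live, key=lambda r: r.replicated_offset)


class Registry:
    """Hypothetical stand-in for whatever clients consult to find the Master
    (for example a DNS record or a shared configuration entry)."""

    def __init__(self, master_name: str):
        self.master_name = master_name

    def announce(self, new_master_name: str) -> None:
        # Step 3: publish the new Master so clients reconnect to it.
        self.master_name = new_master_name


def failover(registry: Registry, master: Node, replicas, now: float,
             timeout: float = 5.0) -> Node:
    """Run detect -> elect -> announce; return the node acting as Master."""
    if not is_crashed(master, now, timeout):
        return master  # Master is healthy, nothing to do
    new_master = elect_new_master(replicas, now, timeout)
    if new_master is None:
        raise RuntimeError("no live replica available for promotion")
    registry.announce(new_master.name)
    return new_master
```

In the manual variant, an operator would pick the Replica to promote and call something like `registry.announce(...)` by hand. The automated variant differs only in that the choice is made by a rule such as `elect_new_master`.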