Notes on Designing Large Scale Systems

Key Challenges

Database Replication
- Creating multiple copies (replicas) of a database distributed across servers.
- Ensures high availability and uninterrupted service.
- Scales read capacity.

Leader-Follower Replication
- One database acts as the leader (master), others as followers (slaves).
- Write operations directed to the leader; changes propagated to followers.
- Read operations distributed across leader and followers to enhance capacity.
Leader-Leader Replication
- Multiple databases act as leaders, each can accept read and write operations.
- Potential for conflicts between leaders requires conflict resolution mechanisms.

Asynchronous Replication
- Changes propagated to replicas in the background.
- Faster but risks temporary data inconsistencies.
Synchronous Replication
- Changes committed to both leader and followers simultaneously.
- Guarantees consistency but may impact write performance.

Timestamp-Based Resolution
- Most recent update wins.
Last Write Wins (LWW)
- Most recent write regardless of timestamp wins.
Custom Conflict Resolution
- Application-specific rules for resolving conflicts.

Data can be sharded based on criteria like:
- Customer IDs: e.g., users 1-1000 in shard 1, 1001-2000 in shard 2.
- Geographical Region: e.g., US users in one shard.

Determine data assignment to shards.
- Range-Based Sharding: Data partitioned based on ranges of the shard key.
- Hash-Based Sharding: Uses a hash function applied to the shard key for even data distribution, but less efficient for range queries.

SQL Databases: Typically lack out-of-the-box sharding; require custom logic.
NoSQL Databases: Many (e.g., MongoDB) provide built-in sharding support.